CHLAMYDIA PNEUMONIAE GENOME SEQUENCE

CROSS-REFERENCES TO RELATED APPLICATIONS 
The present application is related to 60/128,606, filed April 8, 1999, and
60/108,279, filed November 12, 1998, which are incorporated herein by reference.

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 



FIELD OF THE INVENTION

This invention relates to nucleic acids and polypeptides from Chlamydia 
pneumoniae and to their use in the diagnosis, prevention and treatment of diseases 
associated with C. pneumoniae.

BACKGROUND OF THE INVENTION

Chlamydiaceae is a family of obligate intracellular parasites with a tropism
for epithelial cells lining the mucous membranes. The bacteria have two morphologically
distinct forms, the "elementary body" and the "reticulate body". The elementary body is
the infectious form, and has a rigid cell wall, primarily of cross-linked outer membrane
proteins. The reticulate body is the intracellular, metabolically active form. A unique
developmental cycle between these two forms characterizes Chlamydia growth.

C. pneumoniae is a human respiratory pathogen that causes acute respiratory
disease and accounts for approximately 10% of community-acquired pneumonia.
Antibody prevalence studies have shown that virtually everyone is infected with
C. pneumoniae at some time, and that reinfection is common. In addition to respiratory
disease, studies have shown an association of this organism with coronary artery disease.
The organism has been demonstrated in atherosclerotic lesions of the aorta and coronary
arteries by immunocytochemistry and by polymerase chain reaction (Kuo et al. (1993)
J Infect Dis 167(4):841-849).

Recent reports have further demonstrated the presence of C. pneumoniae
in the walls of abdominal aortic aneurysms (Juvonen et al. (1997) J Vasc Surg
25(3):499-505). Abdominal aortic aneurysms are frequently associated with
atherosclerosis, and inflammation may be an important factor in aneurysmal dilatation.
C. pneumoniae may play a role in maintaining inflammation and triggering the
development of aortic aneurysms.

Muhlestein et al. (1996) JACC 27:1555-61 reported a differential
incidence of Chlamydia species within the coronary artery wall of patients with
atherosclerosis versus those with other forms of cardiovascular disease. The extremely
high rate of possible infection in patients with symptomatic atherosclerotic disease,
compared to the very low rate in patients with normal coronary arteries or coronary artery
disease from chronic transplant rejection, provides evidence for a direct link between the
atherosclerotic process and Chlamydia infection. Because a history of chlamydial
infection is so prevalent in the population, the issue of causality remains open. On a
physiologic and pathologic level, abnormal interactions among endothelial cells, platelets,
macrophages and lymphocytes may lead to a cascade of events resulting in acute
endothelial damage, thrombosis and repair, chronically leading to the development of
atheroma in blood vessels.

C. pneumoniae is related to other Chlamydia species, but the level of
sequence similarity is relatively low. Very little is known about the biology of this
organism, although it appears to be an important human pathogen. Allelic diversity and
structural relationships between specific genes of Chlamydial species are described in
Kaltenboeck et al. (1993) J Bacteriol 175(2):487-502; Gaydos et al. (1992) Infect Immun
60(12):5319-5323; Everett et al. (1997) Int J Syst Bacteriol 47(2):461-473; and
Pudjiatmoko et al. (1997) Int J Syst Bacteriol 47(2):425-431.

A number of studies have been published describing methods for detection
of C. pneumoniae, and for distinguishing between Chlamydial species. Such methods
include PCR detection (Rasmussen et al. (1992) Mol Cell Probes 6(5):389-394; Holland
et al. (1990) J Infect Dis 162(4):984-987); a simplified polymerase chain reaction-enzyme
immunoassay (Wilson et al. (1996) J Appl Bacteriol 80(4):431-438); and sequence
determination and restriction endonuclease cleavage (Herrmann et al. (1996) J Clin
Microbiol 34(8):1897-1902).

Antigenic and molecular analyses of different C. pneumoniae strains are
described in Jantos et al. (1997) J Clin Microbiol 35(3):620-623. Some genes of
C. pneumoniae have been isolated and sequenced. These include the GroE operon (Kikuta
et al. (1991) Infect Immun 59(12):4665-4669); the major outer membrane protein (Perez
et al. (1991) Infect Immun 59(6):2195-2199); the DnaK protein homolog (Kornak et al.
(1991) Infect Immun 59(2):721-725); as well as a number of ribosomal and other genes.


SUMMARY OF THE INVENTION 
This invention provides the genomic sequence of Chlamydia pneumoniae. 
The sequence information is useful for a variety of diagnostic and analytical methods. 
The genomic sequence may be embodied in a variety of media, including computer-readable
forms, or as a nucleic acid comprising a selected fragment of the sequence.

Such fragments generally consist of an open reading frame, transcriptional or translational 
control elements, or fragments derived therefrom. Proteins encoded by the open reading 
frames are useful for diagnostic purposes, as well as for their enzymatic or structural 
activity. 


DEFINITIONS 

The term "amino acid" refers to naturally occurring and synthetic amino
acids, as well as amino acid analogs and amino acid mimetics that function in a manner
similar to the naturally occurring amino acids. Naturally occurring amino acids are those
encoded by the genetic code, as well as those amino acids that are later modified, e.g.,
hydroxyproline, γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to
compounds that have the same basic chemical structure as a naturally occurring amino
acid, i.e., an α carbon that is bound to a hydrogen, a carboxyl group, an amino group, and
an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl
sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide
backbones, but retain the same basic chemical structure as a naturally occurring amino
acid. Amino acid mimetics refers to chemical compounds that have a structure that is
different from the general chemical structure of an amino acid, but that function in a
manner similar to a naturally occurring amino acid.

Amino acids may be referred to herein by either their commonly known
three-letter symbols or by the one-letter symbols recommended by the IUPAC-IUB
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by 
their commonly accepted single-letter codes. 

"Amplification" primers are oligonucleotides comprising either natural or
analogue nucleotides that can serve as the basis for the amplification of a select nucleic
acid sequence. They include, e.g., polymerase chain reaction primers and ligase chain
reaction oligonucleotides.

"Antibody" refers to an immunoglobulin molecule able to bind to a
specific epitope on an antigen. Antibodies can be a polyclonal mixture or monoclonal.
Antibodies can be intact immunoglobulins derived from natural sources or from
recombinant sources and can be immunoreactive portions of intact immunoglobulins.
Antibodies may exist in a variety of forms including, for example, Fv, Fab, and F(ab)2, as
well as in single chains. Single-chain antibodies, in which genes for a heavy chain and a
light chain are combined into a single coding sequence, may also be used.

An "antigen" is a molecule that is recognized and bound by an antibody,
e.g., peptides, carbohydrates, organic molecules, or more complex molecules such as
glycolipids and glycoproteins. The part of the antigen that is the target of antibody
binding is an antigenic determinant, and a small functional group that corresponds to a
single antigenic determinant is called a hapten.

"Biological sample" refers to any sample obtained from a living or dead 
organism. Examples of biological samples include biological fluids and tissue specimens. 
Such biological samples can be prepared for analysis of the presence of C. pneumoniae
nucleic acids, proteins, or antibodies specifically reactive with the proteins.

The term "C. pneumoniae gene" shall be intended to mean the open
reading frame encoding specific C. pneumoniae polypeptides, as well as adjacent 5' and
3' non-coding nucleotide sequences involved in the regulation of expression, up to about
2 kb beyond the coding region, but possibly further in either direction. The gene may be
introduced into an appropriate vector for extrachromosomal maintenance or for
integration into a host genome.

"Conservatively modified variants" applies to both amino acid and nucleic
acid sequences. With respect to particular nucleic acid sequences, conservatively
modified variants refers to those nucleic acids which encode identical or essentially
identical amino acid sequences, or where the nucleic acid does not encode an amino acid
sequence, to essentially identical sequences. Specifically, degenerate codon substitutions
may be achieved by generating sequences in which the third position of one or more
selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues
(Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem.
260:2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Because of the
degeneracy of the genetic code, a large number of functionally identical nucleic acids
encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all
encode the amino acid alanine. Thus, at every position where an alanine is specified by a
codon, the codon can be altered to any of the corresponding codons described without
altering the encoded polypeptide. Such nucleic acid variations are "silent variations,"
which are one species of conservatively modified variations. Every nucleic acid sequence
herein which encodes a polypeptide also describes every possible silent variation of the
nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG,
which is ordinarily the only codon for methionine, and UGG, which is ordinarily the only
codon for tryptophan) can be modified to yield a functionally identical molecule.
Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is
implicit in each described sequence.
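
The degeneracy described above can be illustrated with a short sketch. It assumes the standard genetic code written with RNA bases; the function names synonymous_codons and count_silent_variants are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code laid out in the conventional U, C, A, G order
# ("*" marks a stop codon). Assumption: the standard code applies.
BASES = "UCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(codon):
    """Return, sorted, every codon encoding the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def count_silent_variants(rna):
    """Count coding sequences that encode the same polypeptide as `rna`.

    Only complete codons are considered; the count includes `rna` itself.
    """
    total = 1
    for i in range(0, len(rna) - len(rna) % 3, 3):
        total *= len(synonymous_codons(rna[i:i + 3]))
    return total


print(synonymous_codons("GCA"))            # the four alanine codons
print(count_silent_variants("AUGGCAUGG"))  # Met-Ala-Trp
```

In this sketch only the alanine codon of Met-Ala-Trp can vary, so the count equals the number of alanine codons.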

As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein
sequence which alter, add or delete a single amino acid or a small percentage of amino
acids in the encoded sequence are "conservatively modified variants" where the alteration
results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well
known in the art. Such conservatively modified variants are in addition to and do not
exclude polymorphic variants, interspecies homologs, and alleles of the invention.

The following groups each contain amino acids that are conservative
substitutions for one another:

1) Alanine (A), Glycine (G);
2) Serine (S), Threonine (T);
3) Aspartic acid (D), Glutamic acid (E);
4) Asparagine (N), Glutamine (Q);
5) Cysteine (C), Methionine (M);
6) Arginine (R), Lysine (K), Histidine (H);
7) Isoleucine (I), Leucine (L), Valine (V); and
8) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).

(see, e.g., Creighton, Proteins (1984)).
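
The eight groups listed above map naturally onto a small lookup. The following is a minimal sketch, using one-letter amino acid codes; the names CONSERVATIVE_GROUPS and is_conservative are hypothetical.

```python
# The eight conservative substitution groups, by one-letter code.
CONSERVATIVE_GROUPS = [
    set("AG"),   # 1) Alanine, Glycine
    set("ST"),   # 2) Serine, Threonine
    set("DE"),   # 3) Aspartic acid, Glutamic acid
    set("NQ"),   # 4) Asparagine, Glutamine
    set("CM"),   # 5) Cysteine, Methionine
    set("RKH"),  # 6) Arginine, Lysine, Histidine
    set("ILV"),  # 7) Isoleucine, Leucine, Valine
    set("FYW"),  # 8) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative(a, b):
    """Return True if residues a and b differ but share a group."""
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("D", "E"))  # both in group 3
print(is_conservative("A", "V"))  # groups 1 and 7
```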
The terms "identical" or percent "identity," in the context of two or more
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences
that are the same or have a specified percentage of amino acid residues or nucleotides that
are the same, when compared and aligned for maximum correspondence over a
comparison window, as measured using one of the following sequence comparison
algorithms or by manual alignment and visual inspection. This definition also refers to
the complement of a test sequence, which has a designated percent sequence or
subsequence complementarity when the test sequence has a designated or substantial
identity to a reference sequence. For example, a designated amino acid percent identity
of 95% refers to sequences or subsequences that have at least about 95% amino acid
identity when aligned for maximum correspondence over a comparison window as
measured using one of the following sequence comparison algorithms or by manual
alignment and visual inspection. Such sequences would then be said to have substantial
identity, or to be substantially identical to each other. Preferably, sequences have at least
about 70% identity, more preferably 80% identity, more preferably 90-95% identity and
above. Preferably, the percent identity exists over a region of the sequence that is at least
about 25 amino acids in length, more preferably over a region that is 50-100 amino acids
in length.

When percentage of sequence identity is used in reference to proteins or
peptides, it is recognized that residue positions that are not identical often differ by
conservative amino acid substitutions, where amino acid residues are substituted for
other amino acid residues with similar chemical properties (e.g., charge or
hydrophobicity) and therefore do not change the functional properties of the molecule.
Where sequences differ in conservative substitutions, the percent sequence identity may
be adjusted upwards to correct for the conservative nature of the substitution. Means for
making this adjustment are well known to those of skill in the art. Typically this involves
scoring a conservative substitution as a partial rather than a full mismatch, thereby
increasing the percentage sequence identity. Thus, for example, where an identical amino
acid is given a score of 1 and a non-conservative substitution is given a score of zero, a
conservative substitution is given a score between zero and 1. The scoring of
conservative substitutions is calculated according to, e.g., the algorithm of Meyers &
Miller, Computer Applic. Biol. Sci. 4:11-17 (1988), e.g., as implemented in the program
PC/GENE (Intelligenetics, Mountain View, California, USA).
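
The partial-credit scoring described above can be sketched as follows. This is not the Meyers & Miller algorithm; it is a simplified illustration that assumes the two sequences are already aligned, gap-free and of equal length, and that a conservative substitution scores 0.5 (an arbitrary value between zero and 1). The names GROUPS and adjusted_identity are hypothetical.

```python
# Conservative substitution groups, by one-letter amino acid code.
GROUPS = ["AG", "ST", "DE", "NQ", "CM", "RKH", "ILV", "FYW"]


def adjusted_identity(seq1, seq2, partial=0.5):
    """Percent identity with partial credit for conservative substitutions.

    Identical residues score 1, conservative substitutions score `partial`
    (assumed value), and all other substitutions score 0.
    """
    if len(seq1) != len(seq2) or not seq1:
        raise ValueError("sequences must be aligned, non-empty, equal length")
    score = 0.0
    for a, b in zip(seq1, seq2):
        if a == b:
            score += 1.0
        elif any(a in g and b in g for g in GROUPS):
            score += partial
    return 100.0 * score / len(seq1)


# Two identities (A, K) and two conservative substitutions (D/E, L/V).
print(adjusted_identity("ADKL", "AEKV"))             # adjusted upwards
print(adjusted_identity("ADKL", "AEKV", partial=0))  # unadjusted identity
```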

For sequence comparison, typically one sequence acts as a reference
sequence, to which test sequences are compared. When using a sequence comparison
algorithm, test and reference sequences are entered into a computer, subsequence
coordinates are designated, if necessary, and sequence algorithm program parameters are
designated. Default program parameters can be used, or alternative parameters can be
designated. The sequence comparison algorithm then calculates the percent sequence
identity for the test sequence(s) relative to the reference sequence, based on the
designated or default program parameters.

A comparison window includes reference to a segment of any one of the
number of contiguous positions selected from the group consisting of from 25 to 600,
usually about 50 to about 200, more usually about 100 to about 150, in which a sequence
may be compared to a reference sequence of the same number of contiguous positions
after the two sequences are optimally aligned. Methods of alignment of sequences for
comparison are well-known in the art. Optimal alignment of sequences for comparison
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv.
Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman &
Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson &
Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations
of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics
Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by
manual alignment and visual inspection (see, e.g., Ausubel et al., supra).
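
As an illustration of the local homology approach, the sketch below computes a Smith-Waterman style local alignment score by dynamic programming. The scoring values (match 2, mismatch -1, linear gap -2) and the function name smith_waterman_score are assumptions chosen for the example and are not taken from the cited programs.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Uses a linear gap penalty and keeps only two rows of the score matrix.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


print(smith_waterman_score("TTACGTTT", "GGACGGG"))  # shared "ACG" segment
```

Recovering the aligned segments themselves would require keeping the full matrix and tracing back from the best-scoring cell, which is omitted here for brevity.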

One example of a useful algorithm is PILEUP. PILEUP creates a multiple
sequence alignment from a group of related sequences using progressive, pairwise
alignments to show relationship and percent sequence identity. It also plots a tree or
dendrogram showing the clustering relationships used to create the alignment. PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol.
Evol. 35:351-360 (1987). The method used is similar to the method described by Higgins
& Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences, each
of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment
procedure begins with the pairwise alignment of the two most similar sequences,
producing a cluster of two aligned sequences. This cluster is then aligned to the next
most related sequence or cluster of aligned sequences. Two clusters of sequences are
aligned by a simple extension of the pairwise alignment of two individual sequences. The
final alignment is achieved by a series of progressive, pairwise alignments. The program
is run by designating specific sequences and their amino acid or nucleotide coordinates
for regions of sequence comparison and by designating the program parameters. Using
PILEUP, a reference sequence is compared to other test sequences to determine the
percent sequence identity relationship using the following parameters: default gap weight
(3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained
from the GCG sequence analysis software package, e.g., version 7.0 (Devereaux et al.,
Nuc. Acids Res. 12:387-395 (1984)).

Another example of an algorithm that is suitable for determining percent
sequence identity (i.e., substantial similarity or identity) is the BLAST algorithm, which
is described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software for performing
BLAST analyses is publicly available through the National Center for Biotechnology
Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying
high scoring sequence pairs (HSPs) by identifying short words of length W in the query
sequence, which either match or satisfy some positive-valued threshold score T when
aligned with a word of the same length in a database sequence. T is referred to as the
neighborhood word score threshold (Altschul et al., supra). These initial neighborhood
word hits act as seeds for initiating searches to find longer HSPs containing them. The
word hits are then extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, for
nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the
word hits in each direction is halted when: the cumulative alignment score falls off by
the quantity X from its maximum achieved value; the cumulative score goes to zero or
below, due to the accumulation of one or more negative-scoring residue alignments; or
the end of either sequence is reached. The BLAST algorithm parameters W, T, and X
determine the sensitivity and speed of the alignment. The BLASTN program (for
nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10,
M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP
program uses as default parameters a wordlength (W) of 3, an expectation (E) of 10, and
the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA
89:10915 (1989)).
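
The seeding and extension steps described above can be sketched for nucleotide sequences as follows. This is a simplified illustration rather than the BLAST implementation: it uses exact word matches only (no threshold T), extends to the right only, and assumes a reward of 5, a penalty of -4 and a drop-off X of 20. The function names find_word_hits and extend_right are hypothetical.

```python
def find_word_hits(query, subject, w=11):
    """Yield (query_pos, subject_pos) for every exact shared word of length w."""
    index = {}
    for j in range(len(subject) - w + 1):
        index.setdefault(subject[j:j + w], []).append(j)
    for i in range(len(query) - w + 1):
        for j in index.get(query[i:i + w], []):
            yield i, j


def extend_right(query, subject, qi, sj, m=5, n=-4, x=20):
    """Extend an ungapped alignment rightwards from (qi, sj).

    Stops when the score drops by x from its maximum, when the score
    reaches zero or below, or when either sequence ends.
    Returns (best_score, length_at_best_score).
    """
    score = best = best_len = 0
    k = 0
    while qi + k < len(query) and sj + k < len(subject):
        score += m if query[qi + k] == subject[sj + k] else n
        k += 1
        if score > best:
            best, best_len = score, k
        elif best - score >= x or score <= 0:
            break
    return best, best_len


query = "ACGTACGTACGTTTTT"
subject = "GGACGTACGTACGAAAA"
for qi, sj in find_word_hits(query, subject):
    print(qi, sj, extend_right(query, subject, qi, sj))
```

A fuller version would also extend leftwards from each hit and combine the two scores into a single HSP.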

The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA
90:5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is
the smallest sum probability (P(N)), which provides an indication of the probability by
which a match between two nucleotide or amino acid sequences would occur by chance.
For example, a nucleic acid is considered similar to a reference sequence if the smallest
sum probability in a comparison of the test nucleic acid to the reference nucleic acid is
less than about 0.1, more preferably less than about 0.01, and most preferably less than
about 0.001.

An indication that two nucleic acid sequences or polypeptides are
substantially identical is that the polypeptide encoded by the first nucleic acid is
immunologically cross reactive with the antibodies raised against the polypeptide
encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically
substantially identical to a second polypeptide, for example, where the two peptides differ
only by conservative substitutions. Another indication that two nucleic acid sequences
are substantially identical is that the two molecules or their complements hybridize to
each other under stringent conditions, as described below.

Another indication that polynucleotide sequences are substantially
identical is if two molecules hybridize to each other under stringent conditions. Stringent
conditions are sequence dependent and will be different in different circumstances.
Generally, stringent conditions are selected to be about 5°C lower than the thermal
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm
is the temperature (under defined ionic strength and pH) at which 50% of the target
sequence hybridizes to a perfectly matched probe. Typically stringent conditions for a
Southern blot protocol involve hybridizing in a buffer comprising 5x SSC, 1% SDS at
65°C or hybridizing in a buffer containing 5x SSC and 1% SDS at 42°C and washing at
65°C with a 0.2x SSC, 0.1% SDS wash.
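
The relationship between Tm and a stringent temperature can be illustrated with a rough calculation. The text does not specify how Tm is determined; the sketch below assumes the Wallace rule of thumb for short oligonucleotides (2°C per A or T, 4°C per G or C), which ignores ionic strength and pH, and then applies the 5°C offset stated above. The function names wallace_tm and stringent_temperature are hypothetical.

```python
def wallace_tm(oligo):
    """Rough Tm in degrees C for a short oligonucleotide (assumed Wallace rule)."""
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    if at + gc != len(s):
        raise ValueError("oligo must contain only A, C, G and T")
    return 2 * at + 4 * gc


def stringent_temperature(tm, offset=5.0):
    """Temperature about `offset` degrees C below the melting point."""
    return tm - offset


probe = "ACGTACGTACGTACGT"  # illustrative 16-mer, not from the sequence listing
tm = wallace_tm(probe)
print(tm, stringent_temperature(tm))
```

For long probes such as those used in Southern blots, empirically determined conditions like those given above would normally take precedence over this estimate.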

A "label" is a composition detectable by spectroscopic, photochemical,
biochemical, immunochemical, or chemical means. For example, useful labels include
32P, fluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an
ELISA), biotin, digoxigenin, or haptens and proteins for which antisera or monoclonal
antibodies are available.



The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides
and polymers thereof in either single- or double-stranded form. The term encompasses
nucleic acids containing known nucleotide analogs or modified backbone residues or
linkages, which are synthetic, naturally occurring, and non-naturally occurring, which
have similar binding properties as the reference nucleic acid, and which are metabolized
in a manner similar to the reference nucleotides. Examples of such analogs include,
without limitation, phosphorothioates, phosphoramidates, methyl phosphonates,
chiral-methyl phosphonates, 2-O-methyl ribonucleotides, and peptide-nucleic acids (PNAs).

Unless otherwise indicated, a particular nucleic acid sequence also
implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon
substitutions) and complementary sequences, as well as the sequence explicitly indicated.
The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide,
and polynucleotide.

As used herein, a "nucleic acid probe or oligonucleotide" is defined as a
nucleic acid capable of binding to a target nucleic acid of complementary sequence
through one or more types of chemical bonds, usually through complementary base
pairing, usually through hydrogen bond formation. As used herein, a probe may include
natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In
addition, the bases in a probe may be joined by a linkage other than a phosphodiester
bond, so long as it does not interfere with hybridization. Thus, for example, probes may
be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather
than phosphodiester linkages. It will be understood by one of skill in the art that probes
may bind target sequences lacking complete complementarity with the probe sequence
depending upon the stringency of the hybridization conditions. The probes are preferably
directly labeled, as with isotopes, chromophores, lumiphores, or chromogens, or indirectly
labeled, such as with biotin to which a streptavidin complex may later bind. By assaying
for the presence or absence of the probe, one can detect the presence or absence of the
select sequence or subsequence.

A labeled nucleic acid probe or oligonucleotide is one that is bound, either
covalently, through a linker, or through ionic, van der Waals or hydrogen bonds, to a label
such that the presence of the probe may be detected by detecting the presence of the label
bound to the probe.

"Pharmaceutically acceptable" means a material that is not biologically or
otherwise undesirable, i.e., the material can be administered to an individual along with a
Chlamydia antigen without causing any undesirable biological effects or interacting in a
deleterious manner with any of the other components of the pharmaceutical composition.

The terms "polypeptide," "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino acid
polymers in which one or more amino acid residue is an analog or mimetic of a
corresponding naturally occurring amino acid, as well as to naturally occurring amino
acid polymers.

The phrase "specifically or selectively hybridizing to" refers to
hybridization between a probe and a target sequence in which the probe binds
substantially only to the target sequence, forming a hybridization complex, when the
target is in a heterogeneous mixture of polynucleotides and other compounds. Such
hybridization is determinative of the presence of the target sequence. Although the probe
may bind other unrelated sequences, at least 90%, preferably 95% or more of the
hybridization complexes formed are with the target sequence.

The term "recombinant" when used with reference to a cell, or nucleic
acid, or vector, indicates that the cell, or nucleic acid, or vector, has been modified by the
introduction of a heterologous nucleic acid or the alteration of a native nucleic acid, or
that the cell is derived from a cell so modified. Thus, for example, recombinant cells
express genes that are not found within the native (non-recombinant) form of the cell or
express native genes that are otherwise abnormally expressed, underexpressed or not
expressed at all.

The phrase "specifically immunoreactive with", when referring to a protein
or peptide, refers to a binding reaction between the protein and an antibody which is
determinative of the presence of the protein in the presence of a heterogeneous population
of proteins and other compounds. Thus, under designated immunoassay conditions, the
specified antibodies bind to a particular protein and do not bind in a significant amount to
other proteins present in the sample. Specific binding to an antibody under such
conditions may require an antibody that is selected for its specificity for a particular
protein. A variety of immunoassay formats may be used to select antibodies specifically
immunoreactive with a particular protein and are described in detail below.

The phrase "substantially pure" or "isolated" when referring to a
Chlamydia peptide or protein, means a chemical composition which is free of other
subcellular components of the Chlamydia organism. Typically, a monomeric protein is
substantially pure when at least about 85% or more of a sample exhibits a single
polypeptide backbone. Minor variants or chemical modifications may typically share the
same polypeptide sequence. Depending on the purification procedure, purities of 85%,
and preferably over 95%, are possible. Protein purity or homogeneity may be
indicated by a number of means well known in the art, such as polyacrylamide gel
electrophoresis of a protein sample, followed by visualizing a single polypeptide band on
a polyacrylamide gel upon silver staining. For certain purposes high resolution will be
needed and HPLC or a similar means for purification utilized.

DETAILED DESCRIPTION 
The present invention provides the nucleotide sequence of the C.
pneumoniae genome, SEQ ID NO: 1, or a representative fragment thereof, in a form which
can be readily used, analyzed, and interpreted by a skilled artisan. As used herein, a
"representative fragment" of the nucleotide sequence depicted in SEQ ID NO: 1 refers to
any portion which is not presently represented within a publicly available database.
Preferred representative fragments of the present invention are open reading frames,
expression modulating fragments, uptake modulating fragments, and fragments which can
be used to diagnose the presence of C. pneumoniae in a sample. Using the information
provided in the present application, together with routine cloning and sequencing
methods, one of ordinary skill in the art will be able to clone and sequence all
"representative fragments" of interest including open reading frames (ORFs) encoding a
large variety of C. pneumoniae proteins. A non-limiting identification of such preferred
representative fragments is provided in Tables 2 and 3.

Diagnostic use of C. pneumoniae nucleic acids

Hybridization-based assays

Using the nucleic acids disclosed here, one of skill can design nucleic acid
hybridization-based assays for the detection of C. pneumoniae. Any of a number of well
known techniques for the specific detection of target nucleic acids can be used.
Exemplary hybridization-based assays include, but are not limited to, traditional "direct
probe" methods such as Southern Blots, dot blots, in situ hybridization (e.g., FISH), PCR,
and the like. The methods can be used in a wide variety of formats including, but not
limited to, substrate- (e.g., membrane or glass) bound methods or array-based approaches
as described below. As noted above, this invention also embraces methods for detecting
the presence of Chlamydia DNA or RNA in biological samples. These sequences can be
used to detect Chlamydia in biological samples from patients suspected of being infected.
A variety of methods of specific DNA and RNA measurement using nucleic acid
hybridization techniques are known to those of skill in the art (see Sambrook et al.,
supra).

In situ hybridization assays are well known (e.g., Angerer (1987) Meth.
Enzymol. 152:649). Generally, in situ hybridization comprises the following major steps:
(1) fixation of tissue or biological structure to be analyzed; (2) prehybridization treatment
of the biological structure to increase accessibility of target DNA, and to reduce
nonspecific binding; (3) hybridization of the mixture of nucleic acids to the nucleic acid in
the biological structure or tissue; (4) post-hybridization washes to remove nucleic acid
fragments not bound in the hybridization; and (5) detection of the hybridized nucleic acid
fragments. The reagent used in each of these steps and the conditions for use vary
depending on the particular application.

In a typical in situ hybridization assay, cells are fixed to a solid support,
typically a glass slide. If a nucleic acid is to be probed, the cells are typically denatured
with heat or alkali. The cells are then contacted with a hybridization solution at a
moderate temperature to permit annealing of labeled probes specific to the nucleic acid
sequence encoding the protein. The targets (e.g., cells) are then typically washed at a
predetermined stringency or at an increasing stringency until an appropriate
signal-to-noise ratio is obtained.

The nucleic acids of this invention are particularly well suited to array-based
hybridization formats. Arrays are a multiplicity of different "probe" or "target"
nucleic acids (or other compounds) attached to one or more surfaces (e.g., solid,
membrane, or gel). In a preferred embodiment, the multiplicity of nucleic acids (or other
moieties) is attached to a single contiguous surface or to a multiplicity of surfaces
juxtaposed to each other.

In an array format a large number of different hybridization reactions can
be run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation
of a number of hybridizations in a single "experiment". Methods of performing
hybridization reactions in array-based formats are well known to those of skill in the art
(see, e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature
Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958).

Arrays, particularly nucleic acid arrays, can be produced according to a
wide variety of methods well known to those of skill in the art. For example, in a simple
embodiment, "low density" arrays can simply be produced by spotting (e.g., by hand using
a pipette) different nucleic acids at different locations on a solid support (e.g., a glass
surface, a membrane, etc.).

This simple spotting approach has been automated to produce high
density spotted arrays (see, e.g., U.S. Patent No. 5,807,522). This patent describes the
use of an automated system that taps a microcapillary against a surface to deposit a small
volume of a biological sample. The process is repeated to generate high density arrays.
Arrays can also be produced using oligonucleotide synthesis technology. Thus, for
example, U.S. Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and
92/10092 teach the use of light-directed combinatorial synthesis of high density
oligonucleotide arrays.

Many methods for immobilizing nucleic acids on a variety of solid
surfaces are known in the art. A wide variety of organic and inorganic polymers, as well
as other materials, both natural and synthetic, can be employed as the material for the
solid surface. Illustrative solid surfaces include, e.g., nitrocellulose, nylon, glass, quartz,
diazotized membranes (paper or nylon), silicones, polyformaldehyde, cellulose, and
cellulose acetate. In addition, plastics such as polyethylene, polypropylene, polystyrene,
and the like can be used. Other materials which may be employed include paper,
ceramics, metals, metalloids, semiconductive materials, cermets or the like. In addition,
substances that form gels can be used. Such materials include, e.g., proteins (e.g.,
gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides. Where the solid
surface is porous, various pore sizes may be employed depending upon the nature of the
system.

In preparing the surface, a plurality of different materials may be
employed, particularly as laminates, to obtain various properties. For example, proteins
(e.g., bovine serum albumin) or mixtures of macromolecules (e.g., Denhardt's solution)
can be employed to avoid non-specific binding, simplify covalent conjugation, enhance
signal detection or the like. If covalent bonding between a compound and the surface is
desired, the surface will usually be polyfunctional or be capable of being
polyfunctionalized. Functional groups which may be present on the surface and used for
linking can include carboxylic acids, aldehydes, amino groups, cyano groups, ethylenic
groups, hydroxyl groups, mercapto groups and the like. The manner of linking a wide
variety of compounds to various surfaces is well known and is amply illustrated in the
literature.

For example, methods for immobilizing nucleic acids by introduction of
various functional groups to the molecules are known (see, e.g., Bischoff (1987) Anal.
Biochem., 164: 336-344; Kremsky (1987) Nucl. Acids Res. 15: 2891-2910). Modified
nucleotides can be placed on the target using PCR primers containing the modified
nucleotide, or by enzymatic end labeling with modified nucleotides. Use of glass or
membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic acid arrays
of the invention is advantageous because of well developed technology employing
manual and robotic methods of arraying targets at relatively high element densities. Such
membranes are generally available, and protocols and equipment for hybridization to
membranes are well known.

Target elements of various sizes, ranging from 1 mm diameter down to 1 µm,
can be used. Smaller target elements containing low amounts of concentrated, fixed
probe DNA are used for high complexity comparative hybridizations since the total
amount of sample available for binding to each target element will be limited. Thus it is
advantageous to have small array target elements that contain a small amount of
concentrated probe DNA so that the signal that is obtained is highly localized and bright.
Such small array target elements are typically used in arrays with densities greater than
10^4/cm^2. Relatively simple approaches capable of quantitative fluorescent imaging of
1 cm^2 areas have been described that permit acquisition of data from a large number of
target elements in a single image (see, e.g., Wittrup (1994) Cytometry 16:206-213).
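
By way of illustration only, the relationship between element density and element
spacing can be sketched as follows. The sketch assumes a regular square grid of target
elements, which is not required by the foregoing description, and the function names are
hypothetical. Under that assumption a density of 10^4 elements per cm^2 corresponds to
a center-to-center spacing of about 100 µm.

```python
import math

UM_PER_CM = 10_000.0  # 1 cm = 10,000 micrometers


def pitch_um_for_density(elements_per_cm2: float) -> float:
    """Center-to-center spacing (micrometers) of a square grid with this density."""
    # On a square grid, density = (1 / pitch)^2, so pitch = 1 / sqrt(density).
    return UM_PER_CM / math.sqrt(elements_per_cm2)


def density_for_pitch(pitch_um: float) -> float:
    """Elements per cm^2 on a square grid with the given spacing in micrometers."""
    return (UM_PER_CM / pitch_um) ** 2


print(pitch_um_for_density(1e4))  # 100.0 (micrometers)
print(density_for_pitch(1000.0))  # 100.0 (elements per cm^2 at 1 mm spacing)
```

The spacing computed here is an upper bound on the element diameter, since adjacent
elements on the grid must not overlap.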

If fluorescently labeled nucleic acid samples are used, arrays on solid
surface substrates with much lower fluorescence than membranes, such as glass, quartz,
or small beads, can achieve much better sensitivity. Substrates such as glass or fused
silica are advantageous in that they provide a very low fluorescence substrate, and a
highly efficient hybridization environment. Covalent attachment of the target nucleic
acids to glass or synthetic fused silica can be accomplished according to a number of
known techniques (described above). Nucleic acids can be conveniently coupled to glass
using commercially available reagents. For instance, materials for preparation of
silanized glass with a number of functional groups are commercially available or can be
prepared using standard techniques (see, e.g., Gait (1984) Oligonucleotide Synthesis: A
Practical Approach, IRL Press, Wash., D.C.). Quartz cover slips, which have at least
10-fold lower autofluorescence than glass, can also be silanized.

Alternatively, probes can also be immobilized on commercially available
coated beads or other surfaces. For instance, biotin end-labeled nucleic acids can be
bound to commercially available avidin-coated beads. Streptavidin or anti-digoxigenin
antibody can also be attached to silanized glass slides by protein-mediated coupling using,
e.g., protein A following standard protocols (see, e.g., Smith (1992) Science 258:
1122-1126). Biotin or digoxigenin end-labeled nucleic acids can be prepared according to
standard techniques. Hybridization to nucleic acids attached to beads is accomplished by
suspending them in the hybridization mix, and then depositing them on the glass substrate
for analysis after washing. Alternatively, paramagnetic particles, such as ferric oxide
particles, with or without avidin coating, can be used.

A variety of other nucleic acid hybridization formats are known to those
skilled in the art. For example, common formats include sandwich assays and
competition or displacement assays. Hybridization techniques are generally described in
Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press;
Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al. (1969)
Nature 223: 582-587.

Sandwich assays are commercially useful hybridization assays for
detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid
covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution.
The sample will provide the target nucleic acid. The "capture" nucleic acid and "signal"
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich"
hybridization complex. To be most effective, the signal nucleic acid should not hybridize
with the capture nucleic acid.
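
The requirement that the signal nucleic acid not hybridize with the capture nucleic
acid can be screened for in silico. The following is a minimal sketch under stated
assumptions: the probe sequences are hypothetical and are not taken from SEQ ID NO: 1,
and the cutoff of six contiguous complementary bases (max_run) is an arbitrary
illustrative value rather than a recommended threshold.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_common_run(a: str, b: str) -> int:
    """Length of the longest contiguous substring shared by a and b."""
    best = 0
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0] * (len(b) + 1)
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                # Extend the run that ended at the previous position of both strings.
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def probes_may_cross_hybridize(capture: str, signal: str, max_run: int = 6) -> bool:
    """True if signal shares a run longer than max_run with capture's reverse complement."""
    # A stretch of the signal probe can base-pair with the capture probe exactly
    # when that stretch also occurs in the reverse complement of the capture probe.
    return longest_common_run(signal.upper(), reverse_complement(capture)) > max_run


capture = "AAAACCCCGGGG"  # hypothetical capture probe
print(probes_may_cross_hybridize(capture, "CCCCGGGGTTTT"))  # True: fully complementary
print(probes_may_cross_hybridize(capture, "ATATATATATAT"))  # False
```

A check of this kind considers only contiguous perfect complementarity; it does not
model mismatched duplexes or hybridization conditions.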

Detection of a hybridization complex may require the binding of a
signal-generating complex to a duplex of target and probe polynucleotides or nucleic acids.
Typically, such binding occurs through ligand and anti-ligand interactions as between a
ligand-conjugated probe and an anti-ligand conjugated with a signal.

The sensitivity of the hybridization assays may be enhanced through use of
a nucleic acid amplification system that multiplies the target nucleic acid being detected.
Examples of such systems include the polymerase chain reaction (PCR) system and the
ligase chain reaction (LCR) system. Other methods recently described in the art are the
nucleic acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario)
and Q Beta Replicase systems.

Nucleic acid hybridization simply involves providing a denatured probe
and target nucleic acid under conditions where the probe and its complementary target
can form stable hybrid duplexes through complementary base pairing. The nucleic acids
that do not form hybrid duplexes are then washed away, leaving the hybridized nucleic
acids to be detected, typically through detection of an attached detectable label. It is
generally recognized that nucleic acids are denatured by increasing the temperature,
decreasing the salt concentration of the buffer containing the nucleic acids, adding
chemical agents, or raising the pH. Under low stringency conditions
(e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes
(e.g., DNA:DNA, RNA:RNA, or RNA:DNA) will form even where the annealed
sequences are not perfectly complementary. Thus specificity of hybridization is reduced
at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower
salt) successful hybridization tolerates fewer mismatches.
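
The effect of mismatches at a given wash temperature can be illustrated with a
rule-of-thumb estimate of duplex stability. The sketch below uses the Wallace rule
(approximately 2°C per A:T pair plus 4°C per G:C pair), which is not part of this
disclosure and is only a rough guide for short oligonucleotides. Counting only the
correctly paired positions is a further simplifying assumption made here, and the
sequences and temperatures are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def estimated_tm(probe: str, target: str) -> int:
    """Wallace-rule estimate (deg C) that counts only correctly paired positions.

    probe and target are equal-length strands, both written 5'->3'.
    """
    if len(probe) != len(target):
        raise ValueError("probe and target must have the same length")
    # The base each probe position must equal in order to pair with the target.
    partner = reverse_complement(target)
    tm = 0
    for p, q in zip(probe.upper(), partner):
        if p == q:
            tm += 4 if p in "GC" else 2
    return tm


def survives_wash(probe: str, target: str, wash_temp_c: float) -> bool:
    """True if the estimated Tm of the duplex exceeds the wash temperature."""
    return estimated_tm(probe, target) > wash_temp_c


probe = "ATGC" * 5                    # hypothetical 20-mer
perfect = reverse_complement(probe)   # fully complementary target
mismatched = "TTT" + perfect[3:]      # target with three mismatched positions

print(estimated_tm(probe, perfect), estimated_tm(probe, mismatched))        # 60 50
print(survives_wash(probe, perfect, 55), survives_wash(probe, mismatched, 55))  # True False
print(survives_wash(probe, perfect, 45), survives_wash(probe, mismatched, 45))  # True True
```

In this toy model both duplexes persist at the lower wash temperature, while only
the perfectly matched duplex persists at the higher one, which mirrors the qualitative
behavior described above.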

One of skill in the art will appreciate that hybridization conditions may be
selected to provide any degree of stringency. In a preferred embodiment, hybridization is
performed at low stringency to ensure hybridization and then subsequent washes are
performed at higher stringency to eliminate mismatched hybrid duplexes. Successive
washes may be performed at increasingly higher stringency (e.g., down to as low as
0.25× SSPE-T at 37°C to 70°C) until a desired level of hybridization specificity is obtained.
Stringency can also be increased by addition of agents such as formamide. Hybridization
specificity may be evaluated by comparison of hybridization to the test probes with
hybridization to the various controls that can be present.

In general, there is a tradeoff between hybridization specificity
(stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed
at the highest stringency that produces consistent results and that provides a signal
intensity greater than approximately 10% of the background intensity. Thus, in a
preferred embodiment, the hybridized array may be washed at successively higher
stringency solutions and read between each wash. Analysis of the data sets thus produced
will reveal a wash stringency above which the hybridization pattern is not appreciably
altered and which provides adequate signal for the particular probes of interest.
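
The selection of a wash condition from a series of successive reads can be sketched
as follows. This is an illustrative sketch only: the WashReading type, the pick_wash
function, and the example wash labels and intensities are hypothetical, and the
criterion is applied literally as stated above (signal greater than approximately
10% of background), with the fraction exposed as a parameter.

```python
from typing import NamedTuple, Optional, Sequence


class WashReading(NamedTuple):
    label: str         # description of the wash condition (hypothetical values below)
    signal: float      # measured signal intensity after this wash
    background: float  # measured background intensity after this wash


def pick_wash(readings: Sequence[WashReading],
              min_fraction: float = 0.10) -> Optional[WashReading]:
    """Return the most stringent reading whose signal exceeds min_fraction * background.

    readings must be ordered from lowest to highest stringency.
    Returns None if no reading meets the criterion.
    """
    for reading in reversed(readings):
        if reading.signal > min_fraction * reading.background:
            return reading
    return None


readings = [
    WashReading("1x SSPE-T, 37 C", 900.0, 300.0),
    WashReading("0.5x SSPE-T, 50 C", 400.0, 200.0),
    WashReading("0.25x SSPE-T, 60 C", 15.0, 180.0),
]
print(pick_wash(readings).label)  # 0.5x SSPE-T, 50 C
```

The sketch does not test whether the results are consistent between replicates or
whether the hybridization pattern has stopped changing; those checks would be applied
to the same series of reads.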

Methods of optimizing hybridization conditions are well known to those of
skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and
Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.).

Labeling and detection of nucleic acids. 

In a preferred embodiment, the hybridized nucleic acids are detected by
detecting one or more labels attached to the sample or probe nucleic acids. The labels
may be incorporated by any of a number of means well known to those of skill in the art.
Means of attaching labels to nucleic acids include, for example, nick translation or
end-labeling (e.g., with a labeled RNA) by kinasing of the nucleic acid and subsequent
attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label
(e.g., a fluorophore). A wide variety of linkers for the attachment of labels to nucleic
acids are also known. In addition, intercalating dyes and fluorescent nucleotides can also
be used.

Detectable labels suitable for use in the present invention include any
composition detectable by spectroscopic, photochemical, biochemical, immunochemical,
electrical, optical or chemical means. Useful labels in the present invention include biotin
for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™),
fluorescent dyes (e.g., fluorescein, Texas red, rhodamine, green fluorescent protein, and
the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3H, 125I,
35S, 14C, or 32P), enzymes (e.g., horseradish peroxidase, alkaline phosphatase and others
commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold
particles in the 40-80 nm diameter size range scatter green light with high efficiency) or
colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents
teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350;
3,996,345; 4,277,437; 4,275,149; and 4,366,241.

A fluorescent label is preferred because it provides a very strong signal
with low background. It is also optically detectable at high resolution and sensitivity
through a quick scanning procedure. The nucleic acid samples can all be labeled with a
single label, e.g., a single fluorescent label. Alternatively, in another embodiment,
different nucleic acid samples can be simultaneously hybridized where each nucleic acid

Diagnostic use of C pneumoniae nucleic acids 

Hvbridization-based assays 

Using the nucleic acids disclosed here, one of skill can design nucleic acid 
30 hybridization-based assays for the detection of C pneumoniae. Any of a number of well 
known techniques for the specific detection of target nucleic acids can be used. 
Exemplary hybridization-based assays include, but are not limited to, traditional "direct 

12 
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probe" methods such as Southern Blots, dot blots, in situ Ziybridization {e.g.. FISH), PCR, 
and the like. The methods can be used in a wide variety of formats including, but not 
limited to substrate- (e.g. membrane or glass) bound methods or array-based approaches 
as described below. As noted above, this invention also embraces methods for detecting 
5 the presence of Chlamydia DNA or RNA in biological samples. These sequences can be 
used to detect Chlamydia in biological samples from patients suspected of being infected. 
A variety of methods of specific DNA and RNA measurement using nucleic acid 
hybridization techniques are known to those of skill in the an (see Sambrook et ai. 
supra). 

!0 In situ hybridization assays are well known (e.g.. Angerer (1987) Meth. 

Enzymol 152: 649). Generally, in situ hybridization comprises the following major steps: 
(1) fixation of tissue or biological structure to analyzed; (2) prehybridization treatment of 
the biological structure tD increase accessibility of target DNA, and to reduce nonspecific 
binding; (3) hybridizatic n of the mixture of nucleic acids to the nucleic acid in the 
1 5 biological structure or tissue; (4) post-hybridization washes to remove nucleic acid 

fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid 
fragments. The reagent used in each of these steps and the conditions for use vary 
depending on the particular application. 

In a typical in situ hybridization assay, cells are fixed to a solid support, 
20 typically a glass slide. If a nucleic acid is to be probed, the cells are typically denatured 
with heat or alkali. The cells are then contacted with a hybridization solution at a 
moderate temperature to permit annealing of labeled probes specific to the nucleic acid 
sequence encoding the protein. The targets (e.g.. cells) are then typically washed at a 
predetermined stringency or at an increasing stringency until an appropriate signal to 

25 noise ratio is obtained. 

The nucleic acids of this invention are particularly well suited to array- 
based hybridization formats. Arrays are a multiplicity of different "probe" or "target" 
nucleic acids (or other compounds) attached to one or more surfaces (e.g.. solid, 
membrane, or gel). In a preferred embodiment, the multiplicity of nucleic acids (or other 
30 moieties) is attached to a single contiguous surface or to a multiplicity of surfaces 
juxtaposed to each other. 

In an array format a large number of different hybridization reactions can 
be run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation 

13 
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of a number of hybridizations in a single "experiment". Methods of performing 
hybridization reactions in array based formats are well known to those of skill in the art 
(see, e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature 
Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958. 
5 Arrays, particularly nucleic acid arrays can be produced according to a 

wide variety of methods well known to those of skill in the art. For example, in a simple 
embodiment, "low density" arrays can simply be produced by spotting {e.g. by hand using 
a pipette) different nucleic acids at different locations on a solid support (e.g. a glass 
surface, a membrane, etc.). 

10 This simple spotting, approach has been automated to produce high 

density spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent describes the 
use of an automated systems that taps a microcapillary against a surface to deposit a small 
volume of a biological sample. The process is repeated to generate high density arrays. 
Arrays can also be produced using oligonucleotide synthesis technology. Thus, for 

15 example, U.S. Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 
92/10092 teach the use of light-directed combinatorial synthesis of high density 
oligonucleotide arrays. 

Many methods for immobilizing nucleic acids on a variety of solid 
surfaces are known in the art. A wide variety of organic and inorganic polymers, as well 

20 as other materials, both natural and synthetic, can be employed as the material for the 
solid surface. Illustrative solid surfaces include, e.g., nitrocellulose, nylon, glass, quartz, 
diazotized membranes (paper or nylon), silicones, polyformaldehyde, cellulose, and 
cellulose acetate. In addition, plastics such as polyethylene, polypropylene, polystyrene, 
and the like can be used. Other materials which may be employed include paper, 

25 ceramics, metals, metalloids, semiconductive materials, cermets or the like. In addition, 
substances that form gels can be used. Such materials include, e.g., proteins (e.g., 
gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides. Where the solid 
surface is porous, various pore sizes may be employed depending upon the nature of the 
system. 

30 In preparing the surface, a plurality of different materials may be 

employed, particularly as laminates, to obtain various properties. For example, proteins 
(e.g., bovine serum albumin) or mixtures of macromolecules (e.g., Denhardt's solution) 
can be employed to avoid non-specific binding, simplify covalent conjugation, enhance 
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signal detection or the like. If covalent bonding between a compound and the surface is 
desired, the surface will usually be polyfiinctional or be capable of being 
polyfiinctionalized. Functional groups which may be present on the surface and used for 
linking can include carboxylic acids, aldehydes, amino groups, cyano groups, ethylenic 
5 groups, hydroxyl groups, mercapto groups and the like. The manner of linking a wide 
variety of compounds to various surfaces is well known and is amply illustrated in the 
literature. 

For example, methods for immobilizing nucleic acids by introduction of 
various functional groups to the molecules is known {see, e.g., Bischoff (1987) Anal. 

10 Biochem., 164: 336-344; Kremsky (1987) NucL Acids Res. 15: 2891-2910). Modified 
nucleotides can be placed on the target using PCR primers containing the modified 
nucleotide, or by enzymatic end labeling with modified nucleotides. Use of glass or 
membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic acid arrays 
of the invention is advantageous because of well developed technology employing 

15 manual and robotic methods of arraying targets at relatively high element densities. Such 
membranes are generally available and protocols and equipment for hybridization to 
membranes is well known. 

Target elements of various sizes, ranging from 1 mm diameter down to 1 
(im can be used. Smaller target elements containing low amounts of concentrated, fixed 

20 probe DNA are used for high complexity comparative hybridizations since the total 

amount of sample available for binding to each target element will be limited. Thus it is 
advantageous to have small array target elements that contain a small amount of 
concentrated probe DNA so that the signal that is obtained is highly localized and bright. 
Such small array target elements are typically used in arrays with densities greater than 
, 25 1 0 4 /cm 2 . Relatively simple approaches capable of quantitative fluorescent imaging of 1 
cm 2 areas have been described that permit acquisition of data from a large number of 
target elements in a single image (see, e.g., Wittrup (1994) Cytometry 16:206-213). 

If fluorescently labeled nucleic acid samples are used, arrays on solid 
surface substrates with much lower fluorescence than membranes, such as glass, quartz, 

30 or small beads, can achieve much better sensitivity. Substrates such as glass or fused 
silica are advantageous in that they provide a very low fluorescence substrate, and a 
highly efficient hybridization environment. Covalent attachment of the target nucleic 
acids to glass or synthetic fused silica can be accomplished according to a number of 

15 
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known techniques (described above). Nucleic acids can be conveniently coupled to glass 
using commercially available reagents. For instance, materials for preparation of 
silanized glass with a number of functional groups are commercially available or can be 
prepared using standard techniques (see, e.g., Gait (1984) Oligonucleotide Synthesis: A 

5 Practical Approach, IRL Press, Wash., D.C.). Quartz cover slips, which have at least 10- 
fold lower autofluorescence than glass, can also be silanized. 

Alternatively, probes can also be immobilized on commercially available 
coated beads or other surfaces. For instance, biotin end-labeled nucleic acids can be 
bound to commercially available avidin-coated beads. Streptavidin or anti-digoxigenin 

10 antibody can also be attached to silanized glass slides by protein-mediated coupling using 
e.g., protein A following standard protocols (see, e.g., Smith (1992) Science 258: 1 122- 
1 126). Biotin or digoxigenin end-labeled nucleic acids can be prepared according to 
standard techniques. Hybridization to nucleic acids attached to beads is accomplished by 
suspending them in the hybridization mix, and then depositing them on the glass substrate 

15 for analysis after washing. Alternatively, paramagnetic particles, such as ferric oxide 
particles, with or without avidin coating, can be used. 

A variety of other nucleic acid hybridization formats are known to those 
skilled in the art. For example, common formats include sandwich assays and 
competition or displacement assays. Hybridization techniques are generally described in 

20 Haines and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL Press; 
Gall and Pardue (1969) Proc. Natl Acad. Sci. USA 63: 378-383; and John et aL (1969) 
Nature 223: 582-587. 

Sandwich assays are commercially useful hybridization assays for 
detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic acid 

25 covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution. 
The sample will provide the target nucleic acid. The "capture" nucleic acid and "signal" 
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich" 
hybridization complex. To be most effective, the signal nucleic acid should not hybridize 
with the capture nucleic acid. 

30 Detection of a hybridization complex may require the binding of a signal 

generating complex to a duplex of target and probe polynucleotides or nucleic acids. 
Typically, such binding occurs through ligand and anti-ligand interactions as between a 
ligand-conjugated probe and an anti-ligand conjugated with a signal. 

16 
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The sensitivity of the hybridization assays may be enhanced through use of 
a nucleic acid amplification system that multiplies the target nucleic acid being detected. 
Examples of such systems include the polymerase chain reaction (PCR) system and the 
ligase chain reaction (LCR) system. Other methods recently described in the art are the 

5 nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) 
and Q Beta Replicase systems. 

Nucleic acid hybridization simply involves providing a denatured probe
and target nucleic acid under conditions where the probe and its complementary target
can form stable hybrid duplexes through complementary base pairing. The nucleic acids
that do not form hybrid duplexes are then washed away, leaving the hybridized nucleic
acids to be detected, typically through detection of an attached detectable label. It is
generally recognized that nucleic acids are denatured by increasing the temperature,
decreasing the salt concentration of the buffer containing the nucleic acids, adding
chemical agents, or raising the pH. Under low stringency conditions
(e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes
(e.g., DNA:DNA, RNA:RNA, or RNA:DNA) will form even where the annealed
sequences are not perfectly complementary. Thus specificity of hybridization is reduced
at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower
salt) successful hybridization tolerates fewer mismatches.
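
The dependence of stringency on temperature, salt and denaturing agents can be made concrete with a melting temperature estimate. The sketch below uses a commonly cited empirical approximation for long DNA:DNA duplexes; the formula and its coefficients are not taken from this disclosure, published variants differ (particularly in the length term), and the function names are illustrative assumptions.

```python
import math


def estimate_tm_celsius(gc_percent: float, na_molar: float,
                        formamide_percent: float = 0.0,
                        length_bp: int = 500) -> float:
    """Approximate melting temperature (deg C) of a long DNA:DNA duplex.

    Empirical approximation (an assumption, not from the source text):
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    """
    if na_molar <= 0 or length_bp <= 0:
        raise ValueError("salt concentration and length must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)


def stringency_margin(wash_temp_c: float, tm_c: float) -> float:
    """Degrees below the estimated Tm at which the wash is run.
    A smaller margin corresponds to a more stringent wash."""
    return tm_c - wash_temp_c
```

Lowering the salt concentration or adding formamide lowers the estimated melting temperature, so a wash at a fixed temperature sits closer to the melting point and mismatched duplexes are less likely to survive.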

One of skill in the art will appreciate that hybridization conditions may be
selected to provide any degree of stringency. In a preferred embodiment, hybridization is
performed at low stringency to ensure hybridization and then subsequent washes are
performed at higher stringency to eliminate mismatched hybrid duplexes. Successive
washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25
X SSPE-T at 37°C to 70°C) until a desired level of hybridization specificity is obtained.
Stringency can also be increased by addition of agents such as formamide. Hybridization
specificity may be evaluated by comparison of hybridization to the test probes with
hybridization to the various controls that can be present.

In general, there is a tradeoff between hybridization specificity
(stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed
at the highest stringency that produces consistent results and that provides a signal
intensity greater than approximately 10% of the background intensity. Accordingly, in a
preferred embodiment, the hybridized array may be washed in solutions of successively
higher stringency and read between each wash. Analysis of the data sets thus produced
will reveal a wash stringency above which the hybridization pattern is not appreciably
altered and which provides adequate signal for the particular probes of interest.
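
The wash-selection rule described above can be sketched as a simple scan over readings taken after each wash. This is an illustrative sketch only: the `WashReading` type, its field names and the example labels are hypothetical, the 10% ratio is the approximate threshold stated in the text, and the "consistent results" criterion is not modeled.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class WashReading:
    label: str         # e.g. the buffer used for this wash (hypothetical)
    signal: float      # signal intensity read after the wash
    background: float  # background intensity read after the wash


def pick_wash(readings: Sequence[WashReading],
              min_ratio: float = 0.10) -> Optional[WashReading]:
    """Return the most stringent wash whose signal still exceeds
    `min_ratio` times the background.

    `readings` must be ordered from lowest to highest stringency.
    Returns None if no wash meets the threshold.
    """
    chosen = None
    for reading in readings:
        if reading.signal > min_ratio * reading.background:
            chosen = reading
    return chosen
```

For example, given readings at three successively more stringent washes in which only the last falls below the threshold, the function returns the second wash. In practice the scan would be combined with a check that the hybridization pattern has stopped changing between washes.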

Methods of optimizing hybridization conditions are well known to those of
skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and
Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.).

Labeling and detection of nucleic acids. 

In a preferred embodiment, the hybridized nucleic acids are detected by
detecting one or more labels attached to the sample or probe nucleic acids. The labels
may be incorporated by any of a number of means well known to those of skill in the art.
Means of attaching labels to nucleic acids include, for example, nick translation or
end-labeling (e.g., with a labeled RNA) by kinasing of the nucleic acid and subsequent
attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label
(e.g., a fluorophore). A wide variety of linkers for the attachment of labels to nucleic
acids are also known. In addition, intercalating dyes and fluorescent nucleotides can also
be used.

Detectable labels suitable for use in the present invention include any
composition detectable by spectroscopic, photochemical, biochemical, immunochemical,
electrical, optical or chemical means. Useful labels in the present invention include biotin
for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads™),
fluorescent dyes (e.g., fluorescein, Texas red, rhodamine, green fluorescent protein, and
the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., ³H, ¹²⁵I,
³⁵S, ¹⁴C, or ³²P), enzymes (e.g., horseradish peroxidase, alkaline phosphatase and others
commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold
particles in the 40-80 nm diameter size range scatter green light with high efficiency) or
colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents
teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350;
3,996,345; 4,277,437; 4,275,149; and 4,366,241.

A fluorescent label is preferred because it provides a very strong signal
with low background. It is also optically detectable at high resolution and sensitivity
through a quick scanning procedure. The nucleic acid samples can all be labeled with a
single label, e.g., a single fluorescent label. Alternatively, in another embodiment,
different nucleic acid samples can be simultaneously hybridized where each nucleic acid
sample has a different label. For instance, one target could have a green fluorescent label
and a second target could have a red fluorescent label. The scanning step will distinguish
sites of binding of the red label from those binding the green fluorescent label. Each
nucleic acid sample (target nucleic acid) can be analyzed independently from one another.

Suitable chromogens which can be employed include those molecules and
compounds which absorb light in a distinctive range of wavelengths so that a color can be
observed or, alternatively, which emit light when irradiated with radiation of a particular
wavelength or wavelength range, e.g., fluorescers.

Desirably, fluorescers should absorb light above about 300 nm, preferably
above about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths
greater than about 10 nm higher than the wavelength of the light absorbed. It should be
noted that the absorption and emission characteristics of the bound dye can differ from
the unbound dye. Therefore, when referring to the various wavelength ranges and
characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye
which is unconjugated and characterized in an arbitrary solvent.

Fluorescers are generally preferred because by irradiating a fluorescer with 
light, one can obtain a plurality of emissions. Thus, a single label can provide for a 
plurality of measurable events. 

Detectable signal can also be provided by chemiluminescent and
bioluminescent sources. Chemiluminescent sources include a compound which becomes
electronically excited by a chemical reaction and can then emit light which serves as the
detectable signal or donates energy to a fluorescent acceptor. Alternatively, luciferins can
be used in conjunction with luciferase or lucigenins to provide bioluminescence.

Spin labels are provided by reporter molecules with an unpaired electron spin which can
be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin labels
include organic free radicals, transition metal complexes, particularly vanadium,
copper, iron, and manganese, and the like. Exemplary spin labels include nitroxide free
radicals.

The label may be added to the target (sample) nucleic acid(s) prior to, or
after the hybridization. So called "direct labels" are detectable labels that are directly
attached to or incorporated into the target (sample) nucleic acid prior to hybridization. In
contrast, so called "indirect labels" are joined to the hybrid duplex after hybridization.
Often, the indirect label is attached to a binding moiety that has been attached to the
target nucleic acid prior to the hybridization. Thus, for example, the target nucleic acid
may be biotinylated before the hybridization. After hybridization, an avidin-conjugated
fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily
detected. For a detailed review of methods of labeling nucleic acids and detecting labeled
hybridized nucleic acids see Laboratory Techniques in Biochemistry and Molecular
Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N.Y.,
(1993).

Fluorescent labels are easily added during an in vitro transcription 
reaction. Thus, for example, fluorescein labeled UTP and CTP can be incorporated into
the RNA produced in an in vitro transcription.

The labels can be attached directly or through a linker moiety. In general, 
the site of label or linker-label attachment is not limited to any specific position. For 
example, a label may be attached to a nucleoside, nucleotide, or analogue thereof at any 
position that does not interfere with detection or hybridization as desired. For example,
certain Label-ON Reagents from Clontech (Palo Alto, CA) provide for labeling
interspersed throughout the phosphate backbone of an oligonucleotide and for terminal
labeling at the 3' and 5' ends. As shown for example herein, labels can be attached at
positions on the ribose ring or the ribose can be modified and even eliminated as desired.
The base moieties of useful labeling reagents can include those that are naturally
occurring or modified in a manner that does not interfere with the purpose to which they
are put. Modified bases include but are not limited to 7-deaza A and G, 7-deaza-8-aza A
and G, and other heterocyclic moieties.

It will be recognized that fluorescent labels are not to be limited to single
species organic molecules, but include inorganic molecules, multi-molecular mixtures of
organic and/or inorganic molecules, crystals, heteropolymers, and the like. Thus, for
example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily
derivatized for coupling to a biological molecule (Bruchez et al. (1998) Science, 281:
2013-2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium
selenide) have been covalently coupled to biomolecules for use in ultrasensitive
biological detection (Warren and Nie (1998) Science, 281: 2016-2018).

Amplification-based assays. 

In another embodiment, amplification-based assays can be used to detect
nucleic acids. In such amplification-based assays, the nucleic acid sequences act as a
template in an amplification reaction (e.g., Polymerase Chain Reaction (PCR)). Detailed
protocols for quantitative PCR are provided in Innis et al. (1990) PCR Protocols, A Guide
to Methods and Applications, Academic Press, Inc. N.Y.

Other suitable amplification methods include, but are not limited to, ligase
chain reaction (LCR) (see Wu and Wallace (1989) Genomics 4: 560, Landegren et al.
(1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117), transcription
amplification (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173), and
self-sustained sequence replication (Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:
1874).

Detection of C. pneumoniae gene expression

The nucleic acids of the invention can also be used to detect C. pneumoniae
gene transcripts. Methods of detecting and/or quantifying gene transcripts using
nucleic acid hybridization techniques are known to those of skill in the art (see Sambrook
et al. supra). For example, a Northern transfer may be used for the detection of the
desired mRNA directly. In brief, the mRNA is isolated from a given cell sample using,
for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA is
then electrophoresed to separate the mRNA species and the mRNA is transferred from the
gel to a nitrocellulose membrane. As with the Southern blots, labeled probes are used to
identify and/or quantify the target mRNA.

In another preferred embodiment, the gene transcript can be measured
using amplification (e.g., PCR) based methods as described above for directly assessing
copy number of the target sequences.

Expression of C. pneumoniae proteins

The nucleic acids disclosed here can be used for recombinant expression
of the proteins. In these methods, the nucleic acids encoding the proteins of interest are
introduced into suitable host cells, followed by induction of the cells to produce large
amounts of the protein. The invention relies on routine techniques in the field of
recombinant genetics, well known to those of ordinary skill in the art. A basic text
disclosing the general methods of use in this invention is Sambrook et al., Molecular
Cloning, A Laboratory Manual (2nd ed. 1989).

Standard transfection methods are used to produce prokaryotic,
mammalian, yeast or insect cell lines which express large quantities of the desired
polypeptide, which is then purified using standard techniques (see, e.g., Colley et al., J.
Biol. Chem. 264:17619-17622, 1989; Guide to Protein Purification, supra).

The nucleotide sequences used to transfect the host cells can be modified
to yield Chlamydia polypeptides with a variety of desired properties. For example, the
polypeptides can vary from the naturally-occurring sequence at the primary structure
level by amino acid insertions, substitutions, deletions, and the like. These modifications
can be used in a number of combinations to produce the final modified protein chain.

The amino acid sequence variants can be prepared with various objectives
in mind, including facilitating purification and preparation of the recombinant
polypeptide. The modified polypeptides are also useful for modifying plasma half life,
improving therapeutic efficacy, and lessening the severity or occurrence of side effects
during therapeutic use. The amino acid sequence variants are usually predetermined
variants that are not found in nature but exhibit the same immunogenic activity as naturally
occurring protein. In general, modifications of the sequences encoding the polypeptides
may be readily accomplished by a variety of well-known techniques, such as site-directed
mutagenesis (see Gillman & Smith, Gene 8:81-97 (1979); Roberts et al., Nature
328:731-734 (1987)). One of ordinary skill will appreciate that the effect of many mutations is
difficult to predict. Thus, most modifications are evaluated by routine screening in a
suitable assay for the desired characteristic. For instance, the effect of various
modifications on the ability of the polypeptide to elicit a protective immune response can
be easily determined using in vitro assays. For instance, the polypeptides can be tested
for their ability to induce lymphoproliferation, T cell cytotoxicity, or cytokine production
using standard techniques.

The particular procedure used to introduce the genetic material into the
host cell for expression of the polypeptide is not particularly critical. Any of the well
known procedures for introducing foreign nucleotide sequences into host cells may be
used. These include the use of calcium phosphate transfection, spheroplasts,
electroporation, liposomes, microinjection, plasmid vectors, viral vectors and any of the
other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA
or other foreign genetic material into a host cell (see Sambrook et al., supra). It is only
necessary that the particular procedure utilized be capable of successfully introducing at
least one gene into the host cell which is capable of expressing the gene.

Any of a number of well known cells and cell lines can be used to express
the polypeptides of the invention. For instance, prokaryotic cells such as E. coli can be
used. Eukaryotic cells include yeast, Chinese hamster ovary (CHO) cells, COS cells, and
insect cells.

The particular vector used to transport the genetic information into the cell
is also not particularly critical. Any of the conventional vectors used for expression of
recombinant proteins in prokaryotic and eukaryotic cells may be used. Expression
vectors for mammalian cells typically contain regulatory elements from eukaryotic
viruses.

The expression vector typically contains a transcription unit or expression
cassette that contains all the elements required for the expression of the polypeptide DNA
in the host cells. A typical expression cassette contains a promoter operably linked to the
DNA sequence encoding a polypeptide and signals required for efficient polyadenylation
of the transcript. The term "operably linked" as used herein refers to linkage of a
promoter upstream from a DNA sequence such that the promoter mediates transcription
of the DNA sequence. The promoter is preferably positioned about the same distance
from the heterologous transcription start site as it is from the transcription start site in its
natural setting. As is known in the art, however, some variation in this distance can be
accommodated without loss of promoter function.

Following the growth of the recombinant cells and expression of the
polypeptide, the culture medium is harvested for purification of the secreted protein. The
media are typically clarified by centrifugation or filtration to remove cells and cell debris
and the proteins are concentrated by adsorption to any suitable resin or by use of
ammonium sulfate fractionation, polyethylene glycol precipitation, or by ultrafiltration.
Other routine means known in the art may be equally suitable. Further purification of the
polypeptide can be accomplished by standard techniques, for example, affinity
chromatography, ion exchange chromatography, sizing chromatography, His6 tagging and
Ni-agarose chromatography (as described in Dobeli et al., Mol. and Biochem. Parasit.
41:259-268 (1990)), or other protein purification techniques to obtain homogeneity. The
purified proteins are then used to produce pharmaceutical compositions, as described
below.

An alternative method of preparing recombinant polypeptides useful as
vaccines involves the use of recombinant viruses (e.g., vaccinia). Vaccinia virus is grown
in suitable cultured mammalian cells such as the HeLa S3 spinner cells, as described by
Mackett et al., in DNA Cloning Vol. II: A Practical Approach, pp. 191-211 (Glover, ed.).

Antibody Production 

The proteins of the present invention can be used to produce antibodies
specifically reactive with C. pneumoniae antigens. If isolated proteins are used, they may
be recombinantly produced or isolated from Chlamydia cultures. Synthetic peptides
made using the protein sequences may also be used.

Methods of production of polyclonal antibodies are known to those of skill
in the art. In brief, an immunogen, preferably a purified protein, is mixed with an
adjuvant and animals are immunized. When appropriately high titers of antibody to the
immunogen are obtained, blood is collected from the animal and antisera are prepared.
Further fractionation of the antisera to enrich for antibodies reactive to Chlamydia
proteins can be done if desired (see Harlow & Lane, Antibodies: A Laboratory Manual
(1988)).

Polyclonal antisera are used to identify and characterize Chlamydia in the
tissues of patients using, for instance, in situ techniques and immunoperoxidase test
procedures described in Anderson et al., JAVMA 198:241 (1991) and Barr et al., Vet.
Pathol. 28:110-116 (1991).

Monoclonal antibodies may be obtained by various techniques familiar to
those skilled in the art. Briefly, spleen cells from an animal immunized with a desired
antigen are immortalized, commonly by fusion with a myeloma cell (see Kohler &
Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization
include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other
methods well known in the art. Colonies arising from single immortalized cells are
screened for production of antibodies of the desired specificity and affinity for the
antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced
by various techniques, including injection into the peritoneal cavity of a vertebrate host.

Monoclonal antibodies produced in such a manner are used, for instance,
in ELISA diagnostic tests, immunoperoxidase tests, immunohistochemical tests, for the in
vitro evaluation of spirochete invasion, to select candidate antigens for vaccine
development, protein isolation, and for screening genomic and cDNA libraries to select
appropriate gene sequences.






WO 00/27994 PCT/US99/26923 

Immunodiagnostic detection of C. pneumoniae infections

The present invention also provides methods for detecting the presence or
absence of C. pneumoniae, or antibodies reactive with it, in a biological sample. For
instance, antibodies specifically reactive with Chlamydia can be detected using either
Chlamydia proteins or the isolates described here. The proteins and isolates can also be
used to raise specific antibodies (either monoclonal or polyclonal) to detect the antigen in
a sample. In addition, the nucleic acids disclosed and claimed here can be used to detect
Chlamydia-specific sequences using standard hybridization techniques.

For a review of immunological and immunoassay procedures in general,
see Basic and Clinical Immunology (Stites & Terr ed., 7th ed. 1991). The immunoassays
of the present invention can be performed in any of several configurations, which are
reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980) and Tijssen, Laboratory
Techniques in Biochemistry and Molecular Biology (1985). For instance, the proteins
and antibodies disclosed here are conveniently used in ELISA, immunoblot analysis and
agglutination assays.

In brief, immunoassays to measure anti-Chlamydia antibodies or antigens
can be either competitive or noncompetitive binding assays. In competitive binding
assays, the sample analyte (e.g., anti-Chlamydia antibodies) competes with a labeled
analyte (e.g., anti-Chlamydia monoclonal antibody) for specific binding sites on a capture
agent (e.g., isolated Chlamydia protein) bound to a solid surface. The concentration of
labeled analyte bound to the capture agent is inversely proportional to the amount of free
analyte present in the sample.
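
The inverse relationship in a competitive format can be illustrated with a deliberately simplified model. The sketch below assumes that the labeled and sample analytes bind the capture agent with equal affinity and that binding sites are limiting, so that the labeled analyte occupies sites in proportion to its share of the total analyte. These assumptions and the function name are illustrative and are not taken from the disclosure; under them the signal falls monotonically as sample analyte rises, although not in strict inverse proportion.

```python
def bound_labeled_fraction(labeled: float, unlabeled: float) -> float:
    """Fraction of capture sites occupied by labeled analyte.

    Simplifying assumptions (for illustration only): both analytes
    have the same affinity for the capture agent, and the capture
    sites are limiting and fully occupied.
    """
    if labeled < 0 or unlabeled < 0:
        raise ValueError("amounts must be non-negative")
    total = labeled + unlabeled
    if total == 0:
        raise ValueError("at least one analyte must be present")
    return labeled / total
```

With a fixed amount of labeled analyte, a sample containing no analyte gives the maximum signal, and each increase in sample analyte displaces more label from the capture agent. A real assay would be read against a standard curve rather than this idealized expression.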

Noncompetitive assays are typically sandwich assays, in which the sample
analyte is bound between two analyte-specific binding reagents. One of the binding
agents is used as a capture agent and is bound to a solid surface. The second binding
agent is labelled and is used to measure or detect the resultant complex by visual or
instrument means.

A number of combinations of capture agent and labelled binding agent can
be used. For instance, an isolated Chlamydia protein or culture can be used as the
capture agent and labelled anti-human antibodies specific for the constant region of
human antibodies can be used as the labelled binding agent. Goat, sheep and other
non-human antibodies specific for human immunoglobulin constant regions (e.g., γ or μ) are
well known in the art. Alternatively, the anti-human antibodies can be the capture agent
and the antigen can be labelled.

Various components of the assay, including the antigen, anti-Chlamydia
antibody, or anti-human antibody, may be bound to a solid surface. Many methods for
immobilizing biomolecules to a variety of solid surfaces are known in the art. For
instance, the solid surface may be a membrane (e.g., nitrocellulose), a microtiter dish
(e.g., PVC or polystyrene) or a bead. The desired component may be covalently bound or
noncovalently attached through nonspecific bonding.

Alternatively, the immunoassay may be carried out in liquid phase and a
variety of separation methods may be employed to separate the bound labelled component
from the unbound labelled components. These methods are known to those of skill in the
art and include immunoprecipitation, column chromatography, adsorption, addition of
magnetizable particles coated with a binding agent and other similar procedures.

An immunoassay may also be carried out in liquid phase without a
separation procedure. Various homogeneous immunoassay methods are now being
applied to immunoassays for protein analytes. In these methods, the binding of the
binding agent to the analyte causes a change in the signal emitted by the label, so that
binding may be measured without separating the bound from the unbound labelled
component.

Western blot (immunoblot) analysis can also be used to detect the presence
of antibodies to Chlamydia in the sample. This technique is a reliable method for
confirming the presence of antibodies against a particular protein in the sample. The
technique generally comprises separating proteins by gel electrophoresis on the basis of
molecular weight, transferring the separated proteins to a suitable solid support (such as a
nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample
with the separated proteins. This causes specific target antibodies present in the sample
to bind their respective proteins. Target antibodies are then detected using labeled
anti-human antibodies.

The immunoassay formats described above employ labelled assay
components. The label may be coupled directly or indirectly to the desired component of
the assay according to methods well known in the art. A wide variety of labels may be
used. The component may be labelled by any one of several methods. Traditionally a
radioactive label incorporating ³H, ¹²⁵I, ³⁵S, ¹⁴C, or ³²P was used. Non-radioactive labels
include ligands which bind to labelled antibodies, fluorophores, chemiluminescent agents,
enzymes, and antibodies which can serve as specific binding pair members for a labelled
ligand. The choice of label depends on sensitivity required, ease of conjugation with the
compound, stability requirements, and available instrumentation.

Enzymes of interest as labels will primarily be hydrolases, particularly
phosphatases, esterases and glycosidases, or oxidoreductases, particularly peroxidases.
Fluorescent compounds include fluorescein and its derivatives, rhodamine and its
derivatives, dansyl, umbelliferone, etc. Chemiluminescent compounds include luciferin,
and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various labelling or
signal producing systems which may be used, see U.S. Patent No. 4,391,904, which is
incorporated herein by reference.

Non-radioactive labels are often attached by indirect means. Generally, a
ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds
to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or
covalently bound to a signal system, such as a detectable enzyme, a fluorescent
compound, or a chemiluminescent compound. A number of ligands and anti-ligands can
be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, and
cortisol, it can be used in conjunction with the labelled, naturally occurring anti-ligands.
Alternatively, any haptenic or antigenic compound can be used in combination with an
antibody.

Some assay formats do not require the use of labelled components. For 
instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labelled and the presence of
the target antibody is detected by simple visual inspection.

Pharmaceutical Compositions 

The peptides or antibodies (typically monoclonal antibodies) of the present
invention and pharmaceutical compositions thereof are useful for administration to
mammals, particularly humans, to treat and/or prevent Chlamydia infections. Suitable
formulations are found in Remington's Pharmaceutical Sciences, Mack Publishing
Company, Philadelphia, PA, 17th ed. (1985).

The immunogenic peptides or antibodies of the invention are administered
prophylactically or to an individual already suffering from the disease. The peptide
compositions are administered to a patient in an amount sufficient to elicit an effective
immune response to Chlamydia. An effective immune response is one that inhibits
infection. An amount adequate to accomplish this is defined as "therapeutically effective
dose" or "immunogenically effective dose." Amounts effective for this use will depend
on, e.g., the peptide composition, the manner of administration, the stage and severity of
the disease being treated, the weight and general state of health of the patient, and the
judgment of the prescribing physician, but generally range for the initial immunization
(that is for therapeutic or prophylactic administration) from about 0.1 mg to about 1.0 mg
per 70 kilogram patient, more commonly from about 0.5 mg to about 0.75 mg per 70 kg
of body weight. Boosting dosages are typically from about 0.1 mg to about 0.5 mg of
peptide using a boosting regimen over weeks to months depending upon the patient's
response and condition. A suitable protocol would include injection at time 0, 4, 2, 6, 10
and 14 weeks, followed by further booster injections at 24 and 28 weeks.

For therapeutic use, administration should begin at the first sign of
infection. This is followed by boosting doses at least until symptoms are substantially
abated and for a period thereafter. In some circumstances, loading doses followed by
boosting doses may be required. The resulting immune response helps to cure or at least
partially arrest symptoms and/or complications. Vaccine compositions containing the
peptides are administered prophylactically to a patient susceptible to or otherwise at risk
of the infection.

The pharmaceutical compositions (containing either peptides or
antibodies) are intended for parenteral or oral administration. Preferably, the
pharmaceutical compositions are administered parenterally, e.g., subcutaneously,
intradermally, or intramuscularly. Thus, the invention provides compositions for
parenteral administration which comprise a solution of the immunogenic polypeptides
dissolved or suspended in an acceptable carrier, preferably an aqueous carrier. A variety
of aqueous carriers may be used, e.g., water, buffered water, 0.4% saline, 0.3% glycine,
hyaluronic acid and the like. These compositions may be sterilized by conventional, well
known sterilization techniques, or may be sterile filtered. The resulting aqueous solutions
may be packaged for use as is, or lyophilized, the lyophilized preparation being combined
with a sterile solution prior to administration. The compositions may contain
pharmaceutically acceptable auxiliary substances as required to approximate
physiological conditions, such as buffering agents, tonicity adjusting agents, wetting
agents and the like, for example, sodium acetate, sodium lactate, sodium chloride,
potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc.

The compositions may also comprise carriers to enhance the immune
response. Useful carriers are well known in the art, and include, e.g., KLH,
thyroglobulin, albumins such as human serum albumin, tetanus toxoid, polyamino acids
such as poly(lysine:glutamic acid), influenza, hepatitis B virus core protein, hepatitis B
virus recombinant vaccine and the like.

For solid compositions, conventional nontoxic solid carriers may be used
which include, for example, pharmaceutical grades of mannitol, lactose, starch,
magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium
carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic
composition is formed by incorporating any of the normally employed excipients, such as
those carriers previously listed, and generally 10-95% of active ingredient, that is, one or
more peptides of the invention, and more preferably at a concentration of 25%-75%.

As noted above, the peptide compositions are intended to induce an
immune response to Chlamydia. Thus, compositions and methods of administration
suitable for maximizing the immune response are preferred. For instance, peptides may
be introduced into a host, including humans, linked to a carrier or as a homopolymer or
heteropolymer of active peptide units from various Chlamydia proteins disclosed here.
Alternatively, a "cocktail" of polypeptides can be used. A mixture of more than one
polypeptide has the advantage of increased immunological reaction and, where different
peptides are used to make up the polymer, the additional ability to induce antibodies to a
number of epitopes.

The compositions also include an adjuvant. A number of
adjuvants are well known to one skilled in the art. Suitable adjuvants include incomplete
Freund's adjuvant, alum, aluminum phosphate, aluminum hydroxide,
N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP),
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP),
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-
glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE),
and RIBI, which contains three components extracted from bacteria, monophosphoryl
lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2%
squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by
measuring the amount of antibodies directed against the immunogenic peptide.

The concentration of immunogenic peptides of the invention in the
pharmaceutical formulations can vary widely, i.e., from less than about 0.1%, usually at
or at least about 2% to as much as 20% to 50% or more by weight, and will be selected
primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of
administration selected.

The peptides of the invention can also be expressed by attenuated viral
hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus as a
vector to express nucleotide sequences that encode the peptides of the invention. Upon
introduction into a host, the recombinant vaccinia virus expresses the immunogenic
peptide, and thereby elicits an immune response. Vaccinia vectors and methods useful in
immunization protocols are described in, e.g., U.S. Patent No. 4,722,848. Another vector
is BCG (Bacille Calmette Guerin). BCG vectors are described in Stover et al. (Nature
351:456-460 (1991)). A wide variety of other vectors useful for therapeutic
administration or immunization of the peptides of the invention, e.g., Salmonella typhi
vectors and the like, will be apparent to those skilled in the art from the description
herein.

The DNA encoding one or more of the peptides of the invention can also
be administered to the patient. This approach is described, for instance, in Wolff et al.,
Science 247:1465-1468 (1990) as well as U.S. Patent Nos. 5,580,859 and 5,589,466.

In order to enhance serum half-life, the peptides may also be encapsulated,
introduced into the lumen of liposomes, prepared as a colloid, or other conventional
techniques may be employed which provide an extended serum half-life of the peptides.
A variety of methods are available for preparing liposomes, as described in, e.g., Szoka et
al., Ann. Rev. Biophys. Bioeng. 9:467 (1980), U.S. Pat. Nos. 4,235,871, 4,501,728 and
4,837,028.

EXAMPLES

The following examples are offered to illustrate, but not to limit the
claimed invention.
Example 1:
This example describes comparison of the C. pneumoniae genome
disclosed here and the previously sequenced C. trachomatis genome (Stephens et al.,
Science 282:754-759 (1998)).

The apparent low level of DNA homology between C. trachomatis and C.
pneumoniae (Campbell et al., J. Clin. Microbiol. 25:1911-1916 (1987)), yet analogous
cell structures and developmental cycles, predicts that comparative analysis of the two
genomes will significantly enhance the understanding of both pathogens. Identification
of genes that are present in one species but not the other is of particular importance for
the mutually exclusive biological, virulence and pathogenesis capabilities of each.
Identification of genes shared between the two species strongly supports the requirement
for these capabilities in a biological system that has, over its long-term association with
mammalian host cells, evolved to reduce the metabolic capacities while optimizing
survival, growth and transmission of these unique pathogens.

The previously sequenced C. trachomatis genome contains 1,042,519
nucleotides and 875 likely protein-coding genes. Similarity searching permitted the
inferred functional assignment of 636 (60%) of the genes disclosed here, and 251
(23%) are similar to hypothetical genes of other bacterial organisms, including those of
C. trachomatis. The remaining 186 (17%) genes are not homologous to sequences
deposited in GenBank. Seventy C. trachomatis genes are not represented in the C.
pneumoniae genome. These are contained within blocks consisting of 2-17 genes and 19
single genes. Of the 70 C. trachomatis genes without homologs in C. pneumoniae, 60 are
classified as encoding hypothetical proteins. The remaining genes not represented in C.
pneumoniae consist of the tryptophan operon (trpA,B,R), trpC, two predicted thiol
protease genes, and 4 genes assigned to the phospholipase-D superfamily.
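
As a quick arithmetic check on the three similarity categories given above, the following minimal Python sketch sums the stated gene counts and recomputes each category's share. The variable names are illustrative only and are not part of the disclosure; the total is derived here by summation, on the assumption that the three categories are exhaustive and do not overlap.

```python
# Gene counts per similarity category, as stated in the text above.
counts = {
    "inferred functional assignment": 636,
    "similar to hypothetical genes": 251,
    "no homolog in GenBank": 186,
}

# Assumption: the categories are exhaustive and non-overlapping,
# so their sum is the number of predicted coding sequences.
total = sum(counts.values())

# Share of each category as a percentage of the derived total.
shares = {name: 100.0 * n / total for name, n in counts.items()}

print(f"total coding sequences (derived): {total}")
for name, pct in shares.items():
    print(f"{name}: {counts[name]} genes, {pct:.1f}%")
```

The recomputed shares may differ by about one percentage point from the rounded figures quoted in the text.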

It is evident that there is a high level of functional conservation between C.
pneumoniae and C. trachomatis, as orthologs to C. trachomatis genes were identified for
859 (80%) of the predicted coding sequences for C. pneumoniae. The level of similarity
for individual encoded proteins spans a wide spectrum (22-95% amino acid identity) with
an average of 62% amino acid identity between orthologs from the two species. The
percent amino acid identity between orthologous chlamydial proteins is similar among
functional groups with the highest for proteins associated with translation and the lowest
for proteins whose function in chlamydiae is uncharacterized and not related to proteins
encoded by other organisms. The gene order of the homologous set of genes in C.
pneumoniae shows reorganization relative to the genome of C. trachomatis; however,
there is a high level of synteny for the gene organization of the two genomes. We
identified thirty-nine blocks of 2 or more genes whose gene organization is colinear with
homologs to C. trachomatis, although some of these are inverted. Genome
reorganization is not evenly distributed on the chromosome, as the region
between C. pneumoniae coding sequences 0130-0300 contains substantially more
reorganization than other areas of the genome. This region coincides with the predicted
chromosome replication terminus.
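
The notion of colinear blocks, some of them inverted, can be illustrated with a short Python sketch. It is not the procedure used to obtain the thirty-nine blocks reported above, which the text does not describe. It assumes a hypothetical input: for each C. pneumoniae gene in chromosomal order, the position of its ortholog in the C. trachomatis gene order, or None where no ortholog exists. The function name, the input encoding and the toy data are all invented for illustration.

```python
from typing import List, Optional, Tuple


def collinear_blocks(
    ortholog_index: List[Optional[int]], min_len: int = 2
) -> List[Tuple[int, int, int]]:
    """Find runs of adjacent genes whose orthologs are also adjacent.

    Returns (first, last, direction) tuples, where first/last are positions
    in the input list and direction is +1 (same order) or -1 (inverted).
    """
    blocks = []
    n = len(ortholog_index)
    i = 0
    while i < n:
        if ortholog_index[i] is None:
            i += 1
            continue
        j = i
        direction = 0  # not yet known for this run
        while j + 1 < n and ortholog_index[j + 1] is not None:
            step = ortholog_index[j + 1] - ortholog_index[j]
            if step not in (1, -1):
                break  # orthologs are not neighbours
            if direction == 0:
                direction = step
            elif step != direction:
                break  # orientation changed, so the run ends here
            j += 1
        if j - i + 1 >= min_len:
            blocks.append((i, j, direction))
        i = j + 1
    return blocks


# Hypothetical toy data: two colinear blocks and one inverted block.
toy = [10, 11, 12, None, 40, 39, 38, 7, 20, 21]
print(collinear_blocks(toy))
```

With the toy list, the sketch reports a forward block over positions 0-2, an inverted block over positions 4-6 and a forward block over positions 8-9; the isolated gene at position 7 does not form a block.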

We identified orthologs of enzymes characterized in other bacteria that
account for the essential requirements for DNA replication, repair, transcription and
translation including two predicted DNA helicases of the Swi2/Snf2 family found in C.
trachomatis. Similar to C. trachomatis, alternative sigma subunits for RNA polymerase,
σ28 and σ54, were identified in addition to anti-σ regulatory system factors RsbV, a
RsbW-like single-domain histidine kinase, and a RsbU-like protein phosphatase. These
findings suggest that the fundamental mechanisms of transcriptional regulation are
conserved among Chlamydia. The C. trachomatis proteins containing SET and SWIB
domains, and a SWIB domain fused to the C-terminus of the chlamydial topoisomerase I,
not identified outside eukaryotes, are found in C. pneumoniae, supporting their possible
role in the chromatin condensation-decondensation characteristic of the biologically
unique chlamydial developmental cycle.

The central metabolic pathways inferred from the C. pneumoniae genome
sequence are the same as those identified for C. trachomatis. C. pneumoniae has a
glycolytic pathway and a linked tricarboxylic acid cycle that, although likely functional, is
incomplete, as genes for citrate synthase, aconitase, and isocitrate dehydrogenase were not
identified. C. pneumoniae has a complete glycogen synthesis and degradation system
supporting a role for glycogen synthesis and utilization of glucose-derivatives in
chlamydial metabolism. Genes encoding essential functions in aerobic respiration are
present and electron flux may be supported by pyruvate, succinate, glycerol-3-phosphate,
and NADH dehydrogenases, NADH-ubiquinone oxidoreductase and cytochrome oxidase.
C. pneumoniae also contains the V (vacuolar)-type ATPase operon and the two ATP
translocases found in C. trachomatis.

The type-III secretion virulence system required for invasion by several
pathogenic bacteria and found in the C. trachomatis genome in three chromosomal
locations is also present in the C. pneumoniae genome. Each of the components is
conserved and their relative genomic contexts are conserved. Genes such as a predicted
serine/threonine protein kinase and other genes physically linked to genes encoding
structural components of the type-III secretion apparatus, but without identified
homologs, are also highly similar between the two species, suggesting the functional roles
in modifying cellular biology are fundamentally conserved.

Chlamydia-encoded proteins that are not found in chlamydial organisms
but are localized to the intracellular chlamydial inclusion membrane are likely essential for
the unique intracellular biology and perhaps differences in inclusion morphology
observed between species of Chlamydia. Several such proteins, termed IncA, IncB and IncC, have
been characterized for a C. psittaci strain (Rockey et al., Mol. Microbiol. 15:617-626
(1995); Rockey et al., Infect. Immun. 62:106-112 (1994)). C. pneumoniae and C.
trachomatis encode orthologs to C. psittaci IncB and IncC, and C. trachomatis also
contains an ortholog to IncA. C. pneumoniae contains two genes that encode proteins
with similarity to IncA (CPn0186 and CPn0585), although the level of homology is low,
suggesting analogous but possibly altered functions.

The tryptophan biosynthesis operon (trpA, trpB, trpR) and trpC identified
in C. trachomatis are conspicuously missing in the C. pneumoniae genome. This
represents the entire repertoire of genes associated with tryptophan biosynthesis identified
in C. trachomatis. Seventeen genes adjacent to the C. trachomatis tryptophan operon also
were not found in the C. pneumoniae genome. This region is the single largest loss of a
contiguous genomic segment and includes 4 HKD superfamily encoding genes that
encompass a family of proteins related to endonuclease and phospholipase D. These
findings may be important for the ability of Chlamydia to persist in their hosts and cause
disease by eliciting potent, focal and persistent inflammatory responses thought to be
essential for pathogenesis.

The C. pneumoniae genome contains 187,711 additional nucleotides
compared to the C. trachomatis genome, and the 214 coding sequences not found in C.
trachomatis account for most of the increased genome size. Eighty-eight of these genes
are found in blocks of >10 genes (11-30 genes/block), 41 are single genes, and the
remainder are partnered with at least one other gene. Based upon the observation that
~70% of all the C. pneumoniae genes have an identifiable homolog in GenBank,
exclusive of C. trachomatis, it would be expected that over 150 of the 214 genes should

WO 00/27994 PCT/US99/26923 

have a homolog in GenBank, many associated with a function. However, only 28 coding
sequences have similarity to genes from other organisms. Thus the majority of the genes
that are mutually exclusive of C. trachomatis (186 of 214), and the 60 of 70 C.
trachomatis genes that lacked an identifiable homolog in C. pneumoniae, do not have
detectable homologs to genes from other organisms. We predict that most of the unique
genes are essential for specific attributes that define the differential biology, tropism and
pathogenesis of C. trachomatis and C. pneumoniae. Moreover, this suggests that C.
pneumoniae has more unique biological (i.e., virulence) capacity than C. trachomatis.
The ability of C. pneumoniae to be more invasive and survive in a broader range of host
cell types than C. trachomatis is consistent with this hypothesis. Not all of the
differences in biological capacity may be associated with mutually exclusive genes. One
explanation for the significantly lower level of homology between protein sequences
assigned as having C. pneumoniae and C. trachomatis orthologs but no identifiable
orthologs in other organisms is that this set of proteins is not only associated with
biological requirements specific for Chlamydia but this polymorphism may account for
differential biology between the two species. The determination of the genome sequence
from a representative of the C. psittaci group will precisely delineate those genes that are
mutually exclusive and specific for each species.
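
The counts quoted in this paragraph can be cross-checked with a few lines of arithmetic. The Python sketch below uses only figures stated in this example; the variable names are illustrative, and the derived values (the implied C. pneumoniae genome length and the number of genes "partnered with at least one other gene") are computed here rather than quoted from the text.

```python
# Figures stated in the text of this example.
ct_genome_nt = 1_042_519        # C. trachomatis genome length
extra_nt = 187_711              # additional nucleotides in C. pneumoniae
unique_cds = 214                # coding sequences not found in C. trachomatis
in_large_blocks = 88            # genes in blocks of more than 10 genes
single_genes = 41               # stand-alone unique genes
without_homolog = 186           # unique genes with no detectable homolog
with_similarity = 28            # unique genes similar to other organisms
expected_fraction = 0.70        # approximate GenBank homolog rate

# Derived values (not quoted in the text).
cpn_genome_nt = ct_genome_nt + extra_nt
partnered = unique_cds - in_large_blocks - single_genes
expected_with_homolog = expected_fraction * unique_cds

print(f"implied C. pneumoniae genome length: {cpn_genome_nt:,} nt")
print(f"unique genes partnered with another gene: {partnered}")
print(f"unique genes expected to have a homolog: about {expected_with_homolog:.0f}")
print(f"unique genes observed with similarity: {with_similarity}")
```

The observed 28 genes fall far short of the roughly 150 expected at a 70% rate, which is the gap the paragraph draws attention to.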

The major functionally identifiable addition to the C. pneumoniae genome
is a large expansion of genes encoding a new family of chlamydial polymorphic
membrane proteins (Pmp), alone representing 22% of the increased coding capacity.
While the C. trachomatis genome has 9 pmp genes, remarkably the C. pneumoniae
genome contains 21 pmp genes. Most of these genes appear to be amplified in two
regions of the genome with three stand-alone genes. Interestingly, one of the stand-alone
genes is most closely related to the C. trachomatis pmpD, which is the only stand-alone
pmp gene in the C. trachomatis genome, and it is located with the same relative genomic
context, suggesting an essential and conserved function for this paralog. Six Pmp-coding
genes are presumably not functional as five contain predicted coding frame-shifts and one
is truncated. The amplification of this gene family and the confidently predicted
frame-shifts suggest a specific molecular mechanism to promote functional or antigenic
diversity. The biological role of this protein family remains enigmatic, although at least
one of the proteins in C. psittaci related to this family is exposed on the chlamydial
surface.

While a function could not be assigned for most of the unique C.
pneumoniae genes, several have significant similarity to genes from other organisms.
Functional assignments could be made for genes encoding GMP synthetase, IMP
dehydrogenase, UMP synthase, uridine kinase, biotin synthase pathway proteins,
methylthioadenosine nucleosidase, a DNA glycosylase and aromatic amino acid
hydroxylase. Thus a complete pathway was identified for biotin biosynthesis. The
additional purine and pyrimidine salvage pathway genes presumably reflect metabolic
limitations in one of the cell types that C. pneumoniae infects or differences in the ability
of C. pneumoniae to transport precursor nucleosides or nucleotides.

The addition of aromatic amino acid hydroxylase in C. pneumoniae is
intriguing, especially in light of the loss of tryptophan biosynthetic genes and the inability
to synthesize other amino acids including phenylalanine. Aromatic amino acid
hydroxylases include three distinct enzymes that function to respectively oxidize
phenylalanine to tyrosine, tyrosine to Dopa, and tryptophan to 5-hydroxytryptophan and
serotonin. Although the chlamydial protein is similar to proteins of this family and
incrementally more closely related to tryptophan hydroxylase, its specific function could
not be confidently predicted. We hypothesize that it may be involved in C. pneumoniae
virulence. Tryptophan hydroxylase has not been previously identified in bacteria and the
origin of the chlamydial gene appears to be from eukaryotes. The functional role of an
aromatic amino acid hydroxylase for C. pneumoniae is linked to the unique intracellular
biology of this organism and may represent a key contribution to C. pneumoniae
persistence and pathogenesis.

It is understood that the examples and embodiments described herein are
for illustrative purposes only and that various modifications or changes in light thereof
will be suggested to persons skilled in the art and are to be included within the spirit and
purview of this application and scope of the appended claims. All publications, patents,
and patent applications cited herein are hereby incorporated by reference in their entirety
for all purposes.

Table 1 provides functional assignments of C. pneumoniae non-protein-encoding
genomic sequences. Table 2 provides functional assignments of protein coding
sequences. Table 3 provides the amino acid sequences of the proteins corresponding to
the coding sequences.
62790


F 


CPn0046  62790  63263  F
CPn0047  63455  63652  F
CPn0048  63607  65001  F
CPn0049  66296  65017  R
CPn0050  66013  66499  R
CPn0051  66033  67111  F
CPn0052  60005  67304  R
CPn0053  69344  67906  R
CPn0054  70023  69313  R
CPn0055  70129  70590
CPn0056  70953  72746
CPn0057  72934  73554
CPn0058  73639  74562




CPn0059 


:«616 


75050 




CPnOOSO 


75055 


75528 




CPft006l 


75514 


76208 




CPnQ062 


7^106 


77690 




OPnOfl*) 


7«u: 


78267 




OPft0fl«4 


7UJ46 


78576 




TPnQfl*? 


7H924 


40651 




OPnOO** 


H0«5 


8J6S5 





CT001 hypothetical protein 

aatC-Clu*t*XA ClA Aminotransferase (C subunit) - (CT002) 
gatA-Clu tXKA Gin A»idotranafer««-(CT003) 

♦iar.B • < PeeU2» fi'fiA OLn AnifiocrAn=t"tc*3-* (B ;;uDiinU » - iCT*)fl-t i 
pop. 1* Polymorphic Outer Meaoranrf Protma C Family 



frame-shift with 0010 



pmp_2- Polymorphic Outer Membrane Protein C family 
pmp_3 -Polymorphic Outer Membrane Protein G Family 
pap.3*PMP_) (frame-shift with 0014) 
pmp_4- Polymorphic Outer Membrmne Protein C Family 
pnp_4-FMF_4 (frame -shl ft with 0016) 
pmp_S- Polymorphic Outer Membrane Protein G Family 
pmp_S-PMP_S (frame-shift with 0018) 
Predicted OHP (leader (14) peptides outer ■ 
Predicted OKP (leader (19) peptide] -(CT3 SO) 
maf-(CT349) 

yjjX/alr-ABC Transporter Protein ATPaee- (CT348) 
xerC-Xncegrasa/rscombinase- (CT347) 
elaC/atsA-Sttlphohydrolaee/Glycoeulfataee- ( CT346) 
CT345 hypothetical protein* (CT345) 
lon-Uon ATF-dependent Protease- (CT3 44) 



•ane]-(CT351) 



gcp.l-O-Sialoglyeeprotoln tndopeptidese_i-(CT343) 
rs21-S21 Ribosomal Protein- (CT3 42) 
dnaJ-Keat Shock Protein J-(CT341) 

pdhAAB/odbAfcodbB- (pyruvate) Oxolsovalerate Dehydrogenase Alpha 6 Beta 
Fusion- (CT340) 

CT339 hypothetical protein 

CT338 hypothetical protein 

ptsH-PT5 phoephocarrier Protein Kpr-(CT337) 

ptsZ-PTS PEP Phosphotransferase- (CT33 6) 

ybafi-(CT335) 

dnaX_i-DNA Pol III Gamma and Tau_i-(CT334) 



•yqfF-8s conserved hypothetical ZM protein 



hemC-Porphobilinogen Deaminase- (CT299) 
sots-Sms Protein- (CX298) 
rnc-Ribonuclease IIX-ICT297) 
CT296 hypothetical protein 
. mrsA-Phosphoaannorautase- (CT29S) 
aodM- Superoxide Dismutase (Mn) - (CT294J 
accD-AcCoA Carboxylase /Trans fa rase Beta-(CT293) 
dut*dUTP Nucleotidohydrolase-(CT292) 
ptaN.l-PTS IIA Protein- (CT291) 

ptsN_2-PTS IIA Protein ♦ KTH DMA-Binding Oomein-(CT290J 
CT28? hypothetical protein 



hypothetical protein

CPn0067  829S3   84053   F
CPn0068  64903   84331   R
CPn0069  85236   67086   F
CPn0070  87378   67208   R
CPn0071  88045   87599   R
CPn0072  89061   86057   R
CPn0073  89356   89574
CPn0074  89774   90955
CPn0075  91102   91350   _
CPn0076  91358   91903   _
CPn0077  92013   92435   _
CPn0078  92465   93160   _
CPn0079  93179   93688
CPn0080  93735   94121   -
CPn0081  94261   98016   _
CPn0082  98043   102221  -
CPn0083  102332  103312  -
CPn0084  103362  103751  _
CPn0085  104506  103766
CPn0086  104904  105527  _
CPn0087  105579  106376  _
CPn0088  106373  106145  _
CPn0089  108153  109466  _
CPn0090  109454  110060  _
CPn0091  110074  112053
CPn0092  112151  112573
CPn0093  112509  113015  _
CPn0094  113152  115971  _
CPn0095  116037  118790  _
CPn0096  124314  118837
CPn0097  124555  126006  _
CPn0098  127491  126091
CPn0099  127593  127865  F
CPn0100  129141  127882  R
CPn0101  129932  129141  R
CPn0102  130123  131466  F
CPn0103  131480  132511
CPn0104  133875  132676  R
CPn0105  134847  134029  R
CPn0106  135091  136374  F
CPn0107  137162  136392  R
CPn0108  137857  137303  R
CPn0109  138655  141783  F
CPn0110  143734  141827  R
CPn0111  144686  143934  R
CPn0112  144767  145093  F
CPn0113  145335  146405  F
CPn0114  146398  147261  F
CPn0115  147279  148622  F
CPn0116  148616  148972  F
CPn0117  148989  150071  F
CPn0118  150102  150464  F
CPn0119  150523  151164  F
CPn0120  151164  151778  F
CPn0121  151778  152068  F
CPn0122  152071  153723  F
CPn0123  155969  153774  R
CPn0124  156614  158068  F
CPn0125  158096  158605  F
CPn0126  158809  161085  F
CPn0127  162143  161130  R
CPn0128  162277  163053  F
CPn0129  163717  163064  R
CPn0130  164245  163751  R
CPn0131  164549  165580  F
CPn0132  165587  166561  F
CPn0133  167334  166564  R
CPn0134  169096  167467  R
CPn0135  169448  169143  R
CPn0136  171401  169569  R
CPn0137  172254  171502  R
CPn0138  174019  172700  R






CT360 hypothetical protein
CT325 hypothetical protein
CT324 hypothetical protein
infA-Initiation Factor IF-1-(CT323)
tufA-Elongation Factor Tu-(CT322)
secE-Preprotein Translocase-(CT321)
nusG-Transcriptional Antitermination-(CT320)
rl11-L11 Ribosomal Protein-(CT319)
rl1-L1 Ribosomal Protein-(CT318)
rl10-L10 Ribosomal Protein-(CT317)
rl7-L7/L12 Ribosomal Protein-(CT316)
rpoB-RNA Polymerase Beta-(CT315)
rpoC-RNA Polymerase Beta'-(CT314)
tal-Transaldolase-(CT313)
predicted ferredoxin-(CT312)
CT311 hypothetical protein
atpE-ATP Synthase Subunit E-(CT310)
CT309 hypothetical protein
atpA-ATP Synthase Subunit A-(CT308)
atpB-ATP Synthase Subunit B-(CT307)
atpD-ATP Synthase Subunit D-(CT306)
atpI-ATP Synthase Subunit I-(CT305)
atpK-ATP Synthase Subunit K-(CT304)
CT303 hypothetical protein
valS-Valyl tRNA Synthetase-(CT302)
pknD-S/T Protein Kinase-(CT301)
uvrA-Excinuclease ABC Subunit A-(CT333)
pyk-Pyruvate Kinase-(CT332)
htrB-Acyltransferase-(CT010)
CT011 hypothetical protein
ybbP family hypothetical protein-(CT012)
cydA-Cytochrome Oxidase Subunit I-(CT013)
cydB-Cytochrome Oxidase Subunit II-(CT014)
CT017 hypothetical protein
CT016 hypothetical protein
phoH-ATPase-(CT015)
CT058 hypothetical protein_1
CT018
ileS-Isoleucyl-tRNA Synthetase-(CT019)
lepB-Signal Peptidase I-(CT020)
CT021 hypothetical protein
rl31-L31 Ribosomal Protein-(CT022)
pfrA-Peptide Chain Releasing Factor (RF-1)-(CT023)
hemK-A/G specific methylase-(CT024)
ffh-Signal Recognition Particle GTPase-(CT025)
rs16-S16 Ribosomal Protein-(CT026)
trmD-tRNA (guanine N-1)-Methyltransferase-(CT027)
rl19-L19 Ribosomal Protein-(CT028)
rnhB_1-Ribonuclease HII_1-(CT029)
gmk-GMP Kinase-(CT030)
CT031 hypothetical protein
metG-Methionyl-tRNA Synthetase-(CT032)
recD_1-Exodeoxyribonuclease V (Alpha Subunit)_1-(CT033)

ytfF-Cationic Amino Acid Transporter-(CT034)
bpl1-Biotin Protein Ligase-(CT035)
similarity to CT036

CHLPS hypothetical protein-(CT109)
groEL_1-HSP-60_1-(CT110)
groES-10KDa Chaperonin-(CT111)
pepF-Oligopeptidase-(CT112)
ybgI-ACR family-(CT108)

hemL-Glutamate-1-semialdehyde-2,1-aminomutase-(CT210)

CPn0139


174636 


174093 


R 


yqgE- (CT210) 


CF&0140 


1731X0 


174673 




yqdl- (CT2121 


CP&8141 


X7S802 


179110 


R 


__4 k HI _ ^ » ft • - ■ _ — m _ / «■« 111 

rpiA-Rl0099*3~r ISOBerase A-(CT214i 


CPB0142 


176091 


179816 


R 




cfh0143 


177335 


176214 


R 


*yxjG_Bs_l Hypothetical Protein 


CPn0144 


177963 


180960 


F 


mm 7* m& A 9 _ — - _ ^ _ ^ p. S9a99.» M A f 9) 9j 1 

Clpfl-ClP PTOC99S9 ATP4S9- iCTll J ) 


CFn0145 


180777 


182369 


F 


CT114 hypothetical protein 


CPn0146 


1826X3 


183093 


F 




CPn0147 


183225 


183671 


F 




CFn0146 


183646 


183702 


F 


PKni-s/T protein Kinase- tcri43j 


CPn0149 


189715 


187700 


r 


anjJ-ONA LigaSS- (CT146 1 


CP&0190 


187834 


192444 


F 


CT147 nypotneticei protein 


cpboisi 


194142 


192625 


R 


nhpA*ttonooxygenase* (CT148 ) 


CFB01S2 


X99265 


194318 


R 


CT149 hypothetical protein 


TPB0133 


199433 


197692 


F 


1 9115- 1*9 UCyi CKNA Syntnetase* (GT2U9 f 


CP&0154 


197892 


199202 


F 


gseA*XD0 Trans fartee- ( CT208 ) 


cfboiss 


199691 


199488 


R 




CPB01S6 


2Q0117 


199770 


R 




CP&01S7 


200723 


200298 


R 




CFO01S8 


20X430 


200694 


R 




CPnQ159 


20X772 


201467 


R 




CFn0160 


203791 


202127 


R 


pckA^i-p ructose~o-P Phosphotransf erese — l-iCT207) 


CP&0161 


204622 


203798 


R 


predicted acyl transferase family- (CT206) 


CFB0162 


203826 


204803 


R 




CP&0X63 


206026 


206394 


P 




CPnOX64 


206498 


206896 


F 




CPn0163 


206996 


207982 


F 




CP&0X66 


207630 


207962 


P 




CFnOX67 


208306 


207977 


R 




cpnoiee 


208641 


208417 


R 




CPnOX69 


209501 


208710 


R 




Cfn0170 


21X026 


91 

ZX00Z5 


R 




CP&0X71 


2X2435 


91 1 1 iO 

211199 


R 


*guaA*<5K? Synthase 


CFB0172 


2X3X77 


212440 


R 


guaji/ UnpP- Xnosine 5 -iBonopnoipiiA*e Qenyorovanasc iwuim' 










only) 


CP&0173 


91 Iflll 




p 

K 




CPn017 4 


91 

214237 


214724 


F 




Cravl79 


*i*Byo 


213275 


r 




CFB9170 


2132B0 


91 ft* if) 


r 


^^*1ft1 h«%««h>M^h 4 a» 1 •* 

WT13J nypocsvcicoJi prDcsui 


urnux / / 


217A5Q 


218COB 






wniwi/v 




217789 












Q 




brnUMV 


91 Jta«1 


2163S5 


B 

A 






219175 


218777 






rpnfiin? 




219334 










220645 


p 






231775 


221221 




■ifn larlfHiHieinn pa rf nr P 1-/CT1I3I 




222451 


221765 








222199 


224066 




•■inilaplfv m rm Twcl 1 - f CT119 ^ 
llUiinCjr to V{#* iHw^i IWAiAyj 


rpnOlfl? 


224248 


225045 








2291X1 


226400 








226400 


229825 




^¥131 hBmlnn* f Dsiaiblfl frmii—iwyiw PrQCAln) 


/■on At gn 


229919 


231274 






CP&0191 


23X991 


23X3X4 




B*liiA.>&sr^ iHina jVeltl Trim T*g • ATPjus*(CT130] 


CFn0192 


232634 


23X964 




oln9*ASC Acid Tranrooreex Pentue* (CT129) 

yaMr^arvavW AiJIAjUV ff9i*li • *w9w49>pn#* 9bi9j»* 4T ^»99»»9»»» 9* 


CPB0193 


233X26 


233686 




*argR-Axginin« MpruKor 


CPB0X94 


2332X0 


234241 




9Cp_2»0-Si*lo?lycoprotein Indop«pcid-v»9_2- (CT197) 


CPD019S 


234X90 


233785 




oppA^l-Olivopftpcid* Binding Proccia.1 


CP&0X96 


235939 


237519 




opp\_2 -0ligop«pcid4 Binding Prot9in_2* (CT19B) 


CP&0197 


237978 


238662 




oppA_3-01igop9ptid9 Binding Prot9in_3 


CPB0198 


239X69 


240746 




oppA.41 -011gop«ptida Binding Protciia.4 


CPB0199 


24X042 


24X983 




OTOBU. -Oligopeptide PerBeaee.1- (CT199J 


CPB0200 


2420X7 


242868 




oppCLl -Oligopeptide Pexaease_l- (CT200J 


CPB0201 


242864 


243715 




oppO-Oligopeptide Transport A7Pese*<CT201) 


CAB0202 


243715 


244900 




oppF •Oligopeptide Transport ATPase- (CT202) 


CPI10203- 


243008 


249602 






CPn0204 


249817 


246002 






CPn0205 


246133 


246327 






Cffn0206 


246409 


247X61 




CT203 hypothetical protein 


CPn0207 


247208 


248617 




ybhl/sodiTl-Oxoglutarsce/Malate Tranaiocator- {CT204) 


CPa0208 


248953 


250602 




ptkX.2- Fructose- 6- P Ftiosphotrana£9reae_2- (CT20S) 


CPn0209 


291036 


251272 







(COOH-teraiaal ngioc
CPn0210


9 C9 Oil 


251440 


R 


CPn02 1 1 


<3< f 3D 


*3«4 b J 


R 


CP&0212 




* 34000 


R 


CPn0213 


* 34 J4« 


9 ti i an 
434X7Q 


R 


CPa0214 


* 3303 / 


434440 


R 


CPn02l5 


23/WX3 


255759 


R 


CPa0216 


4 3 /OOO 


257174 


R 


CPn0217  2S7896  258579  F
CPn0218  259058  258582  R
CPn0219  259357  260472  F
CPn0220  260696  261238  F
CPn0221  261657  262064  F
CPn0222  262504  262842  F
CPn0223  262956  263333  F
CPn0224  263435  263674  F
CPn0225  263873  264541  F
CPn0226  264566  264967  F
CPn0227  265416  265009  R
CPn0228  266110  265412  R
CPn0229  266326  267560  F
CPn0230  268253  267576  R
CPn0231  268957  268253  R
CPn0232  270122  269232  R
CPn0233  270424  270248  R
CPn0234  271240  270548  R
CPn0235  271416  272177  F
CPn0236  272156  273766  F
CPn0237  273762  274214  F
CPn0238  274303  275838  F
CPn0239  275899  276672  F
CPn0240  277861  276698  R
CPn0241  279354  278203  R






279487 


R 


UXT1U44 J 


40U333 


280133 


R 






281556 


F 




•B10I3 


282499 


F 




*OA73« 


282551 


R 


CD«rt7J7 


*DJ413 


282969 


R 


WCT1U440 




263650 


R 




9 BKDJ 1 
«S3S41 


264333 


R 




Z00U3' 


285902 


R 




286060 


287559 


F 


wrnu*3« 




287576 


R 


UJraU«3J 


Z0043O 


287950 


R 


wrnu« 3 4 


289262 


28B4S9 


R 




29U1D3 


289329 


R 




4S1Z04 


290396 


R 


CPn.0257 


9041 


291267 


R 


CPn02 5 A 

Va/UWi 3 O 


* 9*3 J 4 


292133 


R 


CPn0259 




49Z441 


R 


CPn0260 


294045 


*SrJ34o 


R 


CPn0261 


294302 


* 3F3W J J 


F 


CPn0262 


295091 


*?3 9 J J 


F 


CPn0263 


296249 


**/ X JO 


F 


CPn0264 


297730 




R 


Cpn0265 


298620 


20771ft 
*J ' / J U 


R 


CPn0266 


299184 


*77D / 0 


F 


CPn0267 


300122 




F 


CPn0268 


300935 




F 


CPn0269  302450  301476  R
CPn0270  303325  302466  R
CPn0271  303634  304362  F
CPn0272  305233  304340  R
CPn0273  305844  305227  R
CPn0274  308353  305852  R
CPn0275  310786  308372  R
CPn0276  311137  310793  R
CPn0277  311910  311404  R
CPn0278  312875  312060  R
CPn0279  313537  312875  R
CPn0280  314572  313550  R



ypdP-(CT140) 

tgt-Queuine tRNA Ribosyl Transferase-(CT193)
*weak similarity to Bacteriophage CHP1 (Orf4)



dsbB-Disulfide Bond Oxidoreductase-(CT176)
dsbG-Disulfide Bond Chaperone-(CT177)
CT178 hypothetical protein 
CT179 hypothetical protein 

tauB-ABC Transport ATPase (Nitrate/Fe)-(CT180)
*similarity to 5'-Methylthioadenosine/S-Adenosylhomocysteine Nucleosidase

CT181 hypothetical protein 

kdsB-deoxyoctulonosic Acid Synthetase-(CT182)

pyrG-CTP Synthetase-(CT183)

yggF Family-(CT184)

zwf-Glucose-6-P Dehydrogenase-(CT185)

devB-Glucose-6-P Dehydrogenase (DevB family)-(CT186)



adk-Adenylate Kinase-(CT128)

ydhO-Polysaccharide Hydrolase-Invasin Repeat Family-(CT127)

rs9-S9 Ribosomal Protein-(CT126)

rl13-L13 Ribosomal Protein-(CT125)

ycfV/ybbA-ABC Transporter ATPase-(CT152)

CT151 hypothetical protein

rl33-L33 Ribosomal Protein-(CT150)

*conserved hypothetical protein

CT144 hypothetical protein (frame-shift with 0253?)

CT144 hypothetical protein_1

CT143 hypothetical protein_1

CT142 hypothetical protein_1

CT144 hypothetical protein_2

CT143 hypothetical protein_2

CT142 hypothetical protein (frame-shift with 0259?)

CT142 hypothetical protein_2

secA_1-Protein Translocase Subunit_1-(CT141)

ydaO-PP-Loop Superfamily ATPase-(CT217)

surE-SurE-like Acid Phosphatase-(CT216)

yqfU hypothetical protein-(CT221)

ubiD-Phenylacrylate Decarboxylase-(CT220)

ubiA-Benzoate Octaprenyltransferase-(CT219)



Dipeptidase-(CT138) 

ywlC-Sua5 Superfamily-related Protein-(CT137)
Lysophospholipase esterase-(CT136)
dnaX_2-DNA Pol III Gamma and Tau_2-(CT187)
tdk-Thymidylate Kinase-(CT188)
gyrA_1-DNA Gyrase Subunit A_1-(CT189)
gyrB_1-DNA Gyrase Subunit B_1-(CT190)
CT191 hypothetical protein

*conserved outer membrane lipoprotein protein
*Possible ABC Transporter Permease Protein
dppF-Dipeptide Transporter ATPase-(CT689)
CPn0281

CPn0282 

CPn0283

CPn0284 

CPn0285 

CPn0286

CPn0287 

CPn0288 

CPn0289 

CPn0290

CPn0291 

CPn0292 

CPn0293 

CPn0294 

CPn0295 

CPn0296 

CPn0297 

CPn0298 

CPn0299 

CPn0300 

CPn0301 

CPn0302 

CPn0303 

CPn0304

CPn0305 

CPn0306 

CPn0307 

CPn0308

CPn0309

CPn0310 

CPn0311 

CPn0312 

CPn0313 

CPn0314

CPn0315 

CPn0316

CPn0317 

CPn0318

CPn0319 

CPn0320 

CPn0321 

CPn0322 

CPn0323 

CPn0324 

CPn0325 

CPn0326

CPn0327 

CPn0328 

CPn0329 

CPn0330 

CPn0331

CPn0332 

CPn0333 

CPn0334 

CPn0335 

CPn0336 

CPn0337

CPn0338

CPn0339 

CPn0340 

CPn0341 

CPn0342

CPn0343 

CPn0344 

CPn0345 

CPn0346 

CPn0347 

CPn0348 

CPn0349 

CPn0350 

CPn0351 



315057 

316126 

318497 

319045 

320595 

322059 

324221 

325716 

325812 

327042 

328667 

329228 

329949 

333092 

333863 

334765 

335697 

336721 

336816 

337783 

340250 

340787 

342958 

343133 

344154 

345145 

346986 

349234 

350974 

353433 

354438 

354524

354990 

356285 

356977 

358820

360081 

362767 

363175 

363860 

365858 

366249 

36733X 

369492 

370708 

371148 

372945 

373241 

375088

376675 

378437 

378655 

379090 

379311 

379817 

380650 

382027 

382278 

383420 

383842 

384160 

384622 

384999

387420 

388572 

389675 

391021 

391803 

392770 

393181 

393888 



316103 

317529 

317532 

318551 

319051 

320650 

322089 

324571 

326996 

328523 

329194 

329836 

332723 

333502 

333627 

334022 

334774 

335717 

337415 

340152 

340762 

341866 

341921 

344158 

345137 

346431 

346515

349596 

349595 

351049 

353575 

354976 

355355 

355353 

358716 

360121 

362750 

363126 

363879 

364783 

364767 

367328 

369460 

370688 

371148 

372725 

373211 

374992 

376146 

376202 

376701 

378536 

378800 

379823 

380674 

381591 

381575 

383375 

384034 

384156 

384495 

385062 

385595 

385558 

387436 

388704 

389678 

391027 

391790 

393684 

395432 



r 

R 
R 
R 
R 
R 
R 
F 
F 
F 
F 
F 
F 
P 
R 

It 

r 
i 
v 

V 
Ii 
I 
I 
I 

1; 
r 

R 
R 
R 
F 
F 
R 



R 
R 
R 
R 
F 
F 

r 

R 
F 
F 
F 
F 
F 
F 
R 
R 
R 
R 
R 
R 
F 
T 



dhnA-Predicted 1,6-Fructose Bisphosphate Aldolase (dehydrin family)-(CT215)

xasA/gadC-Amino Acid Transporter-(CT216)



mgtE-Mg** Transporter (CBS Domain) - (CT194) 
CT195 hypothetical protein 

aaaT -Neutral Amino Acid (Glutamate) Transporter- (CT230) 

Na-dependent Transporter-(CT231)

incB-Inclusion Membrane Protein B-(CT232)

incC-Inclusion Membrane Protein C-(CT233)

CT234 hypothetical protein

cAMP-Dependent Protein Kinase Regulatory Subunit-(CT235)

acpP-Acyl Carrier Protein-(CT236)

fabG-Oxoacyl (Carrier Protein) Reductase-(CT237)

fabD-Malonyl Acyl Carrier Transacylase-(CT238)

fabH-Oxoacyl Carrier Protein Synthase III-(CT239)

recR-Recombination Protein- (CT240) 

yaeT-Omp85 Analog- (CT241) 

(OmpH-Like Outer Membrane Protein) - (CT242) 

lpxD-UDP Glucosamine N-Acyltransferase-(CT243)

CT244 hypothetical protein

pdhA/odpA-Pyruvate Dehydrogenase Alpha-(CT245)
pdhB/odpB-Pyruvate Dehydrogenase Beta-(CT246)
pdhC-Dihydrolipoamide Acetyltransferase-(CT247)
glgP-Glycogen Phosphorylase-(CT248)
similarity to CT249

dnaA_1-Replication Initiation Protein_1-(CT250)

60IM-60kDa Inner Membrane Protein-(CT251)

lgt-Prolipoprotein Diacylglycerol Transferase-(CT252)

CT101 hypothetical protein 

acpS-Acyl -carrier Protein Synthase- (CT100) 

trxB-Thioredoxin Reductase- (CT099) 

rs1-S1 Ribosomal Protein-(CT098)

nusA-N Utilization Protein A-(CT097)

infB-Initiation Factor-2-(CT096)

rbfA-Ribosome Binding Factor A-(CT095)

truB-tRNA Pseudouridine Synthase-(CT094)

ribF-FAD Synthase-(CT093)

ychF-GTP Binding Protein-(CT092)

yscU-Yops Translocation Protein U-(CT091)

lcrD- Low Calcium Response D-(CT090) 

lcrE- Low Calcium Response E- (CT089) 

sycE-Secretion Chaperone- (CT088) 

malQ-Glucanotransferase-(CT087)

rl28-L28 Ribosomal Protein-(CT086)

CT085 hypothetical protein

Phospholipase D Superfamily [leader (33) peptide]-(CT084)

CT083 hypothetical protein 

CT082 hypothetical protein 

CHLTR T2 Protein-(CT081)

ltuB-(CT080)

CT079 similarity

folD-Methylene Tetrahydrofolate Dehydrogenase-(CT078)
yojL-(CT077)

smpB-Small Protein B-(CT076)
dnaN-DNA Pol III (beta chain)-(CT075)
recF-ABC superfamily ATPase-(CT074)
(frame-shift with 0339)
(frame-shift with 0340)

predicted OMP [leader (19) peptide]-(CT073)
(frame-shift with 0342?)
yaeL-Metalloprotease-(CT072)
yaeM-(CT071)

troD/ytgD-Integral Membrane Protein-(CT070)
troC/ytgC-Integral Membrane Protein-(CT069)
troB/ytgB-ABC transporter ATPase-(CT068)
troA/ytgA-Solute Protein Binding Family-(CT067)
CT066 hypothetical protein
adt_1-ADP/ATP Translocase_1-(CT065)
CPn0352

CPn0353 

CPn0354 

CPn0355 

CPn0356

CPn0357 

CPn0358 

CPn0359 

CPn0360 

CPn0361 

CPn0362 

CPn0363 

CPn0364 

CPn0365 

CPn0366

CPn0367

CPn0368 

CPn0369 

CPn0370 

CPn0371 

CPn0372 

CPn0373 

CPn0374

CPn0375 

CPn0376 

CPn0377 

CPn0378

CPn0379 

CPn0380

CPn0381

CPn0382 

CPn0383 

CPn0384 

CPn0385 

CPn0386

CPn0387

CPn0388 

CPn0389 

CPn0390 

CPn0391 

CPn0392 

CPn0393 

CPn0394 

CPn0395 

CPn0396 

CPn0397 

CPn0398

CPn0399 

CPn0400 

CPn0401

CPn0402

CPn0403

CPn0404 

CPn0405

CPn0406 

CPn0407 

CPn0408 

CPn0409 

CPn0410

CPn0411 

CPn0412 

CPn0413 

CPn0414 

CPn0415 

CPn0416 

CPn0417 

CPn0418 

CPn0419 

CPn0420 

CPn0421 

CPn0422 

CPn0423 



395574 

396893 

397167 

399669 

400459 

401317 

401751 

403012 

405358 

406647 

407635 

409688 

409966 

410528 

411976 

413102 

413790 

414351 

415600 

417147 

417687 

418380 

420218 

421121 

421854 

423438 

426166 

426322 

426756 

429809 

430749 

431693 

432377 

434016 

434525 

435196 

435329 

438134 

439144 

439692 

439814 

440379 

440736 

441964 

444353 

445115 

445533 

445879 

446536 

447884 

448994 

449015 

450887 

451739 

451969 

453742 

454105 

454645 

455123 

455833 

456590 

459203 

460143 

461498 

461856 

463035 

464401 

466834 

467106 

467998 

468242 

468791 



396830 

397135 

398507 

398591 

400109 

400469 

401578 

403817 

403922 

405382 

407055 

407943 

410238 

411544 

412440 

413636 

414107 

415562 

416912 

417503 

418001 

420218 

420961 

421615 

422294 

422347 

423445 

426765 

427876 

428037 

430036 

430749 

431862 

432522 

434046 

434699 

437320 

437319 

438134 

439510 

440383 

440723 

441968 

443175 

443241 

444381 

445700 

446523 

447306 

447495 

447888 

449710 

449871 

450966 

452865 

452856 

454561 

455127 

455833 

456609 

457246 

457227 

459172 

460221 

461557 

462244 

462953 

464676 

466824 

467108 

468784

469216 



r 

F 
F 
R 
R 
R 
R 
F 
R 
R 
R 
R 
F 
F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

F 

R 

R 

F 

F 

R 

R 

R 

R 

R 

R 

R 

F 

R* 

R 

R 

F 

F 

F 

F 

R 
R 
F 
F 
F 
R 
R 
F 
R 
R 
F 
R 
F 
F 
F 
F 
F 
R 
R 
R 
R 
R 
R 
R 
R 
R 

r 

F 



lepA-GTPase-(CT064)

gnd-6-Phosphogluconate Dehydrogenase-(CT063)
tyrS-Tyrosyl tRNA Synthetase-(CT062)
fliA/rpsD-Sigma-28/WhiG Family-(CT061)
flhA-Flagellar Secretion Protein-(CT060)
fer4-Ferredoxin IV-(CT059)



CT058 hypothetical protein_2 
CT058 hypothetical protein_1



gcpE-(CT057) 

CT056 hypothetical protein 



sucB_1-Dihydrolipoamide Succinyltransferase_1-(CT055)
sucA-Oxoglutarate Dehydrogenase-(CT054)
CT053 hypothetical protein

hemN_1-Coproporphyrinogen III Oxidase_1-(CT052)
CT326 similarity

yabC/yraL-SAM-Dependent Methyltransferase-(CT048)
CT047 hypothetical protein
hctB-Histone-like Protein 2-(CT046)
pepA-Leucyl Aminopeptidase A-(CT045)
ssb-SS DNA Binding Protein-(CT044)
CT043 hypothetical protein 

glgX-Glycogen Hydrolase (debranching)-(CT042)
CT041 hypothetical protein
ruvB-Holliday Junction Helicase-(CT040)

dcd-dCTP Deaminase-(CT039)
CT038 hypothetical protein

tlyC_1-CBS Domain protein (Hemolysin Homolog)_1-(CT256)
CT257 hypothetical protein
yhfO-NifS-related protein-(CT258)
PP2C phosphatase family-(CT259)

CT253 hypothetical protein 
CT254 hypothetical protein 
CT255 hypothetical protein 
mutY-Adenine Glycosylase-(CT107)

yceC-predicted pseudouridine synthetase family-(CT106)
CT105 hypothetical protein

fabI-Enoyl-Acyl-Carrier Protein Reductase-(CT104)

HAD superfamily hydrolase/phosphatase-(CT103)

CT102 hypothetical protein 

CT260 hypothetical protein 

dnaQ_1-DNA Pol III Epsilon Chain_1-(CT261)

CT262 hypothetical protein 

CT263 hypothetical protein 

msbA-Transport ATP Binding Protein-(CT264)

accA-AcCoA Carboxylase/Transferase Alpha-(CT265)

CT266 hypothetical protein

himD/ihfA-Integration Host Factor Alpha-(CT267)

amiA-N-Acetylmuramoyl Alanine Amidase-(CT268)

murE-N-Acetylmuramoylalanylglutamyl DAP Ligase-(CT269)

pbp3-transglycolase/transpeptidase-(CT270)

CT271 hypothetical protein

yabC-PBP2B Family methyltransferase-(CT272)

CT273 hypothetical protein 

CT274 hypothetical protein 
CPn0424


469612 


470961 


F 


CPn0425


470980 


471564 


F


CPn0426 


472111 


471536 


R


CPn0427 


472207 


473715 


F


CPn0428 


473722 


474681 


F


CPn0429 


474681 


475319 


F 


CPn0430


475326 


476093 


F


CPn0431 


476483 


476151 


R 


CPn0432 


476816 


476514 


R 


CPn0433 


477273 


476929 


R 


CPn0434 


479462 


477276 


R 


CPn0435 


480902 


479475 


R 


CPn0436


481618 


480902 


R 


CPn0437 


481816 


484350 


F 


CPn0438 


485416 


484334 


R 


CPn0439 


485553 


486077 


F 


CPn0440 


486105 


486740 


F 


CPn0441 


486891 


487638 


F 


CPn0442 


488013 


488528 


F 


CPn0443


488729


489979 


F 


CPn0444


490287 


494507 


F 


CPn0445


494772 


497579 


F 


CPn0446 


497626 


500415 


F 


CPn0447 


500566 


503351 


F 


CPn0448 


504810 


503698 


R 


CPn0449 


507231 


505330 


R 


CPn0450


508112 


507180 


R 


CPn0451


508275


511058 


F 


CPn0452


511319 


512860 


F 


CPn0453 


513234 


516152 


F 


CPn0454


516182 


519115 


F 


CPn0455 


520346 


519456 


R 


CPn0456 


521532 


520327 


R 


CPn0457 


523865 


522120 


R 


CPn0458


526320 


524236 


R 


CPn0459 


527005 


526619 


R 


CPn0460 


527840 


526992 


R 


CPn0461 


528638 


527844 


R


CPn0462


531052 


529037 


R 


CPn0463 


532357 


531191 


R 


CPn0464


532842 


532366 


R 


CPn0465 


533212 


532871 


R 


CPn0466 


533724 


536537 


F 


CPn0467 


536633 


539434 


F 


CPn0468 


539632 


540432 


F 


CPn0469 


540399 


541460 


F 


CPn0470 


541357 


542532 


F 


CPn0471 


542564 


545401 


F 


CPn0472 


547905 


545581 


R 


CPn0473 


549593 


548070 


R 


CPn0474 


551573 


549807 


R 


CPn0475 


553844 


551685 


R


CPn0476 


554844 


553858 


R 


CPn0477 


556106 


554844 


R 


CPn0478 


557625 


556210 


R 


CPn0479 


558425 


557616 


R 


CPn0480 


559303 


558650 


R 


CPn0481


560946 


559339 


R 


CPn0482 


561737 


560961 


R 


CPn0483 


561836


564964 


F 


CPn0484


564970 


565824 


F 


CPn0485


566036 


566229 


F


CPn0486 


567784 


566405


R 


CPn0487 


569740 


568112 


R 


CPn0488 


570096 


569767 


R 


CPn0489 


570965 


570096 


R 


CPn0490 


571279 


573333 


F 


CPn0491 


574352 


573336 


R 


CPn0492 


574652 


574804 


F 


CPn0493 


575004 


574855 


R 


CPn0494


575364 


575146 


R 


CPn0495 


575603 


576793 


F



dnaA_2-Replication Initiation Factor_2-(CT275)
CT276 hypothetical protein
CT277 similarity

nqr2-NADH (Ubiquinone) Dehydrogenase-(CT278)
nqr3-NADH (Ubiquinone) Oxidoreductase, Gamma-(CT279)
nqr4-NADH (Ubiquinone) Reductase 4-(CT280)
nqr5-NADH (Ubiquinone) Reductase 5-(CT281)



gcsH-Glycine Cleavage System H Protein-(CT282)
CT283 hypothetical protein

Phospholipase D superfamily [uncleavable leader peptide]-(CT284)
lplA-Lipoate Protein Ligase-Like Protein-(CT285)
clpC-ClpC Protease-(CT286)
ycbF-PP-loop superfamily ATPase-(CT287)



CT007 hypothetical protein 
CT006 hypothetical protein 
CT005 hypothetical protein

pmp_6- Polymorphic Outer Membrane Protein G/I Family 

pmp_7- Polymorphic Outer Membrane Protein ts Family 

pmp_8- Polymorphic Outer Membrane Protein G Family 

pmp_9- Polymorphic Outer Membrane Protein G/I Family 

*yxjG_Bs_2 Hypothetical Protein 

pmp_10-PMP_10 (Frame-shift with 0451)

pmp_10-Polymorphic Outer Membrane Protein G Family

pmp_11-Polymorphic Outer Membrane Protein G Family

pmp_12-Polymorphic Outer Membrane Protein A/I Family (truncated)

pmp_13-Polymorphic Outer Membrane Protein G Family

pmp_14-Polymorphic Outer Membrane Protein H Family



pmp_15-Polymorphic Outer Membrane Protein E Family

pmp_16-Polymorphic Outer Membrane Protein E Family

pmp_17-Polymorphic Outer Membrane Protein E Family

pmp_17-Polymorphic Outer Membrane Protein (Frame-shift with 0469)

pmp_17-Polymorphic Outer Membrane Protein (Frame-shift with 0470)

pmp_18- Polymorphic Outer Membrane Protein E/F Family 



CT365 hypothetical protein 
glgB-Glucan Branching Enzyme-(CT866)
CT865 hypothetical protein
•yq*V_Bs Hypothetical Protein 
hflX-GTP Binding Protein-(CT379)
phnP-Metal Dependent Hydrolase-(CT380)
CT383 hypothetical protein 

artJ-Arginine Periplasmic Binding Protein- (CT381) 

aroG-Deoxyheptonate Aldolase-(CT382)
CT382.1 hypothetical protein
*hypothetical proline permease
CT384 hypothetical protein
hitA-HIT Family Hydrolase-(CT385)
CT386 hypothetical protein
CT387 hypothetical protein
CT389 hypothetical protein 



aspC-Aspartate Aminotransferase-(CT390)
CPn0496


576793 


577812 


F


CT391 hypothetical protein


CPn0497 


578089 


577820 


R 


CT386 hypothetical protein 


CPn0498 


579035 


578085 


R 




CPn0499 


580399 


579205 


R 




CPn0500


580659 


582362


F


proS-Prolyl tRNA Synthetase-(CT393)


CPn0501


582457 


583690 


F 


hrcA-HTH Transcriptional Repressor-(CT394)


CPn0502


583650 


584201 


F


grpE-HSP-70 Cofactor-(CT395)


CPn0503 


584234 


586213 


F


dnaK-HSP-70-(CT396)


CPn0504 


586487


588514 


F 


vacB-ribonuclease family-(CT397)


CPn0505


588519 


589106 


F 


3-methyladenine DNA glycosylase


CPn0506 


589172 


589840 


F 


CT421 hypothetical protein 


CPn0507 


589961 


590122 


F 


CT421.1 hypothetical protein 


CPn0508


590142 


590300 


F 


CT421.2 hypothetical protein 


CPn0509


590335 


590808 


F 


(predicted Metalloenzyme)-(CT422)


CPn0510


590813 


591973


F 


tlyC_2-CBS Domains (Hemolysin homolog)_2-(CT423)


CPn0511


592141 


592488 


F 


rsbV_1-Sigma Regulatory Factor_1-(CT424)


CPn0512


592553


594412 


F 


CT425 hypothetical protein


CPn0513


594647 


595753 


F 


Fe-S oxidoreductase_1-(CT426)


CPn0514


595729


596520 


F 


CT427 hypothetical protein 


CPn0515


596492 


597181 


F 


ubiE-Ubiquinone Methyltransferase-(CT428)


CPn0516 


596814 


597255 


R 




CPn0517


599631 


598795 


R 




CPn0518


600803 


599832 


R 


CT429 hypothetical protein 


CPn0519


601674 


600904


R 


dapF-Diaminopimelate Epimerase-(CT430)


CPn0520


602218 


601646 


R 


clpP-CLP Protease- (CT431) 


CPn0521


603797 


602241 


R 


glyA-Serine Hydroxymethyltransferase-(CT432)


CPn0522


603987 


604655 


F


CT433 hypothetical protein 


CPn0523


604723 


605052 


F 




CPn0524 


605103


606179 


F 




CPn0525 


606522


607283 


F 


CT398 hypothetical protein 


CPn0526 


608696 


607710 


R 


yrbH-GutQ/KpsF Family Sugar-P Isomerase-(CT399)


CPn0527


609904 


608726 


R 


sucB_2-Dihydrolipoamide Succinyltransferase_2-(CT400)


CPn0528


611162 


609521 


R 


gltT-Glutamate Symport-(CT401)


CPn0529


612299 


611165 


R 


yeaH-ATPase-(CT402)


CPn0530


613254 


612460 


R 


spoU_1-rRNA Methylase_1-(CT403)


CPn0531


614069 


613245 


R 


SAM dependent methyltransferase-(CT404)


CPn0532


614674 


614075 


R 


ribC/risA-Riboflavin synthase-(CT405)


CPn0533


614930 


615385 


F


CT406 hypothetical protein 


CPn0534


615413 


615784 


F 


dksA-DnaK Suppressor-(CT407)


CPn0535


615793 


616296 


F


lspA-Lipoprotein Signal Peptidase-(CT408)


CPn0536


616345


617691 


F 


dagA_1-D-Ala/Gly Permease_1-(CT409)


CPn0537


617833 


618189 


F 


CT814.1 hypothetical protein 


CPn0538


618212 


618511 


F 


CT814 hypothetical protein 


CPn0539


618705 


621545 


F 


pmp_19-polymorphic outer membrane protein A Family-(CT412)


CPn0540


621694 


626862 


F 


pmp_20-polymorphic outer membrane protein B Family-(CT413)


CPn0541


627170 


628003 


F 


Solute binding protein (yebL-Synechocystis Adhesin Homolog)-(CT415)


CPn0542


628003 


628737 


F 


ABC Transporter ATPase-(CT416) 


CPn0543


628725 


629603 


F 


(Metal Transport Protein)-(CT417)


CPn0544


630529 


629925


R 


yhbZ-GTP binding protein-(CT418)


CPn0545


630884 


630633 


R 


rl27-L27 Ribosomal Protein-(CT419)


CPn0546


631229 


630912 


R 


rl21-L21 Ribosomal Protein-(CT420)


CPn0547 


631661 


632188 


F


ygbB family- (CT434) 


CPn0548 


633231 


632191


R 


cysJ-Sulfite Reductase-(CT435)


CPn0549


633569 


633255


R 


rs10-S10 Ribosomal Protein-(CT436)


CPn0550


635661


633580 


R 


fusA-Elongation Factor G-(CT437)


CPn0551


636168 


635698 


R 


rs7-S7 Ribosomal Protein-(CT438)


CPn0552


636567 


636219


R 


rs12-S12 Ribosomal Protein-(CT439)


CPn0553


637747 


636812 


R 




CPn0554


637894 


638141 


F 


CT440 hypothetical protein 


CPn0555


638298 


640241 


F 


tsp-Tail-Specific Protease- (CT441) 


CPn0556


640912 


640325 


R 


crpA-15kDa Cysteine-Rich Protein-(CT442)


CPn0557


642861 


641194 


R 


omcB-60kDa Cysteine-Rich Outer Membrane Complex Protein-(CT443)


CPn0558


643300 


643031 


R 


omcA-9kDa-Cysteine-Rich Outer Membrane Complex Lipoprotein-(CT444)


CPn0559


643742 


643927 


F 


CT441.1 hypothetical protein 


CPn0560


645612


644096 


R 


gltX-Glutamyl-tRNA Synthetase-(CT445)


CPn0561


646404 


645871 


R 


euo-CHLPS Euo Protein-(CT446)


CPn0562


648036 


646918 


R 


*CHLPS 43 kDa protein homolog_1


CPn0563


650096


648293


R 


recJ-ssDNA Exonuclease-(CT447)


CPn0564


654390 


650145 


R 


secD4secF- Protein Export Proteins SccD/SecF (fusion) - (CT448) 


CPn0565


655630


654533 


R 


CT449 hypothetical protein


CPn0566


656141


656890 


F 


yaeS family-(CT450)


CPn0567


656894


657817 


F 


cdsA-Phosphatidate cytidylyltransferase-(CT451)
CPn0568


637017 


658464 


F 


cdsA-Phosphatidate cytidylyltransferase-(CT452)


CPn0569


658464


659099 


F 


plsC-Glycerol-3-P Acyltransferase-(CT453)


CPn0570


659107


660789 


F 


argS-Arginyl tRNA Transferase- (CT454) 


CPn0571 


662122 


660749 


R 


murA-UDP-N-Acetylglucosamine Transferase-(CT455)


CPn0572


662332 


664616 


F 


CT456 hypothetical protein


CPn0573


663404 


664691 


R 


yebC family-(CT457)


CPn0574


663945 


665394 


R 




CPn0575


664494 


665982 


R 


yhhY-Amino Group Acetyl Transferase-(CT458)


CPn0576


667343 


666494 


R 


prfB-Peptide Chain Release Factor 2 (natural UGA frame-shift)-(CT459)


CPn0576


667596 


667330 


R 


prfB-(natural UGA frame-shift)


CPn0577


667895 


668155 




SWIB (YM74) complex protein-(CT460)


CPn0578


668406 


669365 


F 


I y n#i lyw* w**» ww i v * ^ v a § 


CPn0579


669281 


669993 


- 


<mhD/m<»M.CTiw»r tfuelaatida Phoiflhorvlni* (CT462 \ 

/ y«» / y ■yn w #vya* fvi#w*ww*wv • * ^ * ^ v • / 




669993 


670793 


- 


truA-Pseudouridylate Synthase I-(CT463)


CPn0581


671434 


670745 




Phosphoglycolate Phosphatase-(CT464)




671503 


672177 


- 




CPn0583


672400 


672717 




CT466 hypothetical protein 


CPn0584 


672707 


673798 


r 


atoS/ntrB-2-Component Sensor-(CT467)


CPn0585


675817 


673865 


j. 


* similar! tv to Cam *jieA_2 


CPn0586


676026 


677183 


f ! 


■f wf/wryf.^wwvwHf ftAou.la.eor » (CT46B 1 

• ***rw 9 M w* W » w lil* *l*Jyw*wWW» lw*™ww# 


CPn0587


677441 


678124 


F 




CPn0588


678084


678626 


j. 


CT469 hypothetical protein

w*w# "j|TVUIVb*bBA |^*wwV*## 


CPn0589


678640 


679395 




CT470 hypothetical protein 


CPn0590


680112 


679516 




CT471 hypothetical protein 


CPn0591


680373


681020 


P 


yagE family-(CT472)


CPn0592 


661X53 


681461


f 


yidD family-(CT473)


CPn0593


682476


681391 


R


CT474 hypothetical protein 


CPn0594


682583


684958 




pheT-phenylalanyl tRNA Synthetase Beta-(CT475)


CPn0595


684958


685926 


• 


CT476 hypothetical protein 


CPn0596


685939 


686457 


F 


ada-methyltransferase-(CT477)


CPn0597


688215 


686479 


R 


oppC_2-Oligopeptide Permease_2-(CT478)


CPn0598


689697 


688219 


R 


oppB_2-Oligopeptide Permease_2-(CT479)


CPn0599


691802 


689682 


R 


oppA_5-Oligopeptide Binding Lipoprotein_5-(CT480)


CPn0600


692147 


691827 


R 




CPn0601


693053 


692736 


R 


CT483 hypothetical protein


CPn0602


694105 


693104 


R 


CT484 hypothetical protein


CPn0603


694205 


693185 


F 


hemZ-Ferrochelatase-(CT485)


CPn0604 


695945 


695196 


R


fliY-Glutamine Binding Protein-(CT486)


CPn0605


696707 


696150 


R 


yhhF-Methylase-(CT487)


CPn0606 


697444 


696707 


R 


CT488 hypothetical protein


CPn0607


698895 


697373 


R 


glgC-Glucose-1-P Adenyltransferase-(CT489)


CPn0608


699645 


699016 


R 


*pyrF-Uridine 5'-Monophosphate synthase (Ump Synthase)-truncated?


CPn0609 


699705 


699966 


F 


CT490 hypothetical protein 


CPn0610


701420 


700029 


R 


rho-Transcription Termination Factor-(CT491)


CPn0611 


702025 


701420 


R 


yacE-predicted phosphatase/Kinase-(CT492)


CPn0612


704631 


702022 


R 


polA-DNA Polymerase I-(CT493)


CPn0613


705656 


704658 


R 


sohB-Protease-(CT494)


CPn0614


707402 


705783 


R 


adt_2-ADP/ATP Translocase_2-(CT495)


CPn0615


708137 


707634 


R 


pgsA_1-Glycerol-3-P Phosphatidyltransferase_1-(CT496)

|nl»'^* w»^vw*w* • « W Mvvyiw^ • SMS * Wm ^ WW^+ \ ^ m ^ m ^ i 


CPn0616


708791 


710137 


F


dnaB-Replicative DNA Helicase-(CT497)


CPn0617


710484 


712316 


F 


gidA-FAD-dependent oxidoreductase-(CT498)


CPn0618


712306 


713010 


F 


lplA-Lipoate-Protein Ligase A-(CT499)


CPn0619 


713444 


713013 


R 


ndk-Nucleoside-2-P Kinase-(CT500)


CPn0620


714139 


713519 


R 


ruvA-Holliday Junction Helicase-(CT501)


CPn0621 


714647 


714144 


R 


ruvC-Crossover Junction Endonuclease-(CT502)


CPn0622


715752


714793 


R 


CT503 hypothetical protein


CPn0623


716993 


716163 


R 


CT504 hypothetical protein


CPn0624 


718015 


717011 


R 


gapA-Glyceraldehyde-3-P Dehydrogenase-(CT505)


CPn0625


718485 


718060 


R 


rl17-L17 Ribosomal Protein-(CT506)


CPn0626 


719616 


718495 


R 


rpoA-RNA Polymerase Alpha-(CT507)


CPn0627 


720038 


719640 


R 


rs11-S11 Ribosomal Protein-(CT508)


CPn0628


720428 


720063 


R 


rs13-S13 Ribosomal Protein-(CT509)


CPn0629 


721857 


720487 


R 


secY-Translocase-(CT510)


CPn0630


722316


721885 


R 


rl15-L15 Ribosomal Protein-(CT511)


CPn0631 


722806 


722312 


R 


rs5-S5 Ribosomal Protein-(CT512)


CPn0632


723195 


722827 


R 


rl18-L18 Ribosomal Protein-(CT513)


CPn0633


723757 


723209 


R 


rl6-L6 Ribosomal Protein-(CT514)


CPn0634 


724185 


723787 


R 


rs8-S8 Ribosomal Protein-(CT515)


CPn0635


724745 


724206 


R 


rl5-L5 Ribosomal Protein- (CT516) 


CPn0636


725082 


724750 


R 


rl24-L24 Ribosomal Protein-(CT517)


CPn0637


725464 


725099


R 


rl14-L14 Ribosomal Protein-(CT518)


CPn0638


725747 


725490 


R 


rs17-S17 Ribosomal Protein-(CT519)
725958


725743 


R 


lrl* 7 — 4*4 7 RiWIOBUIi tTObf ID* (tT9«U) 




726377 


725964 






CPn0641


727077 


726409 


R




CPn0642 


727428 


727096 


R 


ri22-l*22 KiDOSOJnai, rrOCf ln a I WT3*J J 


CPn0643


727713 


727450 


R 


ri49 a aii Autosomal rrocBin* t c* 3** j 


CPn0644


1*03 ' J 


mm 

'it S2Z 


R 


rl2-L2 Ribosomal Protein-(CT523)


CPn0645 


/209JU 


' 2B39B 


R 


»1 ^ ^ f ^ ^ Mai — 1 fffcaMamaM A * ahm f ****** ft ^ *? t 

rl23-L23 Ribosomal Protein-(CT526)


CPn0646 


729621 


728950 


R 


rl4-L4 Ribosomal Protein- (CT527) 


CPn0647


7J0 jJl 


729657 


R 


rl3-L3 Ribosomal Protein-(CT526)


CPn0648 


731603 


730605 


R 


CT529 hypothetical protein 


CPn0649 


732672 


731710 


R 


fmt-Methionyl tRNA Formyltransferase-(CT530)


CPn0650


733501 


732665 


R 


lpxA-Acyl-Carrier UDP-GlcNAc-(CT531)


CPn0651 


733975 


733517 


R 


fabZ-Myristoyl-Acyl Carrier Dehydratase-(CT532)


CPn0652 


734835 


733990 


R 


lpxC-Myristoyl GlcNAc Deacetylase-(CT533)


CPn0653 


736490 


734868 


R 


cutE-Apolipoprotein N-Acetyltransferase-(CT534)


CPn0654 


736967 


736503 


R 


volD/yciA-acyl-CoA Thioesterase-(CT535)


CPn0655


737847 


737101 


R 


dnaQ_2-DNA Pol III Epsilon Chain_2-(CT536)


CPn0656 


737872 


738048 


F 




CPn0657 


738473 


738051 


R 


yjeE (ATPase or Kinase)-(CT537)


CPn0658


739108


738455


R 


CT538 hypothetical protein 


CPn0659 


7J95 A J 


739838 


F 


trxA-Thioredoxin-(CT539)


CPn0660


740327


749000 


R 


spoU_2-rRNA Methylase_2-(CT540)


CPn0661


741100


740327 


R 


mip-FKBP-type peptidyl-prolyl cis-trans isomerase-(CT541)


CPn0662


742923 


741172 


R 


aspS-Aspartyl tRNA Synthetase-(CT542)


CPn0663 


744190 


742901 


R 


hisS-Histidyl tRNA Synthetase-(CT543)






744557 


R 




CPn0665




746365


F 


uhpC-Hexosephosphate Transport-(CT544)


CPn0666


746388 


7 cm A7 


F 


dnaE-DNA Pol III Alpha-(CT545)






7(A1 77 

/ 3U1 / / 


R 


predicted OMP [leader (17)]-(CT546)


CPn0668


/ 31*U7 


* 3*10« 


F 


CT547 hypothetical protein 




/ 341 / 9 


7 C777C 

732775 


F 


CT548 hypothetical protein 


CPn0670


752765 


753196 


F 


rsbw-sigma regulatory factor-histidine kinase- (CT549) 


CPn0671 




753205


R 


CT550 hypothetical protein


CPn0672 


753741 


755048 


F


dacF (pbp6)-D-Ala-D-Ala Carboxypeptidase-(CT551)


CPn0673 


755287 


755463 


F


CT552 hypothetical protein






755577 


R 


cnni-jviA netnyxtrans cerase- ( wT33 j ) 


CPn0675 


757919 


756768 


R 


WT09Q nypotnebicaA prociui 


CPn0676 


759217 




D * 
K 


homologous to CT69S 


CPn0677 


760401 


759256 


R






761320 


7CA£B9 
' OWQO* 


R 




CPn0679


762930 


761725 


R


P^k— raospnovxycsracQ Kinase" iwtovjj 


CPn0680


764248 


762971 


R 


yflwfl *rnoipiuwe remease- (wt 09* j 


CPn0681 


764929 


764258 




wToyx nypouieciwai protein 


CPn0682 


764984 


765955 


F


appVAOL ATrAie UipeptlQC TTOnipOtb — I W* Q3FU I 


CPn0683


765948 


766919 


F


ufjfjr now Ai ruv WA^St'WAWI 4 *> WlS^Wt b (WIWII* * 


CPn0684 


768038 


767181 




•pw / pu0 a w>awiy«Miic ru titioniny rcublU) iw* bub/ 


CPn0685 


768068 


768217 


F 




CPn0686


768361 


768176 


R




CPn0687


768564 


769214 


F


CT482 hypothetical protein


CPn0688 


769382 


770137 


F


CT481 hypothetical protein


CPn0689


771404 


770187 


R 


yfhO_1-NifS-related Aminotransferase_1-(CT687)


CPn0690


772660 


771436 


R


ABC Transporter Membrane Protein-(CT686)


CPn0691


773452 


772685 


R 


abcX-ABC Transporter ATPase-(CT685)


CPn0692 


774912 


773461 




ABC Transporter-(CT684)


CPn0693 


776256 


775240 


R 


TPR Repeats (O-Linked GlcNAc Transferase similarity)-(CT683)


CPn0694


779599 


776330 


R 


pbp2-PBP2-transglycolase/transpeptidase-(CT682)


CPn0695


780216 


781382 


F 


ompA-Major Outer Membrane Protein-(CT681)


CPn0696 


781769 


782599 


F 


rs2-S2 Ribosomal Protein- (CT680) 


CPn0697 


782602 


783447 


F 


tsf-Elongation Factor TS-(CT679)


CPn0698 


783458 


784201 


F 


pyrH-UMP Kinase-(CT678)


CPn0699 


784182 


784721 


F 


rrf-Ribosome Releasing Factor- (CT677) 


CPn0700 


785097 


785609 


F 


CT676 hypothetical protein 


CPn0701


785599 


786672 


F 


karG-Arginine Kinase-(CT675)


CPn0702 


789685


786929 


R 


yscC/gspD-Yop C/Gen Secretion Protein D-(CT674)


CPn0703 


791190 


789685 


R 


pkn5-S/T Protein Kinase-(CT673)


CPn0704 


792321 


791209 


R 


fliN-Flagellar Motor Switch Domain/YscQ family-(CT672)


CPn0705


793173 


792334 


R 


CT671 hypothetical protein 


CPn0706 


793683 


793180 


R 


CT670 hypothetical protein 


CPn0707


795029 


793704 


R 


yscN-Yop N (Flagellar-Type ATPase)-(CT669)


CPn0708 


795705 


795034 


R 


CT668 hypothetical protein 


CPn0709 


796188 


795742 


R 


CT667 hypothetical protein 


CPn0710


796461 


796210 


R 


CT666 hypothetical protein 
CPn0711


796731 


796486 


R 


CT665 hypothetical protein 


CPn0712 


799315 


796781 


R 


(FHA domain; homology to Adenylate cyclase)-(CT664)


CPn0713


799721 


799332 


R 


CT663 hypothetical protein 


CPn0714 


801107 


800091 


R 


hemA-Glutamyl tRNA Reductase-(CT662)


CPn0715 


801657 


803462 


F


gyrB_2-DNA Gyrase Subunit B_2-(CT661)


CPn0716 


803469 


804902 


F 


gyrA_2-DNA Gyrase Subunit A_2-(CT660)


CPn0717 


805010 


805306 


F


CT656 hypothetical protein


CPn0718 


805309


805626 


F


CT657 hypothetical protein 


CPn0719 


805916 


806890 


F 


sfhB-(Pseudouridine Synthase)-(CT658)


CPn0720 


807003 


807236 


F 


CT659 hypothetical protein 


CPn0721


807683 


808489 


F 


kdsA-KDO Synthetase-(CT655)


CPn0722 


808489


808974 


F


CT654 hypothetical protein 


CPn0723 


808984 


809703 


F 


yhbG-ABC Transporter ATPase-(CT653)


CPn0724


810527 


809706 


R 




CPn0725 


810811 


810587 


R 


CT652.1 hypothetical protein 


CPn0726 


813372


810680


R 


CT620 hypothetical protein 


CPn0727 


813577 


816192 


F 


CT619 hypothetical protein 


CPn0728 


818477 


816525 


R 


CHLPN 76kDa Homolog_1 (CT622)


CPn0729 


819857 


818592


R 


CHLPN 76kDa Homolog_2 (CT623)


CPn0730


821603 


819963 


R 


mviN-Integral Membrane Protein-(CT624)


CPn0731


821567 


821760 


F




CPn0732


822098 


822976 


F 


nfo-Endonuclease IV-(CT625) 


CPn0733 


823727 


823101 


R 


rs4-S4 Ribosomal Protein-(CT626)


CPn0734 


823944 


824915


F 


yceA-(CT627)


CPn0735 


825668


825003 


R 


•pyrH/udk-Uridine Kinase (Uridine Monophosphokinase) (Pyrimidine Ribonucleoside Kinase)


CPn0736 


827686 


825992 


R 


ygeD-Efflux Protein-(CT641)


CPn0737 


827685


830756 


F


recC-Exodeoxyribonuclease V, Gamma-(CT640)


CPn0738 


830746 


833895 


F 


recB-Exodeoxyribonuclease V, Beta-(CT639)


CPn0739 


834871 


833861 


R 


CT638 hypothetical protein 


CPn0740


836046 


834864 


R 


tyrB-Aromatic AA Aminotransferase-(CT637)


CPn0741


838350


836165


R 


greA-Transcription Elongation Factor-(CT636)


CPn0742 


838463 


838868 


F 


CT635 hypothetical protein 


CPn0743


838962 


840362 


F 


nqrA-Ubiquinone Oxidoreductase, Alpha-(CT634)


CPn0744 


841364 


840389 


R 


hemB-Porphobilinogen Synthase-(CT633)


CPn0745 


841903 


841742 


R 




CPn0746


841975 


843567 


F 


CT632 hypothetical protein 


CPn0747 


843675 


843740 


F* 


CT631 hypothetical protein 


CPn0747 


843725 


843910 


F 


CT631 hypothetical protein (frame-shift)


CPn0748


844987


844121


R 


ispA-Geranyl Transtransferase-(CT628)


CPn0749 


845629 


845006 


R 


glmU-UDP-GlcNAc Pyrophosphorylase-(CT629)


CPn0750 


846411 


845707 


R 


tctD/cpxR-HTH Transcriptional Regulatory Protein + Receiver-(CT630)


CPn0751 


846608 


848434 


F 


CT651 hypothetical protein


CPn0752


848604 


850082 


F 


recD_2-Exodeoxyribonuclease V, Alpha_2-(CT652) 


CPn0753 


851006 


850161 


R 




CPn0754 


851336 


851040 


R 


rs20-S20 Ribosomal Protein- (CT617) 


CPn0755


851597 


852799 


F 


CT616 hypothetical protein 


CPn0756 


852961 


854676 


F 


rpoD-RNA Polymerase Sigma-66-(CT615)


CPn0757 


854733 


855134 


F 


folX-Dihydroneopterin Aldolase-(CT614)


CPn0758


855110 


856459 


F 


folP/dhpS-Dihydropteroate Synthase-(CT613)


CPn0759


856488 


856997


F 


folA-Dihydrofolate Reductase-(CT612)


CPn0760


856957 


857694


F 


CT611 hypothetical protein 


CPn0761 


857704 


858375


F 


CT610 hypothetical protein 


CPn0762 


859597


858539 


R 


recA-RecA recombination protein-(CT650)


CPn0763 


860511 


859972 


R 


ygfA-Formyltetrahydrofolate Cycloligase-(CT649)


CPn0764


861807 


860524 


R 


CT648 hypothetical protein 


CPn0765


862362 


861801 


R 


CT647 hypothetical protein 


CPn0766


863782 


862394


R 


CT646 hypothetical protein 


CPn0767


863884 


864177


F 


CT645 hypothetical protein 


CPn0768 


864159 


865163 


F 


yohI/nir3-predicted oxidoreductase-(CT644)


CPn0769


867733 


865121 


R 


topA-DNA Topoisomerase I-Fused to SWI Domain-(CT643)


CPn0770


868340 


869131 


F 


CT642 hypothetical protein 


CPn0771 


870463 


869144 


R 


rpoN-RNA Polymerase Sigma-54-(CT609)


CPn0772


872385 


870469


R 


uvrD-DNA Helicase-(CT608)


CPn0773 


872488 


873195 


F 


ung-Uracil DNA Glycosylase-(CT607)


CPn0774 


873195 


873425 


F 


CT606.1 hypothetical protein 


CPn0775 


874031 


873414 


R 


yggV family- (CT606) 


CPn0776


874246 


875467 


F 


CT605 hypothetical protein


CPn0777 


875601 


877178 


F 


groEL_2-heat shock protein-60-(CT604)


CPn0778 


877505 


878092 


F 


tsa/ahpC-Thio-specific Antioxidant (TSA) Peroxidase-(CT603)


CPn0779


878481 


878095 


R 


CT602 hypothetical protein 
CPn0780


879205 


878591 


R 


papQ/amiB-N-Acetylmuramoyl-L-Ala Amidase-(CT601)


CPn0781 


879773 


879196 


R 


pal-Peptidoglycan-Associated Lipoprotein-(CT600)


CPn0782 


881065


879773 


R 


tolB-polysaccharide transporter-(CT599)


CPn0783


881885


881100


R 


CT598 hypothetical protein


CPn0784


882296 


881892


R 


exbD-Biopolymer Transport protein-(CT597)


CPn0785 


882991


882296 


R 


exbB/tolQ-polysaccharide transporter-(CT596)


CPn0786


883185 


885293 


F


dsbD/xprA-Thiol:disulfide Interchange Protein-(CT595)


CPn0787


885619


886401 


F


yabD/ycfH-PHP superfamily (urease/pyrimidinase) hydrolase-(CT594)


CPn0788


886542


887432 


F 


sdhC-Succinate Dehydrogenase-(CT593)


CPn0789



887439


889316 


F 


sdhA-Succinate Dehydrogenase-(CT592)


CPn0790


889330 


890103 


F


sdhB-Succinate Dehydrogenase-(CT591)


CPn0791


893050


890111 


R 


CT590 hypothetical protein


CPn0792


894919 


893108


R 


CT589 hypothetical protein


CPn0793


896823 


894919 


R 


rbsU-sigma regulatory family protein-PP2C phosphatase (RsbW antagonist)-(CT588)


CPn0794


897174 


898004


F




CPn0795


898126 


899195 


F




CPn0796



899301


901340 


F




CPn0797 


901600 


902694 


F




CPn0798


902846 


903856 


F




CPn0799


904986 


903940 


R






906532 


905249 


R 




CPn0801


908697 


906727 


R 


uvrB-Excinuclease ABC Subunit B-(CT586)


CPn0802



909740 


908709 


R


trpS-Tryptophanyl tRNA Synthetase-(CT585)


CPn0803


910303 


909752 


R 


CT584 hypothetical protein


CPn0804


911059 


910310 


R 


gp6D-CHLTR Plasmid Paralog-(CT583)


CPn0805


911631 


911067 


R 


minD-chromosome partitioning ATPase-CHLTR plasmid protein GP5D-(CT582)


CPn0806


913771 


911867 


R 


thrS-Threonyl tRNA Synthetase-(CT581)


CPn0807


913971 


914879 


F


CT580 hypothetical protein 


CPn0808


916287 


914956 


R 


CT579 hypothetical protein


CPn0809


917785


916307 


R 


CT578 hypothetical protein


CPn0810


918194 


917825 


R 


CT577 hypothetical protein 


CPn0811


918900 


918208


R 


lcrH_1-Low Ca Response Protein H_1-(CT576)


CPn0812 


919123 


920862 


F


mutL-DNA Mismatch Repair- (CT575) 


CPn0813


920870 


921934 


F


pepP-Aminopeptidase P-(CT574)


CPn0814


922107 


923357


F 


CT573 hypothetical protein 


CPn0815 


923361 


925622 


F 


gspD/pilQ-Gen. Secretion Protein D-(CT572)


CPn0816 


925615 


927102 


F


gspE-Gen. Secretion Protein E-(CT571) 


CPn0817


927115 


928287 


F 


gspF-Gen. Secretion Protein F-(CT570)


CPn0818


928314


928662


F 


predicted OMP [leader (16) peptide]-(CT569)


CPn0819 


928689


929132 


F 


CT568 hypothetical protein


CPn0820


929120 


929659 


F 


CT567 hypothetical protein


CPn0821 


929667 


930668 


F 


CT566 hypothetical protein


CPn0822


930756 


931229 


F 


CT565 hypothetical protein 


CPn0823


932367 


931501 


R 


yscT/spaR-YopT Translocation T-(CT564)


CPn0824 


932662 


932376 


R 


yscS/fliQ-YopS/fliQ Translocation Protein-(CT563)


CPn0825


933594 


932677 


R 


yscR-Yop Translocation R-(CT562) 


CPn0826


934310 


933612 


R 


yscL-Yop Translocation L-(CT561) 


CPn0827


935264 


934434 


R 


CT560 hypothetical protein 


CPn0828 


936271 


935267 


R 


yscJ-Yop Translocation J-(CT559)


CPn0829 


936744 


937296 


F 




CPn0830 


937444 


937959


F 




CPn0831


938267 


938434 


F 




CPn0832 


939747 


938827 


R 


lipA-Lipoate Synthetase-(CT558)


CPn0833


941129 


939747 


R 


lpdA-Lipoamide Dehydrogenase-(CT557)


CPn0834


941553 


942014 


F 


CT556 hypothetical protein


CPn0835


945669 


942045 


R 


mot1_1-SWI/SNF family helicase_1-(CT555)


CPn0836


946879 


945722 


R 


brnQ-Amino Acid (Branched) Transport- (CT554) 


CPn0837 


947771 


947145 


R 


nth-Endonuclease III-(CT697)


CPn0838


949106 


947781 


R 


thdF-Thiophene/Furan Oxidation Protein-(CT698)


CPn0839 


949257 


950159 


F 


psdD-Phosphatidylserine Decarboxylase-(CT699)


CPn0840


950222 


951544 


F 


CT700 hypothetical protein 


CPn0841 


951731 


954640 


F


secA_2-Translocase SecA_2-(CT701) 


CPn0842 


954883 


954710 


R 


CT702 hypothetical protein (frame-shift with 0843)


CPn0843 


955191 


954994 


R 


CT702 hypothetical protein 


CPn0844 


956730


955270 


R 


yphC-GTPase/GTP-binding protein-(CT703)


CPn0845 


958079


956650 


R 


pcnB_1-Poly A Polymerase_1-(CT704)


CPn0846 


959374 


958112 


R 


clpX-CLP Protease ATPase-(CT705) 


CPn0847 


959995 


959367 


R 


clpP-CLP Protease Subunit-(CT706) 


CPn0848


961502 


960177 


R 


tig/murI-Trigger Factor-peptidyl-prolyl isomerase-(CT707)


CPn0849 


961788 


965285 


F 


mot1_2-SWI/SNF family helicase_2-(CT708)


CPn0850


965293 


966390 


F 


mreB-Rod Shape Protein-Sugar Kinase-(CT709)
CPn0851


966396 


968195 


F


CPn0852 


968316 


970613 


F 


CPn0853 


970637 


971603 




CPn0854 


972837 


971806 




CPn0855


973995 


972994 


R


CPn0856


975377 


973995 


R


CPn0857 


975757 


7 '9 474 


R 


CPn0858


977055 


975757 


R 




977588 


7 I 'U33 


R 




978630 


7 / / QUO 


R 




979722 


Q70.O94 
7 / 07*3 


R 




980873 


979752 
7 / 7 / 


R 




981514 


980631 


R


CPn0864 


90X670 


982374 




CPn0865



982418


982942




CPn0866


983491 


982916




CPn0867 


983423 


984667 




CPn0868


986643 


984670 


* 


CPn0869 


987401


986658 


* 


CPn0870 


988728 


987448 


* 


CPn0871


988772


989899 




CPn0872


989963


991216 




CPn0873 


991233 


991694 




CPn0874 


993107 


991749 




CPn0875 


993372 


994022 




CPn0876 


994144 


995517 




CPn0877 


995533 


995982 




CPn0878


996654 


995992 




CPn0879


997439 


996645 




CPn0880


999861 


997444 




CPn0881


1005667 


1006209 




CPn0882


1006268 


1007404 




CPn0883 


1008865


1007573 




CPn0884 


1009359 


1009009 




CPn0885


1010635 


1009433 


R


CPn0886


1011276 


1010908 


R


CPn0887


1011692 


1014157 


F


CPn0888 


1015423 


1014119 


R* 


CPn0889


1016835 


1015462 


R


CPn0890 


1017805 


1016619 


R


CPn0891 


1021073 


1017819 


R


CPn0892 


1023661 


1021046 


R 


CPn0893 


1023894 


1025688 




CPn0894


1026766 


1025888 




CPn0895


1026988 


1027557 




CPn0896


1027595 


1027822 




CPn0897 


1028737 


1027853 


R 


CPn0898


1030460 


1028904 


R 


CPn0899


1030875 


1032215 


F 


CPn0900 


1032235 


1033281 




CPn0901 


1033287 


1034537 


F 


CPn0902


1034543 


1035241


F


CPn0903 


1035263 


1036417 


F


CPn0904


1036326 


1037396 


F


CPn0905


1037409 


1039835 


F


CPn0906 


1040340 


1039915 




CPn0907 


1040780 


1040445 


R


CPn0908


1041589 


1040780 




CPn0909 


1041637 


1041966 




CPn0910


1041979 


1043004 




CPn0911


1044043 


1042985 




CPn0912 


1044129 


1045760 




CPn0913 


1045760 


1045945




CPn0914


1045999 


1046397




CPn0915 


1046461 


1046817 




CPn0916 


1046837 


1048084 




CPn0917


1048090 


1048539 




CPn0918


1049223 


1048579 




CPn0919


1049378 


1050430 




CPn0920


1051405 


1050431 




CPn0921


1051535


1052293 





pckA-Phosphoenolpyruvate Carboxykinase-(CT710)
CT711 hypothetical protein
CT712 hypothetical protein
ompB-Outer Membrane Protein B-(CT713)
gpdA-Glycerol-3-P Dehydrogenase-(CT714)
AgX-1 Homolog-UDP-Glucose Pyrophosphorylase-(CT715)
CT716 hypothetical protein
fliI-Flagellum-specific ATP Synthase-(CT717)
CT718 hypothetical protein
fliF-Flagellar M-Ring Protein-(CT719)
nifU-NifU-related protein-(CT720)
yfhO_2-NifS-related protein_2-(CT721)
pgmA-Phosphoglycerate Mutase-(CT722)
yjbC-predicted pseudouridine synthase-(CT723)
CT724 hypothetical protein
birA-Biotin Synthetase-(CT725)
rodA-Rod Shape Protein-(CT726)
zntA/cadA-Metal Transport P-type ATPase-(CT727)
CT728 hypothetical protein
serS-Seryl tRNA Synthetase_2-(CT729)
ribD-Riboflavin Deaminase-(CT730)
ribA&ribB-GTP Cyclohydratase & DHBP Synthase-(CT731)
ribE-Ribityllumazine Synthase-(CT732)
CT733 hypothetical protein
CT734 hypothetical protein
dagA_2-D-Alanine/Glycine Permease_2-(CT735)
ybcL family-(CT736)
SET Domain protein-(CT737)
yycJ-metal dependent hydrolase-(CT738)
ftsK-Cell Division Protein FtsK-(CT739)
dmpP/nqr6-Phenolhydrolase/NADH ubiquinone oxidoreductase-(CT740)
CT741 hypothetical protein
ygcA-rRNA Methyltransferase-(CT742)
hctA-Histone-Like Developmental Protein-(CT743)
CHLTR possible phosphoprotein-(CT744)
hemG-Protoporphyrinogen Oxidase-(CT745)
hemN_2-Coproporphyrinogen III Oxidase_2-(CT746)
hemE-Uroporphyrinogen Decarboxylase-(CT747)
mfd-Transcription-Repair Coupling-(CT748)
alaS-Alanyl tRNA Synthetase-(CT749)
tktB-Transketolase-(CT750)
amn-AMP Nucleosidase-(CT751)
efp_2-Elongation Factor P_2-(CT752)
CT753 hypothetical protein
(possible phosphohydrolase)-(CT754)
Mitochondrial HSP60 Chaperonin Homolog-(CT755)
murF-Muramoyl-DAP Ligase-(CT756)
mraY-Muramoyl-Pentapeptide Transferase-(CT757)
murD-Muramoylalanine-Glutamate Ligase-(CT758)
nlpD-Muramidase (invasin repeat family)-(CT759)
ftsW-Cell Division Protein FtsW-(CT760)
murG-Peptidoglycan Transferase-(CT761)
murC&ddlA-Muramate-Ala Ligase & D-Ala-D-Ala Ligase-(CT762)
CT763 hypothetical protein
•cutA Periplasmic Divalent Cation Tolerance Protein CutA (C-Type Cytochrome Biogenesis Protein)
CT764 hypothetical protein
rsbV_2-Sigma Factor Regulator_2-(CT765)
miaA-tRNA Pyrophosphate Transferase-(CT766)
Fe-S cluster oxidoreductase_2-(CT767)
CT768 hypothetical protein
ybeB-iojap superfamily ortholog-(CT769)
fabF-Acyl Carrier Protein Synthase-(CT770)
hydrolase/phosphatase homolog-(CT771)
ppa-Inorganic Pyrophosphatase-(CT772)
ldh-Leucine Dehydrogenase-(CT773)
cysQ-Sulfite Synthesis/biphosphate phosphatase-(CT774)
snGlycerol-3-P Acyltransferase-(CT775)
CPn0922


1052314 


1053927 


F 


aas-Acylglycerophosphoethanolamine Acyltransferase-(CT776)


CPn0923


1053964 


1055093 


F 


bioF_1-Oxononanoate Synthase_1-(CT777)


CPn0924 


1057274 


1055028 


R 


priA-Primosomal Protein N'-(CT778)


CPn0925


1057900 


1057226 


R 


CT779 hypothetical protein


CPn0926 


1058060 


1058557


F 


Thioredoxin Disulfide Isomerase-(CT780)


CPn0927 


1059809 


1058670 


R 


•CHLPS 43 kDa protein homolog_2


CPn0928 


1061008 


1059684 


R 


•CHLPS 43 kDa protein homolog_3


CPn0929 


1062292 


1061186 


R 


•CHLPS 43 kDa protein homology


CPn0930


1062657 


1063330 


F 




CPn0931 


1064138 


1065718 


F


lysS-Lysyl tRNA Synthetase-(CT781)


CPn0932 


1067142 


1065721 


R 


cysS-Cysteinyl tRNA Synthetase-(CT782)


CPn0933 


1067535


1068578 


F 


predicted disulfide bond isomerase-(CT783)


CPn0934 


1068942 


1068526 


R 


rnpA-Ribonuclease P Protein Component-(CT784)


CPn0935


1069091 


1068957 


R 


rl34-L34 Ribosomal Protein-(CT785)


CPn0936 


1069336 


1069470 


F 


rl36-L36 Ribosomal Protein-(CT786)


CPn0937


1069496


1069796 


F 


rs14-S14 Ribosomal Protein-(CT787)


CPn0938 


1070322 


1069849 


R 


CT788 hypothetical protein-[leader (60) peptide-periplasmic]


CPn0939


1070726 


1071195 


F 


CT790 hypothetical protein 


CPn0940 


1073012 


1071204 


R 


uvrC-Excinuclease ABC, Subunit C-(CT791)


CPn0941 


1075501 


1073018 


R 


mutS-DNA Mismatch Repair-(CT792)


CPn0942 


1075985 


1077754 


F 


dnaG/priM-DNA Primase-(CT794) 


CPn0943 


1077978 


1078236


F 


CT794.1 hypothetical protein 


CPn0944 


1078512


1078997 


F 




CPn0945


1079070 


1079660 


F 


CT795 hypothetical protein


CPn0946 


1082786


1079745 


R 


glyQ-Glycyl tRNA Synthetase-(CT796)


CPn0947 


1083442


1084059 


F 


pgsA_2-Glycerol-3-P Phosphatidyltransferase_2-(CT797)


CPn0948 


1085474 


1084047 


R 


glgA-Glycogen Synthase-(CT798)


CPn0949 


1085929 


1086483 


F 


ctc-General Stress Protein-(CT799)


CPn0950


1086488


1087027


F 


pth-Peptidyl tRNA Hydrolase- (CT800) 


CPn0951 


1087122


1087457


F 


rs6-S6 Ribosomal Protein- (CT801) 


CPn0952


1087478


1087723 


F 


rs18-S18 Ribosomal Protein-(CT802)


CPn0953 


1087742 


1088248


F 


rl9-L9 Ribosomal Protein-(CT803)


CPn0954 


1088286


1088708 


F 


ychB-Predicted Kinase- (CT804) 


CPn0955 


1088612 


1089175 


F 


(frame-shift with 0954)


CPn0956 


1089560 


1090909 


F 


CT805 hypothetical protein 


CPn0957 


1093788 


1090963 


R 


ide/ptr-Insulinase family/Protease III-(CT806)


CPn0958 


1094785 


1093793 


R 


plsB-Glycerol-3-P Acyltransferase-(CT807)


CPn0959 


1096343 


1094799 


R


cafE-Axial Filament Protein-(CT808)


CPn0960


1096764 


1097102 


F 


CT809 hypothetical protein 


CPn0961 


1097118 


1097297 


F 


rl32-L32 Ribosomal Protein- (CT810) 


CPn0962 


1097316 


1098275 


F 


plsX-FA/Phospholipid Synthesis Protein-(CT811)


CPn0963 


1098396 


1103224 


F 


pmp_21-Polymorphic Outer Membrane Protein D Family-(CT812)


CPn0964


1104756 


1103301 


R 




CPn0965 


1106736 


1104925 


R 


lpxB-Lipid A Disaccharide Synthase-(CT411)


CPn0966 


1108037


1106748 


R 


pcnB_2-PolyA Polymerase_2-(CT410)


CPn0967 


1108512 


1109885 


F 


mrsA/pgm-phosphoglucomutase-(CT815)


CPn0968 


1109895 


1111721 


F 


glmS-Glucosamine-Fructose-6-P Aminotransferase-(CT816)


CPn0969


1111812 


1112999 


F


tyrP_1-Tyrosine Transport_1-(CT817)


CPn0970 


1113461


1114648 


F 


tyrP_2-Tyrosine Transport_2-(CT818)


CPn0971 


1114702 


1115415


F 


yccA-Transport Permease-(CT819)


CPn0972


1116299 


1115430 


R 


ftsY-Cell Division protein FtsY-(CT820) 


CPn0973 


1116370 


1117527 


F 


sucC-Succinyl-CoA Synthetase, Beta-(CT821)


CPn0974 


1117544 


1118422 


F 


sucD-Succinyl-CoA Synthetase, Alpha-(CT822)


CPn0975


1119104 


1119637 


F 




CPn0976 


1120082 


1121165 


F 




CPn0977 


1121371 


1122402 


F 




CPn0978


1122665 


1123693 


F 




CPn0979


1123980 


1125443 


F 


htrA-DO Serine Protease-(CT823)


CPn0980 


1126962 


1125504 


R 


•similarity to Saccharomyces cerevisiae hypothetical 52.9KD protein


CPn0981


1127031 


1129952 


F 


Zinc Metalloprotease (Insulinase family)-(CT824)


CPn0982 


1131194 


1129962 


R 


yigN family- (CT825) 


CPn0983 


1132000 


1131206 


R 


pssA-Glycerol-Serine Phosphatidyltransferase-(CT826)


CPn0984 


1132379 


1135510 


F 


nrdA-Ribonucleotide Reductase, Large Chain-(CT827)


CPn0985 


1135534 


1136571 


F 


nrdB-Ribonucleoside Reductase, Small Chain-(CT828)


CPn0986 


1136724 


1137395 


F 


ygcH-predicted rRNA Methylase-(CT829)


CPn0987 


1137516 


1138115 


F


ytgB-like predicted rRNA methylase-(CT830)


CPn0988


1138986 


1138075 


R 


murB-UDP-N-Acetylenolpyruvoylglucosamine Reductase-(CT831)


CPn0989 


1139495 


1139016 


R 


CT832 hypothetical protein 


CPn0990 


1139883


1140440 


F 


infC-Initiation Factor 3-(CT833)


CPn0991 


1140421 


1140612 


F 


rl35-L35 Ribosomal Protein-(CT834)
CPn0992


1140634 


1140996 


F


rl20-L20 Ribosomal Protein-(CT835)


CPn0993 


1141014


1142030 


F 


pheS-Phenylalanyl tRNA Synthetase, Alpha-(CT836)


CPn0994 


1142398 


1144440 


F


CT837 hypothetical protein 


CPn0995 


1145512 


1144415 


R 


CT838 hypothetical protein


CPn0996


1146589 


1145519 


R 


CT839 hypothetical protein


CPn0997 


1146708 


1147664 


F


mesJ-PP-loop superfamily ATPase-(CT840)


CPn0998 


1147855 


1150584


F 


ftsH-ATP-dependent zinc protease-(CT841)


CPn0999 


1152847 


1150766 


R 


pnp-Polyribonucleotide Nucleotidyltransferase-(CT842)


CPn1000


1153157 


1152891 


R 


rs15-S15 Ribosomal Protein-(CT843)


CPn1001


1153405 


1153869 


F 


yfhC-cytosine deaminase- (CT844) 


CPn1002


1153862 


1154089 


F 


CT845 hypothetical protein 


CPn1003


1154796 


1154092 


R 


CT846 hypothetical protein


CPn1004


1155397 


1154879 


R 


CT847 hypothetical protein


CPn1005


1155933 


1155415 


R 


CT848 hypothetical protein


CPn1006


1156472 


1155990 


R 


CT849 hypothetical protein


CPn1007


1156689 


1156907 


F 


CT849.1 hypothetical protein 


CPn1008


1156928 


1158223 


F 


CT850 hypothetical protein 


CPn1009


1159058 


1158186 


R 


map-Methionine Aminopeptidase-(CT851)


CPn1010


1159672 


1159067 


R 


CT852 hypothetical protein


CPn1011


1160306 


1159902 


R 


CT853 hypothetical protein


CPn1012


1162193 


1160421 


R 


yzeB-ABC transporter permease-(CT854)


CPn1013


1162245 


1163624


F 


fumC-Fumarate Hydratase-(CT855)


CPn1014


1165426 


1163732 


R 


ychM-Sulfate Transporter-(CT856)


CPn1015


1165634 


1166893 


F 


CT857 hypothetical protein (possible IM protein)


CPn1016


1167042 


1168898 


F 


CT858 hypothetical protein


CPn1017


1169006 


1169935 


F 


lytB-Metalloprotease- (CT859) 


CPn1018


1169898 


1170629 


F 




CPn1019


1172128 


1170638 


R 


CT860 hypothetical protein 


CPn1020


1173679 


1172150 


R 


CT861 hypothetical protein 


CPn1021


1174213 


1173698 


R 


lcrH_2-Low Calcium Response_2-(CT862)


CPn1022


1175673 


1174216 


R 


CT863 hypothetical protein


CPn1023


1176035 


1176331 


F 




CPn1024


1177236 


1176334 


R 


xerD-Integrase/recombinase-(CT864)


CPn1025


1177302 


1178879 


F 


pgi-Glucose-6-P Isomerase-(CT378)


CPn1026


1178997 


1179137 


F 


ltuA-(CT377) 


CPn1027


1179175 


1180755 


F 




CPn1028


1181016 


1181999 


F 


mdhC-Malate Dehydrogenase-(CT376)


CPn1029


1182008 


1182844 


F* 




CPn1030


1183886 


1182843 


R 


predicted D-amino acid dehydrogenase-(CT375)


CPn1031


1185552 


1184098 


R 


arcD-Arginine/Ornithine Antiporter-(CT374)


CPn1032


1186150 


1185566 


R 


CT373 hypothetical protein 


CPn1033


1187500 


1186187 


R 


CT372 hypothetical protein 


CPn1034


1188517 


1187732 


R 


Predicted OMP_1 (CT371) [leader (18) peptide]


CPn1035


1190000 


1188570 


R 


aroE-Shikimate 5-Dehydrogenase-(CT370)


CPn1036


1191135 


1189984 


R 


aroB-Dehydroquinate Synthase-(CT369)


CPn1037


1192199 


1191123 


R 


aroC-Chorismate Synthase-(CT368)


CPn1038


1192726 


1192199 


R 


aroL-Shikimate Kinase II-(CT367)


CPn1039


1193999 


1192665 


R 


aroA-Phosphoshikimate Vinyltransferase-(CT366)


CPn1040


1194741 


1194073 


R 




CPn1041


1195994 


1194726 


R 


•bioA-Adenosylmethionine-8-Amino-7-Oxononanoate Aminotransferase


CPn1042


1196590


1195934 


R 


•bioD-dethiobiotin synthetase 


CPn1043


1197717 


1196572


R 


bioF_2-Oxononanoate Synthase_2 


CPn1044


1198691 


1197699 


R 


•bioB-Biotin Synthase 


CPn1045


1199590 


1198901 


R 


•conserved hypothetical bacterial membrane protein


CPn1046


1200675 


1199590 


R 


•Tryptophan Hydroxylase


CPn1047


1200552 


1201343 




dapB-Dihydrodipicolinate Reductase-(CT364)


CPn1048


1201606 


1202604 


F


asd-Aspartate Dehydrogenase-(CT363)




1202595


1203914 




lysC-Aspartokinase III-(CT362)


CPn1050


1203926 


1204798 




dapA-Dihydrodipicolinate Synthase-(CT361)


CPn1051


1204962 


1205270 






CPn1052


1205417 


1206169 






CPn1053


1206153 


1206701 






CPn1054


1207034 


1209466 






CPn1055


1209694 


1210521 






CPn1056


1210527 


1211228 






CPn1057


1211497 


1213596 




CT356 hypothetical protein 


CPn1058


1213748 


1214836 




CT355 hypothetical protein


CPn1059


1214848 


1215678 




ksgA-Dimethyladenosine Transferase-(CT354)


CPn1060


1217658 


1215727 


R 


dxs/tkt-Transketolase-(CT331) 


CPn1061


1217920 


1217666 


R 


CT330 hypothetical protein 


CPn1062


1219820 


1218159 


R 


xseA-Exodeoxyribonuclease VII-(CT329)


CPn1063


1219951 


1220712 


F


tpiS-Triosephosphate Isomerase-(CT328)
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CPnl064 122( 


)719 1220895 


r 


CPnl065 1223 


L095 122092B 


R 


CPnX066 1223 


L135 1221488 


r 


CPnl067 1223 


L73S 1222292 


r 


CPnl06S 122] 


1256 1222365 


R 


CPnl069 122] 


1513 1223941 


r 


CPU1070 122! 


1511 1224144 


R 


CPnl071 1221 


'324 12258B5 


R 


CPnl072 1221 


'969 1228835 


F 


CPnl073 122S 


►Oil 1229832 


r 



def-Polypeptide Deformylase-(CT353)
rnhB_2-Ribonuclease HII_2-(CT008)
yfgA-HTH Transcriptional Regulator-(CT009)

Predicted OMP_2-(CT371)
Table 2 (Supplemental Data). Functional Assignments of C. pneumoniae Coding Sequences. C. trachomatis genes are shown in parentheses.



Amino Acid Biosynthesis



Aromatic Family 






1039 


(CT366) 


aroA 


Phosphoshikimate Vinyltransferase


1036 


(CT369) 


aroB 


Dehydroquinate Synthase


1037 


(CT368) 


aroC


Chorismate Synthase 


1035 


(CT370) 


aroE 


Shikimate 5-Dehydrogenase


0484 


(CT382) 


aroG


Deoxyheptonate Aldolase


1038 


(CT367) 


aroL 


Shikimate Kinase II


0740 


(CT637) 


tyrB 


Aromatic AA Aminotransferase


Aspartate Family (lysine)




1048 


(CT363) 


asd 


Aspartate Dehydrogenase


1050 


(CT361) 


dapA 


Dihydrodipicolinate Synthase


1047 


(CT364) 


dapB 


Dihydrodipicolinate Reductase


0519


(CT430) 


dapF 


Diaminopimelate Epimerase


1049 


(CT362) 


lysC 


Aspartokinase III



Serine Family 
0433 (CT282) gcvH Glycine Cleavage System H Protein
0521 (CT432) glyA Serine Hydroxymethyltransferase



Base & Nucleotide Metabolism 



0171 




guaA 


GMP Synthase 


0172 




guaB 


Inosine 5'-Monophosphate Dehydrogenase


0608 






Uridine 5'-Monophosphate Synthase


0735 






Uridine Kinase 


0244 


(CT128) 


adk 


Adenylate Kinase 


0894 


(CT751) 


amn 


AMP Nucleosidase 


0568 


(CT452) 


cmk 


CMP Kinase 


0392 


(CT039) 


dcd


dCTP Deaminase 


0059 


(CT292) 


dut


dUTP Nucleotidohydrolase 


0120 


(CT030) 


gmk 


GMP Kinase


0619 


(CT500) 


ndk 


Nucleoside-2-P Kinase 


0984 


(CT827) 


nrdA 


Ribonucleoside Reductase, Large Chain


0985 


(CT828) 


nrdB 


Ribonucleotide Reductase, Small Chain


0236 


(CT183)


pyrG 


CTP Synthetase 


0698 


(CT678) 


pyrH


UMP Kinase 


0273 


(CT188) 


tdk 


Thymidylate Kinase 


0659 


(CT539) 


trxA 


Thioredoxin 


0314 


(CT099) 


trxB 


Thioredoxin Reductase 


1001 


(CT844) 


yfhC


Cytosine Deaminase



Biosynthesis of Cofactors






Biotin, Lipoate & Ubiquinone



1041 




bioA


Adenosylmethionine-8-Amino-7-Oxononanoate Aminotransferase


1044 




bioB 


Biotin Synthase


1042 




bioD 


Dethiobiotin Synthetase


0923 


(CT777) 


bioF_1


Oxononanoate Synthase_1


1043 


(CT777) 


bioF_2


Oxononanoate Synthase_2


0866 


(CT725) 


birA 


Biotin Synthetase


0748 


(CT628) 


ispA 


Geranyl Transtransferase


0832 


(CT558) 


lipA 


Lipoate Synthetase

0265 (CT219) ubiA Benzoate Octaphenyltransferase
0264 (CT220) ubiD Phenylacrylate Decarboxylase
0515 (CT428) ubiE Ubiquinone Methyltransferase

Folic Acid

0759 (CT612) folA Dihydrofolate Reductase
0335 (CT078) folD Methylene Tetrahydrofolate Dehydrogenase
0758 (CT613) folP Dihydropteroate Synthase
0757 (CT614) folX Dihydroneopterin Aldolase
0763 (CT649) ygfA Formyltetrahydrofolate Cycloligase

Porphyrin

0714 (CT662) hemA Glutamyl tRNA Reductase
0744 (CT633) hemB Porphobilinogen Synthase
0052 (CT299) hemC Porphobilinogen Deaminase
0890 (CT747) hemE Uroporphyrinogen Decarboxylase
0888 (CT745) hemG Protoporphyrinogen Oxidase
0138 (CT210) hemL Glutamate-1-Semialdehyde-2,1-Aminomutase
0380 (CT052) hemN_1 Coproporphyrinogen III Oxidase_1
0889 (CT746) hemN_2 Coproporphyrinogen III Oxidase_2
0603 (CT485) hemZ Ferrochelatase

0872 (CT731) ribA&ribB GTP Cyclohydratase & DHBP Synthase
0532 (CT405) ribC Riboflavin Synthase
0871 (CT730) ribD Riboflavin Deaminase
0873 (CT732) ribE Ribityllumazine Synthase
0320 (CT093) ribF FAD Synthase



Cell Envelope 

Fatty Acid & Phospholipid Metabolism





0161 


(CT206) 




(predicted acyltransferase family)




0922


(CT776) 


aas 


Acylglycerophosphoethanolamine Acyltransferase




0414 


(CT265) 


accA 


AcCoA Carboxylase/Transferase Alpha 




0183 


(CT123) 


accB 


Biotin Carboxyl Carrier Protein 




0182 


(CT124) 


accC 


Biotin Carboxylase




0058 


(CT293) 


accD 


AcCoA Carboxylase/Transferase Beta




0295 


(CT236) 


acpP 


Acyl Carrier Protein 




0313 


(CT100)


acpS 


Acyl-carrier Protein Synthase 




0567 


(CT451) 


cdsA 


Phosphatidate Cytidylyltransferase




0297 


(CT238) 


fabD 


Malonyl Acyl Carrier Transacylase




0916 


(CT770) 


fabF 


Acyl Carrier Protein Synthase 




0296 


(CT237) 


fabG 


Oxoacyl (Carrier Protein) Reductase 




0298


(CT239) 


fabH


Oxoacyl Carrier Protein Synthase III 




0406 


(CT104) 


fabI


Enoyl-Acyl-Carrier Protein Reductase




0651 


(CT532) 


fabZ


Myristoyl-Acyl Carrier Dehydratase 




0098 


(CT010) 


htrB 


Acyltransferase




0271 


(CT136) 




Lysophospholipase Esterase




0615 


(CT496) 


pgsA_1


Glycerol-3-P Phosphatidyltransferase_1




0947 


(CT797) 


pgsA_2


Glycerol-3-P Phosphatidyltransferase_2




0958 


(CT807) 


plsB 


Glycerol-3-P Acyltransferase




0569 


(CT453) 


plsC 


Glycerol-3-P Acyltransferase




0962 


(CT811) 


plsX


FA/Phospholipid Synthesis Protein 




0839 


(CT699) 


psdD 


Phosphatidylserine Decarboxylase




0983 


(CT826) 


pssA 


Glycerol-Serine Phosphatidyltransferase




0921 


(CT775) 




sn-Glycerol-3-P Acyltransferase




0654 


(CT535) 


yciA


Acyl-CoA Thioesterase 




0877 


(CT736) 


ybcL 


CT736 Hypothetical Protein 



LPS 
0154


(CT208) 


gseA 


KDO Transferase 




0721 


(CT655) 


kdsA 


KDO Synthetase 




0235 


(CT182) 


kdsB 


Deoxyoctulonosic Acid Synthetase 




0650 


(CT531)


lpxA


Acyl-Carrier UDP-GlcNAc O-Acyltransferase




0965 


(CT411)


lpxB


Lipid A Disaccharide Synthase




0652 


(CT533) 


lpxC


Myristoyl GlcNac Deacetylase 




0302 


(CT243) 


lpxD


UDP Glucosamine N-Acyltransferase




Membrane Proteins, Lipoproteins & Porins




0310 


(CT251) 


60IM 


60kDa Inner Membrane Protein 




0556 


(CT442) 


crpA 


15kDa Cysteine-Rich Protein




0653 


(CT534) 


cutE


Apolipoprotein N-Acetyltransferase




0311 


(CT252) 


lgt


Prolipoprotein Diacylglycerol Transferase




0558 


(CT444) 


omcA 


9kDa-Cysteine-Rich Lipoprotein




0557 


(CT443)


omcB 


60kDa Cysteine-Rich OMP 




0695 


(CT681) 


ompA 


Major Outer Membrane Protein 




0854 


(CT713) 


ompB


Outer Membrane Protein B 




0781 


(CT600) 


pal 


Peptidoglycan-Associated Lipoprotein




0300 


(CT241)


yaeT


Omp85 Homolog




Peptidoglycan








0417 


(CT268) 


amiA 


N-Acetylmuramoyl Alanine Amidase




0780 


(CT601) 


amiB


N-Acetylmuramoyl-L-Ala Amidase




0672 


(CT551) 


dacF 


D-Ala-D-Ala Carboxypeptidase




0968 


(CT816) 


glmS 


Glucosamine-Fructose-6-P Aminotransferase




0749 


(CT629) 


glmU 


UDP-GlcNAc Pyrophosphorylase




0900 


(CT757) 


mraY


Muramoyl-Pentapeptide Transferase 




0571 


(CT455) 


murA 


UDP-N-Acetylglucosamine Transferase




0988 


(CT831) 


murB


UDP-N-Acetylenolpyruvoylglucosamine Reductase




0905 


(CT762) 


murC&ddlA Muramate-Ala Ligase & D-Ala-D-Ala Ligase




0901 


(CT758) 


murD 


Muramoylalanine-Glutamate Ligase




0418 


(CT269) 


murE


N-Acetylmuramoylalanylglutamyl DAP Ligase




0899 


(CT756) 


murF


Muramoyl-DAP Ligase




0904 


(CT761) 


murG


Peptidoglycan Transferase




0902 


(CT759) 


nlpD 


Muramidase (invasin repeat family) 




0694 


(CT682) 


Pbp2 


PBP2-Transglycolase/Transpeptidase




0419 


(CT270) 


Pbp3 


Transglycolase/Transpeptidase




0421 


(CT272) 


yabC 


PBP2B Family Methyltransferase










Cellular Processes




Cell Division 








0959 


(CT808) 


cafE 


Axial Filament Protein 




0880 


(CT739) 


ftsK


Cell Division Protein FtsK 




0903 


(CT760) 


ftsW


Cell Division Protein FtsW 




0972 


(CT820) 


ftsY


Cell Division Protein FtsY 




0617 


(CT498) 


gidA 


FAD-dependent Oxidoreductase




0805 


(CT582) 


minD


Chromosome Partitioning ATPase 




0850 


(CT709) 


mreB


Rod Shape Protein-Sugar Kinase 




0867 


(CT726) 


rodA 


Rod Shape Protein 




0684 


(CT688) 


parB 


Chromosome Partitioning Protein




Detoxification 








0057 


(CT294) 


sodM 


Superoxide Dismutase (Mn) 




0778 


(CT603) 


ahpC 


Thio-specific Antioxidant (TSA) Peroxidase




Signal Transduction 








0148 


(CT145)




S/T Protein Kinase




0584 


(CT467) 


atoS


Two-Component Sensor 




0294 


(CT235) 




cAMP-Dependent Protein Kinase Regulatory Subunit 




0712 


(CT664) 




(FHA domain)



0478 


(CT379) 


hflX 


GTP Binding Protein 


0703 


(CT673)




S/T Protein Kinase


0095 


(CT301) 




S/T Protein Kinase


0397 






rriVp rumpni MW rwiiijf 




tr r r\'M\ 

J J/) 


port 


no rnospnoc*mer rroicin rip< 


0038 


(CT336) 


ptsI


PTS PEP Phosphotransferase 


0060 


(CT291) 


ptsN_1


PTS IIA Protein_1


0061 


(CT290) 


ptsN_2 


PTS IIA Protein + HTH DNA-Binding Domain


0262 


(CT218) 


surE 


SurE-like Acid Phosphatase 


0838 


(CT698) 


thdF 


Thiophene/Furan Oxidation Protein 


0693 


(CT683) 




TPR Repeats-CT683 Hypothetical Protein


0321 


(CT092) 


ychF 


GTP Binding Protein


0544 


(CT418)


yhbZ 


GTP binding protein 


0844 


(CT703) 


yphC 


GTPase/GTP-binding protein


Standard Protein Secretion 




0115 


(CT025) 


ffh


Signal Recognition Particle GTPase


0363 


(CT060) 


flhA 


Flagellar Secretion Protein 


0858 


(CT717) 


fliI


Flagellum-specific ATP Synthase


0704 


(CT672) 


fliN 


Flagellar Motor Switch Domain/YscQ family


0815 


(CT572) 


gspD 


Gen. Secretion Protein D


0816 


(CT571)


gspE 


Gen. Secretion Protein E 


0817 


(CT570) 


gspF 


Gen. Secretion Protein F 


0359 


(CT064) 


lepA 


GTPase 


0110 


(CT020) 


lepB


Signal Peptidase I


0535 


(CT408) 


lspA


Lipoprotein Signal Peptidase 


0260 


(CT141) 


secA_1


Protein Translocase Subunit_1


0841 


(CT701) 


secA_2


Translocase SecA_2


0564 


(CT448) 


secD&secF Protein Export Proteins SecD/SecF (fusion)


0075 


(CT321)


secE


Preprotein Translocase


0629 


(CT510) 


secY 


Translocase


0848 


(CT707) 


tig 


Trigger Factor-Peptidyl-prolyl Isomerase


Transport-Related Proteins 




0486 






Hypothetical Proline Permease 


0289 


(CT230) 


aaaT 


Neutral Amino Acid (Glutamate) Transporter


0691 


(CT685)


abcX 


ABC Transporter ATPase 


1031 


(CT374) 


arcD 


Arginine/Ornithine Antiporter


0482 


(CT381) 


artJ


Arginine Periplasmic Binding Protein 


0836 


(CT554) 


brnQ


Amino Acid (Branched) Transport 


0536 


(CT409) 


dagA_1


D-Ala/Gly Permease_1


0876 


(CT735) 


dagA_2 


D-Alanine/Glycine Permease_2 


0682 


(CT690) 


dppD 


ABC ATPase Dipeptide Transport


0683 


(CT689) 


dppF 


ABC ATPase Dipeptide Transport


0280 


(CT689) 


dppF 


Dipeptide Transporter ATPase


0785 


(CT596) 


exbB 


Macromolecule Transporter 


0784 


(CT597) 


exbD 


Biopolymer Transport Protein 


0604 


(CT486) 


fliY


Glutamine Binding Protein


0192 


(CT129) 


glnP


ABC Amino Acid Transporter Permease 


0191 


(CT130) 


glnQ


ABC Amino Acid Transporter ATPase 


0528 


(CT401) 


gltT 


Glutamate Symport 


0286 


(CT194) 


mgtE


Mg++ Transporter (CBS Domain)


0413 


(CT264) 


msbA


Transport ATP Binding Protein 


0290 


(CT231) 




Na+-dependent Transporter


0195 


(CT198) 


oppA_1


Oligopeptide Binding Protein_1


0196 


(CT198)


oppA_2


Oligopeptide Binding Protein_2


0197 


(CT139) 


oppA_3 


Oligopeptide Binding Protein_3 


0198 


(CT175)


oppA_4


Oligopeptide Binding Protein_4



0599 


(CT480) 


oppA_5


Oligopeptide Binding Lipoprotein_5


0199 


(CT199) 


oppB_1


Oligopeptide Permease_1


0598 


(CT479) 


oppB_2 


Oligopeptide Permease_2 


0200 


(CT200) 


oppC_1


Oligopeptide Permease_1


0597 


(CT478) 


oppC_2 


Oligopeptide Permease_2


0201 


(CT201) 


oppD 


Oligopeptide Transport ATPase 


0202 


(CT202) 


oppF 


Oligopeptide Transport ATPase


0231 


(CT180)


tauB 


ABC Transport ATPase (Nitrate/Fe)


0782 


(CT599) 


tolB 


Macromolecule Transporter


0969 


(CT817) 


tyrP_1


Tyrosine Transport_1


0970 


(CT818)


tyrP_2


Tyrosine Transport_2


0665 


(CT544) 


uhpC 


Hexosephosphate Transport


0282 


(CT216) 


xasA


Amino Acid Transporter


0207 


(CT204) 


ybhI


Dicarboxylate Translocator


0971 


(CT819)


yccA 


Transport Permease


0248 


(CT152)


ycfV 


ABC Transporter ATPase 


1014 


(CT856) 


ychM 


Sulfate Transporter


0736 


(CT641) 


ygeD 


Efflux Protein


0680 


(CT692) 


ygo4


Phosphate Permease


0723 


(CT653) 


yhbG 


ABC Transporter ATPase 


0023 


(CT348) 


yii* 


ABC Transporter Protein ATPase 


0127 


(CT034) 


yrfF 


Cationic Amino Acid Transporter


0349 


(CT067) 


ytgA 


Solute Protein Binding Family


0348 


(CT068) 


ytgB


ABC Transporter ATPase


0347 


(CT069) 


ytgC


Integral Membrane Protein 


0346 


(CT070) 


ytgD 


Integral Membrane Protein 


1012 


(CT854) 


yzcB 


ABC Transporter Permease 


0868 


(CT727) 


zntA 


Metal Transport P-type ATPase


0279 






Possible ABC Transporter Permease Protein 


0543 


(CT417) 




(Metal Transport Protein) 


0692 


(CT684) 




ABC Transporter


0542 


(CT416) 




ABC Transporter ATPase 


0690 


(CT686) 




ABC Transporter Membrane Protein


0541 


(CT415) 




solute binding protein 



Type III Secretion



0323 


(CT090) 


lcrD


Low Calcium Response D 


0324 


(CT089) 


lcrE


Low Calcium Response E


0811 


(CT576) 


lcrH_1


Low Ca Response Protein H_l 


1021 


(CT862) 


lcrH_2 


Low Calcium Response_2


0325 


(CT088) 


sycE 


Secretion Chaperone


0702 


(CT674) 


yscC 


Yop C/Gen Secretion Protein D


0828 


(CT559) 


yscJ 


Yop Translocation J 


0826 


(CT561) 


yscL 


Yop Translocation L 


0707 


(CT669) 


yscN 


Yop N (Flagellar-Type ATPase)


0825 


(CT562) 


yscR 


Yop Translocation R 


0824 


(CT563) 


yscS 


YopS Translocation Protein 


0823 


(CT564) 


yscT 


YopT Translocation T


0322 


(CT091) 


yscU 


Yop Translocation Protein U 






Central Intermediary Metabolism 



Glycogen Metabolism 
0856 (CT715) UDP-Glucose Pyrophosphorylase
0948 (CT798) glgA Glycogen Synthase
0475 (CT866) glgB Glucan Branching Enzyme
0607 (CT489) glgC Glucose-1-P Adenyltransferase
0307 (CT248) glgP Glycogen Phosphorylase
0388 (CT042) glgX Glycogen Hydrolase (debranching)
0326


(CT087) 


malQ


Glucanotransferase




0851 


(CT710) 


pckA 


Phosphoenolpyruvate Carboxykinase




Phosphorous & Sulfur 








0548 


(CT435) 


cysJ


Sulfite Reductase 




0920 


(CT774) 


cysQ 


Sulfite Synthesis/Biphosphate Phosphatase




0025 


(CT346) 


atsA 


Sulphohydrolase 




0918 


(CT772)


ppa 


Inorganic Pyrophosphatase 










DNA Replication, Modification, 




DNA Mismatch Repair 








0505 






3-Methyladenine DNA Glycosylase




0812 


(CT575) 


mutL


DNA Mismatch Repair 




0941 


(CT792) 


mutS 


DNA Mismatch Repair 




0402 


(CT107) 


mutY


Adenine Glycosylase 




0732 


(CT625) 


nfo 


Endonuclease IV




0837 


(CT697) 


nth 


Endonuclease III




DNA Modification 








0596 


(CT477) 


ada 


Methyltransferase




0114 


(CT024) 


hemK 


A/G-specific Methylase 




0891 


(CT748) 


mfd 


Transcription-Repair Coupling




0620 


(CT501) 


ruvA


Holliday Junction Helicase




0390 


(CT040)


ruvB


Holliday Junction Helicase




0621 


(CT502) 


ruvC 


Crossover Junction Endonuclease 




0053 


(CT298) 


sms 


Sms Protein 




0773 


(CT607) 


ung 


Uracil DNA Glycosylase 




1062 


(CT329) 


xseA


Exodeoxyribonuclease VII




DNA Recombination 








0762 


(CT650)


recA 


RecA Recombination Protein 




0738 


(CT639) 


recB 


Exodeoxyribonuclease V, Beta




0737 


(CT640) 


recC 


Exodeoxyribonuclease V, Gamma




0123 


(CT033) 


recD_1


Exodeoxyribonuclease V (Alpha Subunit)_1




0752 


(CT652) 


recD_2


Exodeoxyribonuclease V, Alpha_2




0339 


(CT074) 


recF 


ABC Superfamily ATPase




0340 


(CT074) 




(frame-shift with 0339) 




0563 


(CT447) 


recJ


ssDNA Exonuclease 




0299 


(CT240)


recR 


Recombination Protein 




DNA Replication 








0309 


(CT250) 


dnaA_1


Replication Initiation Protein_1




0424 


(CT275) 


dnaA_2 


Replication Initiation Factor_2 




0616 


(CT497) 


dnaB 


Replicative DNA Helicase




0666 


(CT545) 


dnaE 


DNA Pol III Alpha 




0942 


(CT794) 


dnaG 


DNA Primase 




0338 


(CT075) 


dnaN


DNA Pol III (Beta) 




0410 


(CT261)


dnaQ_1


DNA Pol III Epsilon Chain_1




0655 


(CT536) 


dnaQ_2


DNA Pol III Epsilon Chain_2




0040 


(CT334) 


dnaX_1


DNA Pol III Gamma and Tau_1




0272 


(CT187) 


dnaX_2 


DNA Pol III Gamma and Tau_2




0149 


(CT146)


dnlJ


DNA Ligase 




0274 


(CT189) 




DNA Gyrase Subunit A_1




0716 


(CT660) 


gyrA_2


DNA Gyrase Subunit A_2 




0275 


(CT190) 


gyrB_1


DNA Gyrase Subunit B_l 




0715 


(CT661) 


gyrB_2


DNA Gyrase Subunit B_2 




0416 


(CT267) 


himD 


Integration Host Factor Alpha 




0612 


(CT493) 


polA 


DNA Polymerase I




0924 


(CT778) 


priA 


Primosomal Protein N'




0386 


(CT044) 


ssb 


SS DNA Binding Protein 



0835


(CT555)




SWI/SNF family helicase_1




0849 


(CT708) 




SWI/SNF family helicase_2




0769 


(CT643)


topA


DNA Topoisomerase I-Fused to SWI Domain




0034 


(CT347) 


xerC 


Integrase/recombinase




1024 


(CT864) 


xerD


Integrase/recombinase




Eukaryotic-Type Chromatin Factors 




0886 


(CT743) 


hctA 


Histone-Like Developmental Protein




0384 


(CT046) 


hctB


Histone-like Protein 2 




0878 


(CT737) 




SET Domain protein 




0577 


(CT460) 




SWIB (YM74) Complex Protein




UVR Excinuclease Repair System






0096 


(CT333) 


uvrA 


Excinuclease ABC Subunit A




0801 


(CT586) 


uvrB 


Excinuclease ABC Subunit B




0940 


(CT791) 


uvrC 


Excinuclease ABC Subunit C




0772 


(CT608) 


uvrD 


DNA Helicase 










Energy Metabolism




Aerobic 










0855 


(CT714) 


gpdA 


Glycerol-3-P Dehydrogenase 




0743 


(CT634) 


nqrA 


Ubiquinone Oxidoreductase, Alpha




0427 


(CT278) 


nqr2 


NADH (Ubiquinone) Dehydrogenase 




0428 


(CT279) 


nqr3 


NADH (Ubiquinone) Oxidoreductase, Gamma




0429 


(CT280) 


nqr4 


NADH (Ubiquinone) Reductase 4 




0430 


(CT281)


nqr5


NADH (Ubiquinone) Reductase 5 




0883 


(CT740) 


nqr6 


Phenolhydrolase/NADH (Ubiquinone) Oxidoreductase 6 




ATP Biogenesis and metabolism 






0351 


(CT065) 


adt_1


ADP/ATP Translocase_1




0614 


(CT495) 


adt_2 


ADP/ATP Translocase_2 




0088 


(CT308) 


atpA


ATP Synthase Subunit A




0089 


(CT307) 


atpB


ATP Synthase Subunit B 




0090 


(CT306) 


atpD 


ATP Synthase Subunit D 




0086 


(CT310) 


atpE 


ATP Synthase Subunit E




0091 


(CT305) 


atpI


ATP Synthase Subunit I




0092 


(CT304) 


atpK 


ATP Synthase Subunit K




0860 


(CT719) 


fliF


Flagellar M-Ring Protein 




Electron Transport Chain 






0102 


(CT013) 


cydA 


Cytochrome Oxidase Subunit I




0103 


(CT014) 


cydB 


Cytochrome Oxidase Subunit II




0364 


(CT059) 




Ferredoxin 




0084 


> (CT312) 




Predicted Ferredoxin 




Glycolysis & Gluconeogenesis






0281 


(CT215)


dhnA 


Predicted 1,6-Fructose Biphosphate Aldolase 




0800 (CT587) 


eno 


Enolase




0624 (CT505) 


gapA


Glyceraldehyde-3-P Dehydrogenase




0056 (CT295) 


mrsA 


Phosphomannomutase




0967 (CT815)


pgm


Phosphoglucomutase




0160 (CT207) 


pfkA_1


Fructose-6-P Phosphotransferase_1




0208 (CT205)


pfkA_2


Fructose-6-P Phosphotransferase_2




1025 (CT378) 


pgi 


Glucose-6-P Isomerase




0679 (CT693) 


pgk


Phosphoglycerate Kinase 




0863 (CT722) 


pgmA 


Phosphoglycerate Mutase 




0097 (CT332) 


pyk


Pyruvate Kinase 




1063 (CT328) 


tpiS 


Triosephosphate Isomerase




Pentose Phosphate Pathway 






0239 (CT186) 


devB


Glucose-6-P Dehydrogenase (DevB family)




1060 (CT331) 


dxs


Transketolase 






0360 


(CT063)


gnd


6-Phosphogluconate Dehydrogenase


0185 


(CT121) 


rpe


Ribulose-P Epimerase


0141 


(CT213) 


rpiA 


Ribose-5-P Isomerase A


0083 


(CT313) 


tal 


Transaldolase


0893 


(CT750) 


tktB


Transketolase


0238 


(CT185)


zwf 


Glucose-6-P Dehydrogenase


Pyruvate Dehydrogenase 




0833 


(CT557) 


lpdA


Lipoamide Dehydrogenase


0436 


(CT285) 


lplA_1


Lipoate Protein Ligase-Like Protein


0618 


(CT499) 


lplA_2


Lipoate-Protein Ligase A


0033 


(CT340) 


pdhA&B 


Oxoisovalerate Dehydrogenase α/β Fusion


0304 


(CT245) 


pdhA 


Pyruvate Dehydrogenase Alpha 


0305 


(CT246) 


pdhB 


Pyruvate Dehydrogenase Beta 


0306 


(CT247) 


pdhC


Dihydrolipoamide Acetyltransferase 


TCA Cycle 






0495 


(CT390) 


aspC 


Aspartate Aminotransferase 


1013 


(CT855) 


fumC 


Fumarate Hydratase


1028 


(CT376) 


mdhC 


Malate Dehydrogenase


0789 


(CT592) 


sdhA 


Succinate Dehydrogenase


0790 


(CT591) 


sdhB 


Succinate Dehydrogenase 


0788 


(CT593) 


sdhC 


Succinate Dehydrogenase 


0378 


(CT054) 


sucA 


Oxoglutarate Dehydrogenase 


0377 


(CT055) 


sucB_l 


Dihydrolipoamide Succinyltransferase_1


0527 


(CT400) 


sucB_2


Dihydrolipoamide Succinyltransferase_2


0973 


(CT821) 


sucC 


Succinyl-CoA Synthetase, Beta 


0974 


(CT822) 


sucD 


Succinyl-CoA Synthetase, Alpha 



Protein Folding, Assembly & Modification





Chaperones 








0949 


(CT799) 


ctc


General Stress Protein 




0534 


(CT407) 


dksA 


DnaK Suppressor 




0032 


(CT341) 


dnaJ 


Heat Shock Protein J 




0503 


(CT396) 


dnaK 


Hsp-70 




0134 


(CT110)


groEL_1


Hsp-60_1




0777 


(CT604) 


groEL_2


Hsp-60_2




0898 


(CT755) 


groEL_3


Hsp-60_3




0135 


(CT111)


groES


10KDa Chaperonin




0502 


(CT395) 


grpE 


HSP-70 Cofactor 




0661 


(CT541) 


mip


FKBP-type Peptidyl-prolyl Cis-Trans Isomerase




Proteases 










0144 


(CT113)


clpB


Clp Protease ATPase 




0437 


(CT286) 


clpC 


ClpC Protease 




0520 


(CT431) 


clpP_1


CLP Protease 




0847 


(CT706) 


clpP_2


CLP Protease Subunit 




0846 


(CT705) 


clpX 


CLP Protease ATPase 




0269 


(CT138)




Dipeptidase




0998 


(CT841) 


ftsH 


ATP-dependent Zinc Protease 




0030 


(CT343) 


gcp_1


O-Sialoglycoprotein Endopeptidase_1




0194 


(CT197)


gcp_2


O-Sialoglycoprotein Endopeptidase_2




0979 


(CT823) 


htrA 


DO Serine Protease 




0957 


(CT806) 


ide 


Insulinase family/Protease III




0027 


(CT344) 


lon


Lon ATP-dependent Protease 




1017 


(CT859) 


lytB 


Metalloprotease 




1009 


(CT851) 


map 


Methionine Aminopeptidase 




0385 


(CT045) 


pepA 


Leucyl Aminopeptidase A




0136 


(CT112) 


pepF 


Oligopeptide 
0813


(CT574)


pepP


Aminopeptidase P


0613 


(CT494) 




Protease 


0555 


(CT441) 


tsp 


Tail-Specific Protease


0344 


(CT072) 


yaeL 


Metalloprotease


0981 


(CT824)




Zinc Metalloprotease (insulinase family) 


Protein Isomerases






0227 


(CT176) 


dsbB 


Disulfide bond Oxidoreductase 


0786 


(CT595) 


dsbD 


Thiol:disulfide Interchange Protein


0228 


(CT177) 


dsbG 


Disulfide Bond Chaperone


0933 


(CT783) 




Predicted Disulfide Bond Isomerase


0926 


(CT780) 




Thioredoxin Disulfide Isomerase
Transcription



RNA Degradation 





0999 


(CT842) 


pnp 


Polyribonucleotide Nucleotidyltransferase 




0054 


(CT297) 


rnc


Ribonuclease III




0119 


(CT029) 


rnhB_1


Ribonuclease HII_1




1068 


(CT008) 


rnhB_2


Ribonuclease HII_2




0934 


(CT784) 


rnpA


Ribonuclease P Protein Component 




0504 


(CT397) 


vacB 


Ribonuclease Family 




RNA Elongation & Termination Factors




0741 


(CT636) 


greA 


Transcription Elongation Factor 




0316 


(CT097) 


nusA 


N Utilisation Protein A 




0076 


(CT320) 


nusG 


Transcriptional Antitermination




0845 


(CT704)


pcnB_1


Poly A Polymerase_1




0966 


(CT410)


pcnB_2


Poly A Polymerase_2




0610 


(CT491)


rho 


Transcription Termination Factor 




RNA Methylases








0674 


(CT553) 


fmu 


RNA Methyltransferase 




1059 


(CT354) 


ksgA


Dimethyladenosine Transferase




0187 


(CT133)




Predicted Methylase 




0530 


(CT403) 


spoU_1


rRNA Methylase_1




0660 


(CT540) 


spoU_2 


rRNA Methylase_2 




0117 


(CT027) 


trmD


tRNA (Guanine N-1)-Methyltransferase




0885 


(CT742) 


ygcA 


rRNA Methyltransferase




0986 


(CT829) 


y»H 


Predicted rRNA Methylase 




0987 


(CT830)


ytgB 


Predicted rRNA Methylase 




RNA Modification 








0649 


(CT530) 


fmt 


Methionyl tRNA Formyltransferase




0910 


(CT766) 


miaA 


tRNA Pyrophosphate Transferase 




0719 


(CT658) 


sfhB 


Predicted Pseudouridine Synthase




0219 


(CT193) 


tgt 


Queuine tRNA Ribosyl Transferase 




0580 


(CT463) 


truA


Pseudouridylate Synthase I




0319 


(CT094) 


truB


tRNA Pseudouridine Synthase 




0403 


(CT106) 


yceC 


Predicted Pseudouridine Synthetase Family




0864 


(CT723) 


yjbC


Predicted Pseudouridine Synthase 



RNA Polymerase & Transcription Regulators 






0586 


(CT468)


atoC 


Two-Component Regulator 


0362 


(CT061) 


rpsD 


Sigma-28/WhiG Family 


0501 


(CT394) 


hrcA 


HTH Transcriptional Repressor 


0793 


(CT588) 


rbsU 


Sigma Regulatory Family Protein— PP2C Phosphatase (RsbW Antagonist) 


0626 


(CT507) 


rpoA 


RNA Polymerase Alpha 


0081 


(CT315) 


rpoB 


RNA Polymerase Beta 


0082 


(CT314) 


rpoC 


RNA Polymerase Beta' 


0756 


(CT615) 


rpoD 


RNA Polymerase Sigma-66 


0771 


(CT609) 


rpoN 


RNA Polymerase Sigma-54 


0511 


(CT424) 


rsbV_1 


Sigma Regulatory Factor_1


0909 


(CT765) 


rsbV_2 


Sigma Factor Regulator_2


0670 


(CT549) 


rsbW 


Sigma Regulatory Factor-Histidine Kinase


0750 


(CT630) 


tctD


HTH Transcriptional Regulatory Protein + Receiver Domain


1069 


(CT009) 


yfgA


HTH Transcriptional Regulator 








Translation 


Amino Acyl tRNA Synthesis




0892 


(CT749) 


alaS 


Alanyl tRNA Synthetase


0570 


(CT454) 


argS


Arginyl tRNA Transferase 


0662 


(CT542) 


aspS 


Aspartyl tRNA Synthetase
0932


(CT782) 


cysS 


Cysteinyl tRNA Synthetase


0003 


(CT003) 


gatA 


Glu tRNA Gln Amidotransferase (A Subunit)


0004 


(CT004) 


gatB 


Glu tRNA Gln Amidotransferase (B Subunit)


0002 


(CT002) 


gatC 


Glu tRNA Gln Amidotransferase (C Subunit)


0560 


(CT445) 


gltX


Glutamyl-tRNA Synthetase 


0946 


(CT796) 


glyQ


Glycyl tRNA Synthetase 


0663 


(CT543) 


hisS 


Histidyl tRNA Synthetase


0109 


(CT019) 


ileS 


Isoleucyl-tRNA Synthetase


0153 


(CT209) 


leuS 


Leucyl tRNA Synthetase 


0931 


(CT781) 


lysS 


Lysyl tRNA Synthetase 


0122 


(CT032) 


metG 


Methionyl-tRNA Synthetase 


0993 


(CT836) 


pheS 


Phenylalanyl tRNA Synthetase, Alpha


0594 


(CT475) 


pheT 


Phenylalanyl tRNA Synthetase Beta


0500 


(CT393) 


proS


Prolyl tRNA Synthetase 


0870 


(CT729) 


serS 


Seryl tRNA Synthetase _2 


0806 


(CT581) 


thrS 


Threonyl tRNA Synthetase 


0802 


(CT585) 


trpS 


Tryptophanyl tRNA Synthetase 


0361 


(CT062) 


tyrS


Tyrosyl tRNA Synthetase 


0094 


(CT302) 


valS 


Valyl tRNA Synthetase 



Peptide Chain Initiation, Elongation & Termination



1067 


(CT353) 


def 


Polypeptide Deformylase


0184 


(CT122) 


efp_1


Elongation Factor P_l 


0895 


(CT752) 


efp_2


Elongation Factor P_2 


0550 


(CT437) 


fusA 


Elongation Factor G 


0073 


(CT323) 


infA 


Initiation Factor IF-1


0317 


(CT096) 


infB


Initiation Factor-2 


0990 


(CT833) 


infC


Initiation Factor 3 


0113 


(CT023) 


prfA


Peptide Chain Releasing Factor 


0576 


(CT459) 


prfB 


Peptide Chain Release Factor 2 


0950 


(CT800) 


pth 


Peptidyl tRNA Hydrolase 


0318 


(CT095) 


rbfA 


Ribosome Binding Factor A


0699 


(CT677) 


rrf 


Ribosome Releasing Factor 


0697 


(CT679) 


tsf


Elongation Factor TS 


0074 


(CT322) 


tufA 


Elongation Factor Tu


Ribosomal Proteins






0078 


(CT318) 


rl1


L1 Ribosomal Protein


0644 


(CT525) 


rl2


L2 Ribosomal Protein 


0647 


(CT528) 


rl3 


L3 Ribosomal Protein 


0646 


(CT527) 


rl4


L4 Ribosomal Protein 


0635 


(CT516)


rl5


L5 Ribosomal Protein 


0633 


(CT514)


rl6


L6 Ribosomal Protein 


0080 


(CT316) 


rl7 


L7/L12 Ribosomal Protein


0953 


(CT803) 


rl9


L9 Ribosomal Protein 


0079 


(CT317) 


rl10


L10 Ribosomal Protein 


0077 


(CT319) 


rl11


L11 Ribosomal Protein


0247 


(CT125)


rl13


L13 Ribosomal Protein


0637 


(CT518)


rl14


L14 Ribosomal Protein 


0630 


(CT511)


rl15


L15 Ribosomal Protein


0640 


(CT521)


rl16


L16 Ribosomal Protein


0625 


(CT506) 


rl17


L17 Ribosomal Protein


0632 


(CT513) 


rl18


L18 Ribosomal Protein


0118 


(CT028) 


rl19


L19 Ribosomal Protein


0992 


(CT835) 


rl20 


L20 Ribosomal Protein 


0546 


(CT420) 


rl21


L21 Ribosomal Protein


0642 


(CT523) 


rl22


L22 Ribosomal Protein 


0645 


(CT526) 


rl23


L23 Ribosomal Protein 
0636


(CT517)


rl24


L24 Ribosomal Protein


0545 


(CT419) 


rl27


L27 Ribosomal Protein


0327 


(CT086)


rl28 


L28 Ribosomal Protein


0639 


(CT520)


rl29




0112




rl31


L31 Ribosomal Protein


0961 


(CT810)


rl32


L32 Ribosomal Protein


0250 


(CT150) 


rl33


L33 Ribosomal Protein


0935 


(CT785) 


rl34 


L34 Ribosomal Protein


0991 


(CT834) 


rl35 


L35 Ribosomal Protein


0936 


(CT786)


rl36


L36 Ribosomal Protein


0315 


(CT098) 


rs1


S1 Ribosomal Protein


0696 


(CT680) 


rs2


S2 Ribosomal Protein


0641 


(CT522) 


rs3


S3 Ribosomal Protein


0733 


(CT626) 


rs4


S4 Ribosomal Protein


0631 


(CT512) 


rs5 


S5 Ribosomal Protein


0951 


(CT801) 


rs6 


S6 Ribosomal Protein


0551


(CT438)


rs7


S7 Ribosomal Protein


0634 


(CT515) 


rs8 


S8 Ribosomal Protein


0246 


(CT126) 


rs9 


S9 Ribosomal Protein


0549 


(CT436) 


rs10


S10 Ribosomal Protein


0627 


(CT508) 


rs11


S11 Ribosomal Protein


0552 


(CT439) 


rs12


S12 Ribosomal Protein


0628 


(CT509) 


rs13


S13 Ribosomal Protein


0937 


(CT787)


rs14


S14 Ribosomal Protein


1000 


(CT843) 


rs15


S15 Ribosomal Protein


0116 


(CT026) 


rs16


S16 Ribosomal Protein


0638 


(CT519) 


rs17


S17 Ribosomal Protein


0952 


(CT802) 


rs18


S18 Ribosomal Protein


0643 


(CT524) 


rs19


S19 Ribosomal Protein


0754 


(CT617) 


rs20 


S20 Ribosomal Protein


0031 


(CT342)


rs21


S21 Ribosomal Protein








Other Categories 


Chlamydia-Specific Proteins




0561 


(CT446) 


Euo 


CHLPS Euo Protein


0804 


(CT583) 


Gp6D 


CHLTR Plasmid Paralog


0186 


(CT119)




Similarity to IncA_1


0291 


(CT232) 


incB 


Inclusion Membrane Protein B 


0292 


(CT233) 


incC 


Inclusion Membrane Protein C 


1026 


(CT377)




LtuA Protein 


0333 


(CT080) 




LtuB Protein


0005 


(CT871)


pmp_l 


ptf\l wwn'nwli it* Outer k4*ftiKMiw Ptiiigtii CI Fimilv 
rwi]miurpnu> wwti lyKnunint rniKiu \# 


0013 


(CT871) 


pmp_2 


rwiyiiwrpiut v/uisr (vicmorwic riWHn vi rm*wj 


0014 


(CT871) 


pmp_3 


p/\1*i mm \t%rr*h i f /"Vttf^r UfffiKtsnr Pmietn Ci Parrtllv 
rwiytnurpiuc wiucr nficmu* *nv rruwin v» i «n n* j 


0015 


(CT871) 


pmp_3 


PMP_3 (frame-shift with 0014)


0016 


(CT874) 


pmp_4 


PnlumMnkir fVtt»* luf^nthnti* Pminn ft Psfnilv 
r UiyTnOTpjMv wUUi i mcnuranc r luwn w r*iiuiy 


0017 


(CT871) 


pmp_4 


PMP_4 (frame-shift with 0016)


0018 


(CT874) 


pmp_5 


Polymorphic Outer Membrane Protein G Family 


0019 


(CT871) 


pmp_5 


PMP_5 (frame-shift with 0018)


0444 


(CT871) 




Polymorphic Outer Membrane Protein G/I Family


0445 


(CT871) 


pmp_7 


Polymorphic Outer Membrane Protein G Family 


0446 


(CT871) 


pmp_8 


Polymorphic Outer Membrane Protein G Family 


0447 


(CT871) 


pmp_9


Polymorphic Outer Membrane Protein G/I Family


0450 


(CT871) 


pmp_10


Polymorphic Outer Membrane Protein G Family 


0449 


(CT871) 


pmp_10


PMP_10 (Frame-shift with 0450)



0431 
0432 
0433 
0434 
0466 



rcroi) 

(CT874) 
(CT871) 
(CT872) 
(CT869) 

0467 (CT869) 

0468 (CT869) 

0469 (CT869) 

0470 (CT869) 
0471 (CT870)
0539 (CT412)
0340 (CT413)
0963 (CT812) 
0562 

0927 
0928 
0929 
0728 
0729 
0133 
0332 



(CT622) 
(CT623) 
(CT109)
(CT081) 



pmp_11 Polymorphic Outer Membrane Protein G Family

pmp_12 Polymorphic Outer Membrane Protein (truncated) A/I Family

pmp_13 Polymorphic Outer Membrane Protein G Family

pmp_14 Polymorphic Outer Membrane Protein H Family

pmp_15 Polymorphic Outer Membrane Protein E Family

pmp_16 Polymorphic Outer Membrane Protein E Family

pmp_17 Polymorphic Outer Membrane Protein E Family

pmp_17 PMP_17 (Frame-shift with 0468)

pmp_17 PMP_17 (Frame-shift with 0469)

pmp_18 Polymorphic Outer Membrane Protein E/F Family

pmp_19 Polymorphic Membrane Protein A Family

pmp_20 Polymorphic Membrane Protein B Family

pmp_21 Polymorphic Membrane Protein D Family

CHLPS 43 kDa Protein Homolog_1

CHLPS 43 kDa Protein Homolog_2

CHLPS 43 kDa Protein Homolog_3

CHLPS 43 kDa Protein Homolog_4

CHLPN 76kDa Homolog_1 (CT622)

CHLPN 76kDa Homolog_2 (CT623)

CHLPS Hypothetical Protein

CHLTR T2 Protein



Miscellaneous Enzymes/Conserved Proteins





0193 




argR 


Possible Arginine Repressor




1046 






Aromatic Amino Acid Hydroxylase




0232 






Similarity to 5'-Methylthioadenosine Nucleosidase




0128 


(CT035) 




Biotin Protein Ligase 




0513 


(CT426) 




Fe-S Oxidoreductase_1




0911 


(CT767) 




Fe-S Oxidoreductase_2




0373 


(CT057) 


gcpE 


GcpE Protein 




0407 


(CT103)




HAD Superfamily Hydrolase/Phosphatase 




0917 


(CT771) 




Hydrolase/Phosphatase Homolog 




0488 


(CT385) 


ycfF 


HIT Family Hydrolase 




0701 


(CT675) 


karG 


Arginine Kinase 




0526 


(CT399) 


kpsF 


GutQ/KpsF Family Sugar-P Isomerase 




0919 


(CT773) 


ldh


Leucine Dehydrogenase 




0022 


(CT349) 


maf 


Maf protein 




0997 


(CT840) 


mesJ 


PP-loop superfamily ATPase 




0151 


(CT148) 


mhpA 


Monooxygenase




0730 


(CT624) 


mviN 


Integral Membrane Protein 




0861 


(CT720) 




NifU-Related Protein 




0479 


(CT380) 


phnP 


Metal Dependent Hydrolase 




0106 


(CT015) 


phoH 


ATPase 




0329 


(CT084) 




Phospholipase D Superfamily




0435 


(CT284) 




Phospholipase D Superfamily 




0581 


(CT464) 




Phosphoglycolate Phosphatase




0897 


(CT754) 




Predicted Phosphohydrolase 




0509 


(CT422) 




Predicted Metalloenzyme 




1030 


(CT375) 




Predicted D-Amino Acid Dehydrogenase




0531 


(CT404) 




SAM Dependent Methyltransferase 




0337 


(CT076) 


smpB 


Small Protein B 




0394 


(CT256)


tlyC_1


CBS Domain Protein (Hemolysin Homolog)_1




0510 


(CT423) 


tlyC_2


CBS Domains (Hemolysin Homolog)_2




0382 


(CT048) 


yabC 


SAM-Dependent Methyltransferase




0787 


(CT594) 


yabD 


PHP Superfamily (Urease/Pyrimidinase) Hydrolase




0611 


(CT492) 


yacE


Predicted Phosphatase/Kinase 




0579 


(CT462) 


yacM 


Sugar Nucleotide Phosphorylase 




0578 


(CT461) 


yaeI


Phosphohydrolase
0345


(CT071)


yaeM


CT071 Hypothetical Protein


0566


(CT450) 


yaeS 


YaeS family Hypothetical Protein 


0591 


(CT472) 


yagE


YagE family 


0039 


(CT335) 


ybaB 


YbaB family Hypothetical Protein 


0101 


(CT012) 


ybbP 


YbbP family Hypothetical Protein 


0915 


(CT769) 


ybeB 


iojap Superfamily Ortholog


0137 


(CT108)


ybgI


ACR family 


0529 


(CT402) 


ycaH 


ATPase 


0438 


(CT287) 


ycbF 


PP-loop Superfamily ATPase


0734 


(CT627) 


yceA 


YceA Hypothetical Protein 


0954 


(CT804) 


ychB 


Predicted Kinase 


0261 


(CT217)


ydaO 


PP-Loop Superfamily ATPase 


0245


(CT127) 


ydhO 


Polysaccharide Hydrolase-Invasin Repeat Family


0573 


(CT457) 


yebC 


YebC Family Hypothetical Protein 


0689 


(CT687) 


yfhO_1


NifS-related Aminotransferase_1


0862 


(CT721) 


yfhO_2


NifS-related Aminotransferase_2


0547 


(CT434) 


ygbB 


YgbB Family Hypothetical Protein 


0237 


(CT184) 


yggF 


YggF Family Hypothetical Protein 


0775 


(CT606) 


yggV 


YggV Family Hypothetical Protein 


0396 


(CT258) 


yhfO_3


NifS-related Aminotransferase_3


0605 


(CT487) 


yhhF 


Predicted Methylase 


0575 


(CT458) 


yhhY 


Amino Group Acetyl Transferase 


0592 


(CT473) 


yidD 


YidD Family 


0982 


(CT825) 


ytgN 


YtgN Family Hypothetical Protein 


0657 


(CT537) 


yjeE 


YjeE Hypothetical Protein 


0768 


(CT644)


yohI


YohI Predicted Oxidoreductase


0336 


(CT077) 


yojL 


YojL Hypothetical Protein 


0217 


(CT140)


ypdP 


YpdP Hypothetical Protein 


0140 


(CT212) 


yqdE 


YqdE Hypothetical Protein 


0263 


(CT221) 


yqfU


YqfU Hypothetical Protein 


0139 


(CT211) 


yqgE


YqgE Hypothetical Protein 


0270 


(CT137) 


ywlC 


SuA5 Superfamily-related Protein


0879 


(CT738) 


yycJ


Metal Dependent Hydrolase 








Homologs to CHLTR Hypothetical Proteins


0001 


(CT001) 


CT001 Hypothetical Protein 


0020 


(CT351) 


CT351 Hypothetical Protein 


0021 


(CT350) 


CT350 Hypothetical Protein 


0026 


(CT345) 


CT345 Hypothetical Protein 


0035 


(CT339) 


CT339 Hypothetical Protein 


0036 


(CT338) 


CT338 Hypothetical Protein 


0055 


(CT296) 


CT296 Hypothetical Protein 


0062 


(CT289) 


CT289 Hypothetical Protein 


0065 


(CT288)


CT288 Hypothetical Protein 


0068 


(CT360) 


CT360 Hypothetical Protein


0071 


(CT325) 


CT325 Hypothetical Protein 


0072 


(CT324) 


CT324 Hypothetical Protein 


0085 


(CT311)


CT311 Hypothetical Protein


0087 


(CT309) 


CT309 Hypothetical Protein 


0093 


(CT303) 


CT303 Hypothetical Protein 


0100 


(CT011)


CT011 Hypothetical Protein


0104 


(CT017) 


CT017 Hypothetical Protein 


0105 


(CT016) 


CT016 Hypothetical Protein


0107 


(CT058) 


CT058 Hypothetical Protein_1


0108 


(CT018)


CT018 Similarity


0111 


(CT021) 


CT021 Hypothetical Protein


0121 


(CT031) 


CT031 Hypothetical Protein
0129


(CT036) 




0145 


f(T1 ts\ 


PTI 1 A UvNAlhMinl Pmtwn 
vii I** n j pvuicitvii rromn 








0152


(Ul 1*7) 


(~ TI^O tlmmth*lif ill Pwtt*in 


0176 




/" T \ $\ 1 luin-i thjrir il PmMi* 
w i n j |/w uicuc*i nvmn 


0188 






0189 




111 MmwitlMiifl Puffin 
\<i iji ny pw uicu wi nwcm 


0206 


(CT203)


CT203 Hypothetical Protein




(CT178)


CT178 Hypothetical Protein


0230 


(CT179)


CT179 Hypothetical Protein


0234 


(CT181)


CT181 Hypothetical Protein


0249 


(CT151)


CT151 Hypothetical Protein


0253 


(CT144)


CT144 Hypothetical Protein_1


0254 


(CT143)


CT143 Hypothetical Protein_1


0255 


(CT142)


CT142 Hypothetical Protein_1


0256 


(CT144)


CT144 Hypothetical Protein_2




(CT143)


CT143 Hypothetical Protein_2


0259 


(CT142) 


CT142 Hypothetical Protein_2


0276 


(CT191)


CT191 Hypothetical Protein


0288 


(CT195)


CT195 Hypothetical Protein 


0293 


(CT234) 


CT234 Hypothetical Protein 


0301 


(CT242) 


CT242 Hypothetical Protein


0303 


(CT244) 


CT244 Hypothetical Protein 


0308 


(CT249) 


CT249 Similarity 


0312 


(CT101)


CT101 Hypothetical Protein 


0328 


(CT085) 


CT085 Hypothetical Protein 


0330 


(CT083) 


CT083 Hypothetical Protein


0331


(CT082)


CT082 Hypothetical Protein


0334 


(CT079) 


j innwiiy 


0342 


(CT073) 


CT073 Hypothetical Protein


0343 


(CT073) 


f frmm^.(hif> wtih fl1J??\ 
^iitnic-tniii wiui ujttf ; 


0350 


\\r 1 Wv; 


c 1 1 fMJ% Uu*w\fh^fi^sl PmifHti 


0369 


(CT058) 




0370 


(CT058)


CT058 Hypothetical Protein_3


0374 


(CT056) 


CT056 Hypothetical Protein


0379 


(CT053) 


CT053 Hypothetical Protein


0381 


(CT326)




0383 


(CT(\A1\ 
\\, 1 Vi 1 ) 


Wlvtf ITTUUiflCUUM riuicn* 


0387 


(CT043)


CT043 Hypothetical Protein


0389 


(CT041)


CT041 Hypothetical Protein


0393 


(CT038) 


CT038 Hypothetical Protein


0395 


(CT257) 




0399 




i '■ 1 J C ^ Uwrwif K^ti^al Pmtrjft 


0400 


(CT254) 




0401 


(CT255)


CT255 Hypothetical Protein 


0405 


(CT105) 


CT105 Hypothetical Protein


0408 


(CT102) 


CT102 Hypothetical Protein


0409 


(CT260) 


CT260 Hypothetical Protein 


0411


(CT262) 


CT262 Hypothetical Protein


0412 


(CT263) 


CT263 Hypothetical Protein 


0415 


(CT266) 


CT266 Hypothetical Protein 


0420 


(CT271)


CT271 Hypothetical Protein 


0422 


(CT273) 


CT273 Hypothetical Protein 


0423 


(CT274)


CT274 Hypothetical Protein 


0425 


(CT276) 


CT276 Hypothetical Protein


0426 


(CT277) 


CT277 Similarity 


0434 


(CT283) 


CT283 Hypothetical Protein 
0441


(CT007)


CT007 Hypothetical Protein


0442 


(CT006) 


CT006 Hypothetical Protein


0443 


(CT005) 


CT005 Hypothetical Protein


0474 


(CT365) 


CT365 Hypothetical Protein


0476 


(CT865) 


CT865 Hypothetical Protein


0480


(CT383) 


CT383 Hypothetical Protein 


0485 


(CT382) 


CT382.1 Hypothetical Protein


0487 


(CT384)


CT384 Hypothetical Protein 


0489 


(CT386) 


CT386 Hypothetical Protein 


0490 


(CT387) 


CT387 Hypothetical Protein 


0491 


(CT389) 


CT389 Hypothetical Protein 


0496 


(CT391) 


CT391 Hypothetical Protein


0497 


(CT388) 


CT388 Hypothetical Protein


0506 


(CT421) 


CT421 Hypothetical Protein 


0507 


(CT421) 


CT421.1 Hypothetical Protein


0508 


(CT421) 


CT421.2 Hypothetical Protein 


0512 


(CT425) 


CT425 Hypothetical Protein


0514 


(CT427) 


CT427 Hypothetical Protein 


0518 


(CT429) 


CT429 Hypothetical Protein 


0522 


(CT433) 


CT433 Hypothetical Protein 


0525 


(CT398) 


CT398 Hypothetical Protein 


0533 


(CT406) 


CT406 Hypothetical Protein


0537 


(CT814)


CT814.1 Hypothetical Protein


0538


(CT814)


CT814 Hypothetical Protein


0554 


(CT440) 


CT440 Hypothetical Protein


0559 


(CT441)


CT441.1 Hypothetical Protein 


0565 


(CT449) 


CT449 Hypothetical Protein


0572 


(CT456) 


CT456 Hypothetical Protein 


0582 


(CT465) 


CT465 Hypothetical Protein 


0583 


(CT466) 


CT466 Hypothetical Protein 


0588 


(CT469) 


CT469 Hypothetical Protein 


0589


(CT470) 


CT470 Hypothetical Protein 


0590 


(CT471) 


CT471 Hypothetical Protein 


0593 


(CT474) 


CT474 Hypothetical Protein


0595 


(CT476) 


CT476 Hypothetical Protein 


0601 


(CT483) 


CT483 Hypothetical Protein 


0602 


(CT484) 


CT484 Hypothetical Protein 


0606 


(CT488) 


CT488 Hypothetical Protein 


0609 


(CT490) 


CT490 Hypothetical Protein 


0622 


(CT503) 


CT503 Hypothetical Protein 


0623 


(CT504) 


CT504 Hypothetical Protein 


0648 


(CT529) 


CT529 Hypothetical Protein


0658 


(CT538) 


CT538 Hypothetical Protein


0667 


(CT546) 


CT546 Hypothetical Protein


0668 


(CT547) 


CT547 Hypothetical Protein


0669 


(CT548) 


CT548 Hypothetical Protein




tfv< cm 


i» 1 3 ju rtypouiencai nwein 






rT<« UvMttiMt'eal Pmtmn 
\» i m nypomcovsi rroicin 


0675 




r*TAOX UunAlkMiMl PffvWm 

\+ 1 wo rtypoincncBi rmcpi 


0676 


(CT695) 


CT695 Similarity


0681 


(CT691) 


CT691 Hypothetical Protein 


0687 


(CT482) 


CT482 Hypothetical Protein 


0688 


(CT481) 


CT481 Hypothetical Protein 


0700 


(CT676) 


CT676 Hypothetical Protein 


0705 


(CT671) 


CT671 Hypothetical Protein 


0706 


(CT670) 


CT670 Hypothetical Protein 


0708 


(CT668) 


CT668 Hypothetical Protein 
0709


(CT667)


CT667 Hypothetical Protein 


0710 


(CT666)


CT666 Hypothetical Protein


0711


(CT665)


CT665 Hypothetical Protein 


0713 


(CT663) 


CT663 Hypothetical Protein 


0717 


(CT656)


CT656 Hypothetical Protein


0718 


(CT657) 


CT657 Hypothetical Protein 


0720 


(CT659) 


CT659 Hypothetical Protein 


0722 


(CT654) 


CT654 Hypothetical Protein 


0725 


(CT652) 


CT652.1 Hypothetical Protein 


0726 


(CT620)


CT620 Hypothetical Protein 


0727 


(CT619)


CT619 Hypothetical Protein 


0739 


(CT638) 


CT638 Hypothetical Protein


0742 


(CT635)


CT635 Hypothetical Protein


0746 


(CT632) 


CT632 Hypothetical Protein


0747 


(CT631) 


CT631 Hypothetical Protein


U til 


f fTM 1 i 

\\m 1 QJ 1 ) 


\>iO'i nyjjumcu .*i r i u twin 


0755 


(CT616) 




0760 


(CT611)


CT611 Hypothetical Protein


0761 


(CT610)




0764 


(CT648) 


CT648 Hypothetical Protein


0765 


(CT647) 


CT647 Hypothetical Protein 


0766 


(CT646) 


CT646 Hypothetical Protein


0767 


(CT645) 


CT645 Hypothetical Protein


0770 


(CT642) 


CT642 Hypothetical Protein


0774 


(CT606) 


CT606.1 Hypothetical Protein


0776 


(CT605) 


CT605 Hypothetical Protein


0779 


(CT602) 


CT602 Hypothetical Protein 


0783 


(CT598)


CT598 Hypothetical Protein 


0791 


(CT590)


CT590 Hypothetical Protein 


0792 


(CT589)


CT589 Hypothetical Protein 


0803 


(CT584) 


CT584 Hypothetical Protein 


0807 


(CT580) 


CT580 Hypothetical Protein 


0808 


(CT579) 


CT579 Hypothetical Protein 


0809 


(CT578) 


CT578 Hypothetical Protein 


0810 


(CT577) 


CT577 Hypothetical Protein 


0814 


(CT573)


CT573 Hypothetical Protein 


0818 


(CT569) 


CT569 Hypothetical Protein 


0819 


(CT568) 


CT568 Hypothetical Protein 


0820 


(CT567) 


CT567 Hypothetical Protein 


0821 


(CT566) 


CT566 Hypothetical Protein 


0822 


(CT565) 


CT565 Hypothetical Protein 


0827 


(CT560) 


CT560 Hypothetical Protein 


0834 


(CT556) 


CT556 Hypothetical Protein


0840 


(CT700) 


CT700 Hypothetical Protein 


0842 


(CT702) 


CT702 Hypothetical Protein 


0843 


(CT702) 


CT702 Hypothetical Protein 


0852 


(CT711) 


CT711 Hypothetical Protein


0853 


(CT712) 


CT712 Hypothetical Protein


0857 


(CT716) 


CT716 Hypothetical Protein


0859 


(CT718) 


CT718 Hypothetical Protein


0865 


(CT724) 


CT724 Hypothetical Protein 


0869 


(CT728) 


CT728 Hypothetical Protein 


0874 


(CT733) 


CT733 Hypothetical Protein 


0875 


(CT734) 


CT734 Hypothetical Protein 


0884 


(CT741)


CT741 Hypothetical Protein 


0887 


(CT744) 


CHLTR Possible Phosphoprotein 


0896 


(CT753) 


CT753 Hypothetical Protein






0906 


(CT763)


CT763 Hypothetical Protein 


0908 


(CT764) 


CT764 Hypothetical Protein


0912 


(U 1 loo) 


v I 'Oo nypoineucsi rroicin 


0925 


(CT779) 


CT779 Hypothetical Protein 


0938 


(CT788) 


CT788 Hypothetical Protein


0939 


(CT790) 


CT790 Hypothetical Protein


0943 


(CT794)


CT794.1 Hypothetical Protein


0945 


(CT795) 


CT795 Hypothetical Protein 


0956 


(CT805) 


CT805 Hypothetical Protein 


0960 


(CT809) 


CT809 Hypothetical Protein 


0989 


(CT832) 


CT832 Hypothetical Protein 


0994 


(CT837) 


CT837 Hypothetical Protein 


0995 


(CT838)


CT838 Hypothetical Protein 


0996 


(CT839) 


CT839 Hypothetical Protein 


1002 


(CT845) 


CT845 Hypothetical Protein 


1003 


(CT846) 


CT846 Hypothetical Protein 


1004 


(CT847) 


CT847 Hypothetical Protein 


1005 


(CT848) 


CT848 Hypothetical Protein 


1006 


(CT849) 


CT849 Hypothetical Protein 


1007 


(CT849) 


CT849.1 Hypothetical Protein 


1008 


(CT850) 


CT850 Hypothetical Protein


1010 


(CT852)


CT852 Hypothetical Protein 


1011 


(CT853) 


CT853 Hypothetical Protein 


1015 


(CT857) 


CT857 Hypothetical Protein 


1016 


(CT858) 


CT858 Hypothetical Protein 


1019 


(CT860) 


CT860 Hypothetical Protein 


1020 


(CT861)


CT861 Hypothetical Protein 


1022 


(CT863) 


CT863 Hypothetical Protein 


1032 


(CT373) 


CT373 Hypothetical Protein 


1033 


(CT372) 


CT372 Hypothetical Protein 


1034 


(CT371) 


CT371 Hypothetical Protein 


1057 


(CT356) 


CT356 Hypothetical Protein


1058 


(CT355) 


CT355 Hypothetical Protein


1061 


(CT330) 


CT330 Hypothetical Protein 


1073 


(CT371) 


CT371 Hypothetical Protein 






0486 
0279 
0505 
0193 
1041 
1044 
1042 
0585 
0562 
0927 
0928 
0929 
1045 
0251 
0278 
0907 
0171 
0172 
0608 
0735 



Coding Genes Not in C. trachomatis
Hypothetical Proline Permease
Possible ABC Transporter Permease Protein
3-Methyladenine DNA Glycosylase
argR Similarity to Arginine Repressor

bioA Adenosylmethionine-8-Amino-7-Oxononanoate Aminotransferase

bioB Biotin Synthase

bioD Dethiobiotin Synthetase

Similarity to Cps IncA_2

CHLPS 43 kDa Protein Homolog_1

CHLPS 43 kDa Protein Homolog_2

CHLPS 43 kDa Protein Homolog_3

CHLPS 43 kDa Protein Homolog_4

Conserved Hypothetical Membrane Protein

Conserved Hypothetical Protein

Conserved Outer Membrane Lipoprotein Protein

CutA-like Periplasmic Divalent Cation Tolerance Protein
guaA GMP Synthase

guaB Inosine 5'-Monophosphate Dehydrogenase
Uridine 5'-Monophosphate Synthase
Uridine Kinase
0980 Similar to Saccharomyces cerevisiae 52.9 kDa Protein

0232 Similarity to 5'-Methylthioadenosine Nucleosidase

1046 Tryptophan Hydroxylase

0477 yqeV_Bs Conserved Hypothetical Protein

0048 yqfF_Bs Conserved Hypothetical IM Protein

0587 yvyD_Bs Conserved Hypothetical Protein

0143 yxjG_Bs_1 Conserved Hypothetical Protein

0448 yxjG_Bs_2 Conserved Hypothetical Protein



0006 


0180 


0440 


0977 


0007 


0181 


0455 


0978 


0008 


0190 


0456 


1018 


0009 


0203 


0457 


1023 


0010 


0204 


0458 


1027 


0011 


0205 


0459 


1029 


0012 


0209 


0460 


1040 


0028 


0210 


0461 


1051 


0029 


0211 


0462 


1052 


0034 


0212 


0463 


1053 


0041 


0213 


0464 


1054 


0042 


0214 


0465 


1055 


0043 


0215 


0472 


1056 


0044 


0216 


0473 


1064 


0045


0218 


0481 


1065 


0046 


0220 


0483 


1066 


0047 


0221 


0492 


1070 


0049 


0222 


0493 


1071 


0050 


0223 


0494 


1072 


0051 


0224 


0498 




0063 


0225 


0499 




0064 


0226 


0516 




0066 


0233 


0517 




0067 


0240 


0523 




0069 


0241 


0524 




0070 


0242 


0553 




0099 


0243 


0574 




0124 


0266 


0600 




0125 


0267 


0656 




0126


0268 


0664 




0130 


0277 


0677 




0131


0283 


0678 




0132 


0284 


0685 




0142 


0285 


0686 




0146 


0287 


0724 




0147 


0352 


0731 




0155 


0353 


0745 




0156 


0354 


0753 




0157 


0355 


0794 




0158 


0356 


0795 




0159 


0357 


0796 




0162 


0358 


0797 




0163 


0365 


0798 




0164 


0366 


0799 




0165 


0367 


0829 




0166 


0368 


0830 




0167 


0371 


0831 




0168 


0372 


0881 




0169 


0375 


0882 
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Cblasydia i 

cpn_oooi no 4 

CT001 hypothetical protein

KRUCDEIT^SUlRKAKLGKIirc^ 

ATorrvn ;-;AC'/:cu^rcprcntKjRHSKCD5CsscccHSHHSDiai 

-pnjinnrr 57i w 

i . " •■ . • . :;.'» >*•»!•*.• • «: *n •■: .■ . 

KH WNVCDt*REDSVTSDFMl£Ef UWVPE3 IfiGLVKVPAV X K 

CPn_0003 889 2370

gatA-Glu tRNA Gln Amidotransferase

K rKYRYSALELAXAVTLCELTATCVTQH?THR I EEAEOQVCAF rSLCKTO At .FQAFLIPK 

KRSRSCPlfiKLACSVPWCXKIWlWmaJCrTCASl^^ 

KUM)EF«ICSm.YSArHPTHK«<DLSRVTOSSGCSA^ 

QPAAFCGVVGFKPSYGAVSgreLVArASSiaOIC 

R£TPTOSFKSXLSTIVPKWOWtm«J^rREM^ 

SHAVSIYYIIJUAEAATNIARFIXJVRYCYRSPOAHT IS0LYDLSW3B 

NYVLSAERQNVYYWCATAVRAJCIVKAJlr^^ 

ODIYTVAWttAYXJ'AXAVPSGFSXKLPLGWIGCOT 

SXRYAKSWLQOOS 

CPn_0004 2334 3933

gatB-(Pet112) Glu tRNA Gln Amidotransferase (B Subunit)

SXCQKCeSlUlSIMSAVYAOWESVIGLEVHVEOTA^ 

LPCStPVLNOSAVEJCAVLfGCAVECtlSLI^FIiRX^^ 

R X KAXVOGEERYFEXAQTH X CDOAGKJCHFGSFAGVCYNRAGVPt.X EXVSKPQCFCPEBA 

VAYATSLVSUIWIGISIOJMEIXOTRFtJW^^ 

L£AEroRQIOEYLWrfflq)rKLVirAATYKWDFOT 

LOLTtSYIERIWCItPEIJYDKYHRYXQEYXSt^n^ 

RSLSKHVTVEPX3GlCmCVKLI«5GXFPeOV^ 

GKNPEEILKEKPELLPHSOECILOKI IAEV\rt*NFKIVl«W*»TKALOTM30Iia«T 
AGKAPPXRVNELLLLELOKG 

CPn_0005 4097 6892

pmp_1-Polymorphic Outer Membrane Protein

sdihfdlctxxrfslcgfplvfstiii^v^^ 

qagdvysltcsvs isnvimsallkacfnvtsgsn^agmhhslypmnxssgnigsavbc 

:SBPCATAJffSCFSTWFIOSI<aiKE0CCtYSl(KAL^^ 

SANVrXVSIYDSVSnrOHAATFGGAlHSSGPlOXAVNQ^ 

OIDIXX^YVLFMHEALTTAIGROGAVCCIJTSCSS^^ 

GCGAIYAWaJISSGGPTLFIWIISYANSC&IUMMAII^ 

SLPFUUIKLL0NAKFUa>QAItNSYSXCrYDPrrSEAXX»TO 

SGfiXStW^PlOSPXSTXPONVNLSAaYLVXXEGAEVTVSia^ 

KEDIAITOJUOXXiSLSSSSrAANrtlCANTAiaO^ 

FTli5LEPGA<3GSVIVTAGOFt*PVSPKYGF0CNI<ICLAWnn^ 

• KBQCVFmLWOttVIJVRSX/CTOETHASS^^ 

SGGYVLSVNNiLI Tt>tW YT SMATSQIJ'SRDKIff AVS?PgrRMYLCSYLYQrm'SLafI PRY 

ASRNFNVMTCXLSIUtFlXJNFraFHFLCAYG^ 

GCSMPXXVFEMGRLFOGAXPPMnjOLVYAYOTFKETTA^ 

EiaAl^g^YPFSFSYIPDIrTgDFSCt^ 

FNX7YTZX*IOG5XE€RPHAPXrNXNCGSKFRF 

CPn_0006 7299 7141 

No robust homolog present in GenBank/EMBL as of 11/7/98
KQLQEPlJtSAU£RI^SMUftIjGVFSP£rTF^ 

CPn_0007 7498 10496

No robust homolog present in GenBank/EMBL as of 11/7/98
KSPRYNLSLIFSFLWI PLTOSTTSSt^TSXXOBanCSKRXlJlXIAXVLXAl^X XLXAG 
CVVIATVAIPGLSSVXSSPAGM3ACAl£CVHLAl^ia^^ ITPUIOS 
PRSGISISGArOTIRSLPTYIXDraPQSKRXUULAI^ 

CLS8VISSPAOCACAI/^VMIJtf^X7VIJJaCPJV^IVtj^ 1 1 1 PUOJiPfl SGXSISGA 

DSTIRSLPTYPUJEXWPOSWlKLRIXAIVLIVrSX ILIASGWLLTVAIPGLSSI ISSPA 

EMGAeAI^74LAIfiIXWLLKia*TOIVVPAPIPEEV^ 

PEDtSAFB7fXICWESHlXNKKSLPYCGHGt£OCrKKQXRVVRSS^ 

EEEXFTFl£AWaU.IDLATTLVERJCILTE»X£WW 

vmFNII^XCRrrXIFENHEHGVAXStXHXKAVtXEKVXYRSLO^^ 

LKGNTFFSLEt)N10mMXra»E H LES L SSYMCVr^ 

uALSEISRDE3WQKXAKUCH0E£LYTOAW^ 

FgIERVQ£RIRAIQKLYPNlLE3lE E£l I UJL1 VTPTVOCTTASSDLTOILCRIEVSSRED 
NQNO£SCVKVIJlSHEVD(SWEVla}eYGFra 

KLSY FWCVNNXX EVOY AKFRUCVLESDLEG ILAOTCSAESLLTOEELP I LATRGAIXKAV 
FKG5l£CAMSKAXPYrXEDP1tFQ05DTOUULTtR^ 
ERKl^EJKCTFERAGLGVUEXAVESTYDLRSLTJ?^ 
RRAKTRLA^EMTORYRDrKHALEAMOFNEEALLOEELS IQAPSE 

CPn_0009 10780 11655 

No robust homolog present in GenBank/EMBL as of 11/7/98
':KY5YtANYPPPPRR5LCVSCSKLI«X^ITlXVl^X^TX^ 
LGGVLVXSCI^LLVRRCSnrnntSEEIPPX^SVTPSCCPALEXA^ 
t^lXrTYIOEVTACLERLKDPKYmRCLLTEAKmJ^ 

A YYVEHCOO PLEN IAY E I FSSO ELRDYYCACVCGYLPSG QARA0RLKRSVK EVMDft FMR V 
TVfKSWEA5VML0H5YGVAftELFKKAVCVLEE5VYK X LFKSYItOAFYECEXAK I0RDGRFK 

wt. 

CPn_0007 11669 IJii'; 

No robust homolog present in GenBank/EMBL as of 11/7/98
t/rjAltAWU-RDINTX.VEOLKOTIPVVUEirOCTD I ETVR KiXMWLOR Y AOJC F I LftEKEEK 
MFJlHELniATMVnilASniAYAKAKAAFEKER^I^RKVK^ 

Aft EMJi UxT'LYPEVSVECRVLERyRTICICVNLENLYAD t EKXYlft 1CVREOEHVWKEVENK 

Mhnrftm;t:KvtcMEcvstru7fu,eocumtfr;KKLTKAEEVvrE*oCFTV\ 

VTNRLE XLiTEXMKEM IFTUEEI EXTLWMVEtJU^KJn'FEKASLOYNjUJKEMLAKVEPO 

* KESPTYR::; ^HtA( J»JWM4TOYTITOERl^:F:;DLE3KVmTC^ 

i .NFfNF.KLlJrfVi ;AKt JTOARUX.VATVTYMFPYL0YI»J tKREXVR:\>MAKTERYRE(R0 

Ahv^KFj^4j\tmtiJtF,ranvij^Dnwu^DCTK^WRn>.t<>afAAAW<vicoF 

*:lii_tlH» HI24 14 121 

No robust homolog present in GenBank/EMBL as of 11/7/98



ntYFYUlSYPPPPOHSVrS . J K UIVLA rTTS.rrHLLL I JZAL FLTLC T PCLSAA I a 
rcaClCWLOEW<ISiaXCLLVKREXrTVBPEEXPEGV SW 
p1CEXfCLCTT)IQE^F^gt3UrniCDSir<EJRgn^^ 
ACEO<tKJffLINOGRSlWT*IStSt£tFTlTOlCRi£T^ 

SLHCEI HKVAVAFPRNSYAKAEXAF AK *T »"iAT FT WYBSLTOSYTOKFLESEKAKI P»C 
HfTWlJUJOAKSCCAEXKlCMPWMWNlCKOSFC 



rpn^ooio.; 



14269 



IS74« 



rtJCIuJtIUCCAiJUCTTrCKKiL»Kl^AVCE^^ 

YPEVSVS IRIWI0ETRSNlXKAYfAIEWRCCVWroi37YVKI^^ 

SpmXSSLEOFtKSUOTSOOjmXOTlIJCI^ 

KOtPCRITXIErrtJWAZXJ^PTKWtfOCACSOW 

RLVSU)HIJLRJWYTtrOKRTODSCIXSE\mACREOlJ^ 

CRLIOff LLU . V^V I JLlA PTCaQCryACYYI»rYTtVRV^ 

AGX£CIX3KVXJC£CEYI&Y1lEE]Ua«EXK^ 

CKAKPLnntnxts 

CPn_0011 15877 16614

gatB-(Pet112) Glu tRNA Gln Amidotransferase (B Subunit)
FWYSXKTAAPA ILHVS PTPPEETX7VX PKD5KSRALCXTLLVV l CXl>l.VV0GAXVl<SGVXS 
GLSALXVCGt£X57XSLGVVt^VLCXXXJXJUCftELTW 

QSTEKfilJKIEGSRYSDQGrLNIWrDKXinT F^9T,SSITStriuXRQLrDEEKICLLSSE 
b r t rg t tAMUTCOCttflWLMLCNIJLDIRAYtC PMNYXVAKVI EKAJCAWHt F XVUTMAR 

ELsrrr 

CPn.0012 16S96 1B2U 

gatB-(Pet112) Glu tRNA Gln Amidotransferase (B Subunit)

GIRVrrU!NKYCLlJCGKrOEIflJuUXIU^ 

pyjKCAEVr^ATlJQOKIUJf^IFWUtENllII 

POtgnrVLTQftKrPY^I ^ y^^^gr^^^J-^w vr«T.rnAjrTrTFTAMiiiAtfLlH 
VKDUUUtYGCTVDPl^ITrEAJOOCVElXASLI^^ 

XNUUUJCXXJGSrrSVLPZXOCXCTCLSLEElPLLTTItSU 

vroMXYV0EYE^'0lONXx:^ad0GIS0it^QlcXQ0c^ 

GFWFWCnfflWUUaXYIllSTAlOKM N J' W K HZ LTlUnWCCWKPLLClXYICADSYKD 
fjry^^^ rT rtrvT-r rrwrT^frrvr)incrriflHjmftgwrrTYFm.pTirTT ir fTinaci 

CPn_0013 18509 21106

pmp_2-Polymorphic Outer Membrane Protein
LROUJUTr«XYWKrSPLR£KKVVM}aPLWXI.ISL 

Hjy] 1 ^ 1 1 SFSSKTSSATICIKrVrroSVVTENV PK 1^1. i V STgCrTNDAAACPCNTLOC 
GPSFTrSinDmASGAMGSSAAIllCTVrt^ 

kxtaliuuwhtxyfyopitvtcstsvao*lminsp muiJN*j.y IM 1 vfsuialtxaea 
Ki ana rrSKiJawAnoCTVViJQG^^ 

f^IKIDSWTOIKlJAATAQICDIRIDRPVVIAISDIS F YU N^ U ttmittaitJLD 

AGXDmSAXJSRSXDAVQSVTGYOGIO^IJWSTmKATV^ 

XXW^XIMtfr^IELCiTIXUPYEKWWAGXSN\aJU^ 

S T HMPGGOT LSLGFAOtJ'AJ<r«I>YnimTirAJCTYAC 

EIIXPYVSinTJ^^rrOOLSYOm)HRHXTESU>PPPPTI^ 



AEOY»nnflOCYSPDVCRSKITOTTIJJNOaSWKro 
LFQ<raFMGSSRSYlM3AGSXIKF 

CPn_0014 21365 21922

pmp_3-Polymorphic Outer Membrane Protein

XONOSXYrrmffSFPKfVPffTFAIFTLSKXATerVLOSSASFXCan^^ 

TTYLnCGNVTLENI PGTCTAITXSCFNNTTCCDLTFTCBOISLJJVn/IWnVACAAVICSS 

VVnCSTTTXGFSSUnASICSSITrGICGAVSCSTGSULTX^^ 

POXLFH 

CPn_0015 21935 24174

pmp_3-PMP_3 (frame-shift with 0014)
LEFWWSUJSKOTSTEHXAITAXTLSLTC^^ 

CNOCEVSFSC»n'SSDSGAAIFTEASVTISNNAItVSFIONlCVTGAS*STTCD^ 
KTSTOTICVTUTCNOKLLFSNWrSTrAODA X YVKKL£LA£GGLTt^^R>15VN0GTAMQGA 
f a T rr^r^rr c r cAno^nTvrLCNTvrk h w . i MTL<:<7rDLfrT*;AK>4TALP_sAACnArvTYP 
P rTTCSSTrVTOVUCVKETPADSALOYTCNI XFTGEXUETEAAX)£»ILTSKLLOPVrLS 
GCTt£XJOtGVTIOTQAFT0aADSRUKDV^ 
TKATSIO^Tl^GTrrtXDPtCrFYIimSLIWPOSYOIl^LKASG^ 
F>fYCY0GTUCPXVVCTGA5TTATFNWTrrCYXPNPERIG5 

ETANECLOCORAFVCACLSNFFHKDSTKTRRC FHKLSOCYV I OGNLHTCSOX XLSAAFOQ 
LPGRX^ltnf FVAKNOGTVYCCTLYYOHNETY X5LPCKLRPCSL5YVFTE X PVLFSGNliSYT 
HTCmDLXTKYTrYFTWC5MGN0SFALEFaSRAPXCL£ESAl<FX^ 
CFKEOCTEAREFCSSRLVNLALPIGIRFDKESDCODATYKLTI^TVDLVIISNPOCTTTL 
RXSGtSM)CrFt3TNl>ROALVXlUCNHFCrNSNFEAFSOrsrEt^ 

CPn.00U 24J83 2«188 

pmp_4-Polymorphic Outer Membrane Protein

RSDFAUU£Cl«RSSFSLU.XSJ5tAFFLUtSVSA0AA^ 

AATSOASGTTY X LDGDVS ISOAf;KOTSLTTSCFSNTACNl.TFLCNCFS LMFDN I ISSTVA 
CWVSNTAASG ITKFSCFSTlRHLAAPftmKCA t K tT0RLVFX%5 IGNL0LMEHA85EMG 
rjv XimCTLSLTCSTRFVAFU3NS330CO3 AX YASGD3VISEHAC IU5FCNMSATTSQGA X 
SAECNLVtSNNQNI FFOCCKATTT PTOA IOCNKAGANPDP X LTLJONESLMFtNNTAGNSO 
^JkXYTKKLVt^SCRCCVLF^JNNKAANATf'rCCAtAlLOSCEICtSAOtXIKI X FCCMTTST 
TGr: PA.7VTR NA I DLASNAK FLN LPATRQ IKV X FYDP tTSrXMTOXUlUJKADAOOGNTYE 

>7( (vrrr.zxui ee elk r. ponlk jtftvvV bXA/^ j^vlk u rvrvx'/NNT rruvtnsKWMO 
' rnTPrAT ai-3 tvtlm ; i a r n XDCLtrm ifa r t kataa::koval: v.r tKi.vrwJNYYRHH 
itLr^wv^r l t tcL:wn>rrrpG t i-nri* : i/jttnjiy » .-yw m*m:i a »i* i**knkkcyln 

U) 
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i v :i i« avrr: *- :u mm*.- *« : r ai*i j ikia*« »m<: w i: ^ma'auvi jfktaziknffn 
kaki vc*i*iYiwmi *akn»iiivw»m:;ywii i;K::rri jvk ii j> a*:u:t.pfVKMARFAvr! 
im ttacpnt yti ty:: i vw t *k vtLAK'* : i K' .■♦ > «\ ( i v/a: a :icn: *M»n m «Fi m [ yaiio 
rti*KKN'nT» w;:»V::kii«>'NIJV/I' , a : t KKKKK::f*:rrYi)l fc :tAV\riA/rRNin*XTrTU( 
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vr?xovrrrr:"7rr:L5R0AixvTirV xhhafas NFr. . rcvELUcssRSYAiDLCJCRPCF 

CTPn 00 Id 2751 J 2'JOO) 

pmp_5-Polymorphic Outer Membrane Protein

rrMimv*KLu\LLca>3:ixvi^^ 

L.TJEVLY I DFOCGGS [TCTCr/ETACOLTnCNOITXrrLSVOACAKlAVAHVOCSlWLS 
FTDr«LVITESPKSAVTTGXGSLV3LC*V^^ 
f KffRA I pr^WTSSKKfTJA ITTTO^TI DWt^TTLIf FNENKAVP9'3CMJ3U5AASTrTAN 

.::-.*.t/rr:: -nav^u;:; : :;:.-ft: :r ".:^;7i:nKcr»v^::rrTr:.::T-;;c 
: :r; ; sir.// r «\ t .wa: . .' ? w .r - jrr.vn iatpi/ » *.\ r ? r ?.Trru*:.pr: 

OGDIWBGNQVTrrAPtUTTWWIHL 

LRINEySANQKLSCS IVTSCERLSTAEA IAENLTSR INQPVT LVDCSLVUCQCVn. ITQG 
F50EPESTLU-DL5TS L 

CPn_0019 29007 30356

pmp_5-PMP_5 (frame-shift with 0018)

Asrmrvrn^iHACTrroxypiNiVASAANKN 

DYSFVKLSPGAGGTI ITOTASOKPLEVAPSRPKlfGYOGHWNVOVI KWCTOPSOANLSWV 

RTGYLPNPER0GSLVPNSt>CSFVIX3RAI0EIH^ 

NDCYRHSGVGYLVGVCTKAFSWTINA^^ 

EFRSPOCrm)SSSEACCNCWVTnitOI^YSHRNNLtlllTJt)( l'l )(PEAOCSWANDVPClEr 
CATrYYYPHSTFt^WYSPnJlL0CTyAK0n)FKETOCSEV^^ 
RFSDCKRGSYEi/riAYVTWIPJaPKST^^ 
PCIEW5HGAIELRCS5RNYNTNLSGKYRF 

CPn_0020 32717 30603 

Predicted OMP [leader (14) peptide: outer membrane]

KLCTPNXJUittRCrLrrarVLKSSAI^ 

l^IHKNLRIOAiOCWVDITVGQSUXVAIOIVMVWW 

R F AMYPWFLSGSM ITLT PET IVI RKCY I STS EC PKKDLCLSGDYLEYS S DSLLSIGXTTL 
RVCRIPIIJ'l^PrSIMPM£rPKPPIKn«XnOGrLSSYlJOISYSPISHI « 

FKJCVGKXTMiiraoKOVPENvnMCSY^ 

GEYHLSDSWnVAOiriWmCJOJTCPniVCCTWNW^ 

YLTWYPXSXYOTCWLENIVECGYLNFAF^ 

TLSSTLSSSLIYYSD^ZSSiUfSOLSAJCIAUYRnii^ 

PLAKNEDKYirSIOBAFKSUa-lJt^GmrSVt^^ 

FPKTACEULPrGKlQrrVSLQAIWIWiaCHCWDKMitltt^ 

IKCDRIHFILOVSRPXDOLLDSPIJCJHRNiaLCJ^^ 

Yl^YOMILCTKIFEHWQLYGVVTRREAI»RFrFPTJCLDKPiaCPPF 

CPn_0021 34470 32707 

Predicted OMP [leader (19) peptide]

CSRSPYPNI EILARCVDCISJCIJTILTI^LLLCSLPI SLVAXFPESVGKKI LYI STQST 

QQAUTYL£AZ4>AYCI»DrFVLAXICEDYW 

tSOAHETADPLQQLLVl^VSGHIXacrSDDLLFl^^ 

DHLHSFIHJU^EaQCl^AMFIJUXTEXSIAYrR^ 

FLPTIJIMIXTSASP0P0EAILYALCIOJCTOSYYWITO 

EgDMJyiCTQAIJERPRALYAIJWLPSEICIPI^ 

OTPtaiXmTOVOPHYKEnJO^FSXGRTLOlMaCVNI IVP0OP0ERER1XSTTRCLE 

EQZLTFXJTU^XEAYX.PCZYXLLAS0KTOIATTAZS^ 

Z IRAYAIXAIYNLTXDPEXKRSUQIYAJOKLXOCnXiFVDTfMORPH 

RTKlMLDILISIATSKSSEDIiUilOl/fimi^^ 

CPn_0022 3S042 34395 

maf

TIl£VXSNCCNV5NTRSFYSMSLPLVt£SSSP1^^ 
CDPUYTQELAAOKAYAVSELHSPCKIILTOTriV^Yrair^^ 
NtfHtDVVTSIAVUlKIKIiTGSET^ISLTO 
nJcrwrY-v^wr^.PTTTTinaj-rrrMTnTMnvgT 

CPn_0023 36657 35014

yjjK/alr-ABC Transporter Protein ATPase
ENRAKZXYSXOHrVMLSAiiSZVLOXIClCSX/TrRILPW 
TIXJUXNGMXEPTICSXStPXK^ZLIU}NZISFK0rTVI£^ 

LOErrDAIGHELSEZ££XZCE£lX^fRADSEA£XZXTCIC ZPNEMPDK)04AM PZ DLQFRV 

LLCOALFfSHPEALLLDEPTNKLOLYS XNW1431FXJCX7YT&TVX WSKDRHFI/fTXTTHX AD 

I[7fUriIIYPGNYD0HVQOCTASR£OEKADIKSXEKKISOLiCErVAKFCACSRAS0VOSR 

UtEXlOCXPOEtJaCSNIOTPYXRFPl^DXSSGICVVI^^ 

GDKL3i:GNTCLGKTTUaaiJtf7/EAPSSGSIKI^ 

EWLRNRKTCZrTOEIRSVLCn(HLrGGDt*ro^ 

EANNHLDLESVSAI^AIWJfYKGTAIFVSHDW^ 

TACHXOLL 

CPn_0024 37605 36661

xerC-Integrase/recombinase

REVMZAS t YSFLDYLKMVKSAS PHTLRWCLDLNGLK X FLEERCNLAPSS PCQLATEXRK 

VSELPPSLFTKEWRMYZAXmZEICKAKRTZKRCI^SIKSFAHYCVXOKILLEKPAETZH 

CPRl^KELPSPMT^AOVEVUlATPDISKYHCLRORCLMELrYSSCLRrSEIVAVKKOOrO 

LSTHL X RX RGKCKXER Z X PVTSNA IQWXQ X YLNH PDRKRL.EKOPQA I FLNRPGRRI STR3 

IDRSFOF/LRRSCL^GHITPHTIRHTIATHWIXSCKDU^^ 

VKUQCQTHQEAHPHA 

CPn_0025 39610 37694 

elaC/atsA-Sulphohydrolase/Glycosulfatase

I LMSSREL I ILGC3S0QPT1OTJIQCAYLFRWWECLLFDPCECT0RQFI FAN I APTTVNR 
t FVSHFHCDHCLGl^SML/'fRUiLDKVSHP XHCYY PASQKKYFORLRYGT X YK ET XQWEH 
P X SEEC IVEOFC 5FRI EAORX^QHOVDTLCWR ITEPXTT IK FLPKELESRC I RCL X ZOOL X R 
D0EI5 IfT^TVYLSOVSYVRRCOSIAI X AOTLPC0AAI DLAXNSCKMLCEwTYLEOHRHL 
AEJlIFHKTAKOAATLAKRAA'rOKLILTllFSARYUrtJOFYKEAjAVFPfr/CVAOEYRSYP 
PPKNPLUIK 

':PH_(HJ2'- *'»*J7 »(t76,*» 

<TI4'» If/purlntr. inal ptot«ut 

:«PAic;m. i p::utti: VTiwniKrvP ikoaakik^ i hu \ cn I ayl 1 1 i^wtvlvcaml 

I VI -IHHlljl IK1 KMFf HTVI .F.Kt J W.VTNKPK:>:FJL';i1t I :SttUU OPfttfilCf'^E t RS2DR 
■*•'' « JII'KAKt .TAtllTATTKK.'Tirnrn'KlT^KKKKKTKIKl JIRTr;:; t IIKR5AMC PMVP5K 

Ki«tvijj(icrvHj'ra«jy^L'a^iux::K;rrrvr^KAiuwT(w*rt>p 

■IH_»0i:7 MT7H ^tT T ^ 

l.nt i«nt ATI* 'tiwtibHir Hr.tfiMiiu 



PSIRTIVDSTTNSOSPILD. .^VEKtXiCCECESESOSTERLLPSELP ILPUKR PPT 
PCHAAP t LI ESS PYYCVUC'^K.:; rCKV ICLVLTKKXKAP I UCVSrWQLKlCTGVAAAILR 

IMP X EOC5A0VLSS2 SEE HUTT C^^JJ2!S^2^»yS2iJ£«?^2^f2E5l2^2^2r22?? 
LLKLNPLTKEELO I rtCKSPfTErCKg WSV ALn'A W EjELQ lV U, 1 ! lIl WawrWIALI 
LLKJCTLOLSRLOSS INO K I EAT tTICSCKCfnJEEQLrrXlornflr riffPR AIDX EXff E» 
LRKRlWlttAMEVICOEirKiCTl^SAEY 

IVLNKDHYCL3E X KQR tlELZ JVCKLSKSLXC5: ICLVCPPCVOCSrCRSIAlCVLMRCT 
PR rSVCCMROEAEX KCHRRTY rCAMPCKMVOALKf^AWPVIMIDCVDKICASYKaJf A 

■ • -rf l: ;: -* - : — :-~vr • jTirrrr J rry-rr r:.tt;tu 

UCI'/ONOEKPKSKK XTFK 1 J JKNU/rYU^' lFJJURF\*E^P\XJVATCl^WTiilME3ATL 
YIESN^V^SIJCTXtfflLTCQACrVHKESSCrAWTYIiiSAlitRYAKjt 1 > f V I LSC IVHIHIPE 

GATPiacpsAcrwvrrsLi^LLLrrpvvNNuamarrL 

LNILlPPEIWRRDYEELPAYLXTCIJtlHrvSKYOrMJCVAFPKLX 
CPn_0028 43328 42543

No robust homolog present in GenBank/EMBL as of 11/7/98
RKFLCrPHP rVTStXJS LSFLPYLCXSSC 1 1 DCCSNIVEHYU{^3C0TSVI itcvscattl 
SVl»AiJ>ISKSEXIIKIl^XL:Ul.IIJUJ > IKIVUU:UTrYW 
TPDQENI^U>LPSPTTlJatIHALKILVRSCrn?^I0ECFSmiTOL^^ 

rsYKSLXjtirmfSLvsvPNXscEXRAX^nrK^^ 

OTTOKXACFCtiTTfTWICIYPL 

CPn_0029 43839 43390

No robust homolog present in GenBank/EMBL as of 11/7/98
SNW<Era^IYCFT*tJ^YIRrFAALNIR*«I^^ 

KXF0XXIJm^XlC55LXSLI^LCTOEATOSDXLYCrSRrOYLNSPEXEDPRXPP 
LOniVSRSVKEI^IKI^LNSEWWTKP 

CPn_0030 43840 44529

gcp-O-Sialoglycoprotein Endopeptidase
LKCTCWYSZJITZ»mRMYPYTCYVZZD«WPrX>CW 
PIJKSIQ^rO^AVAUHOJrSATRXCISrAO^ 

LFASSPSOKrrVECVAPSVEOXRJtHVtSOPMFVEYZJKCUPim^K 

CPn_0031 44708 44884 

rs21-S21 Ribosomal Protein

CPn_0032 44081 46098

dnaJ-Heat Shock Protein J
SLIffllWVCSVSGMPYYSILOXSCTASAm 
VSEAYXVI£C)POKRD5Yt»FCiax;PFACAOSFX2»a9CNKEt^ 

0SAVWTOZKSCE»3COSWWQSRCrrSttASTCPEro 
RSVHWIPACVDSGMRlJPXECYCaiJSOqMWSCDLYV^ 
OFVTlAAlOaaCEIPTUJatXKCRLTVPEClOSCTXL^^ 
VETPOtO^EEOICEXJtJlTTASTElCAENrPICXRSFLOKI KGTFSDfTV 

CPn_0033 46129 48171

pdhA&B/odbA&odbB-(pyruvate) Oxoisovalerate Dehydrogenase Alpha
& Beta Fusion

ERSHGVVQK3VISS IlllWaVWtX^ 

KSLIPCXE^FPYYRIWFPICirCDUEIFASFIARTTPtm 

C0S5VVCTCrt0>ACRAKAVKKSSADEVVYVSCXa^ 

I(^«8IHAlSVTPtS0CGAX)LASLaRCH0GXAVYEVt^^ 

AUUDVVIu^SHSNSONOElCYRSAXJlIJU^KiraPLXlX^ 

eAOEEVRXSCEXA£AI^FPSKGSTSHEWSPYTETXJXX 

SEAtVESKTRZJSCnriVFGEIVAaJICCXWPCVTR^ 

HAUCIHXP\^IOFAXTYIWPCrNDU^EASSIYYRSACTWEVn*Vll>PSC^ 

HSOSXECFlJOfCPGXKVAYPSNAASAKAlXIUUURCPNPV^^ 

SHI7rVLX^CKAAIVHPCKI^TXV5VaMPLVUl^AOEUk^ 

TVUCSL E XT CR LLVI HEAS Lf CC FCSE1VATH SEQCYAYLDAP IRRLQCLHA PVPYSXVL 
ENEVLPHKESZXjQAAKSIAEP 

CPn_0034 49496 48210

CT345 hypothetical protein
VNFLZJTTCRCXLMAEXSTPSX^ISSXVSOKrPPVPOPOSSPWXPTZPTOAPPIC^^ 
CTPSS IVNAIAFAILAFLSCLCCVFA ICLCCSLEITMPLFILTAVTIArrU.YFIHYLEX 
PKIPEPLPTPPPSPTLRAPTLTPEIPAPAPCIPLPPTLPKVDRTICLTCNPOIHYPSTrDP 
KACFSUJ<OLFSLJ3PETRPEDRXYSNXLASIUJlSXDC3CFRrKCriCC«rSHI3K 
CAWXSSHSSMDF^TTI^RAPAVTTCIjORSCWEKZXNNX PTPEKHLPZCSCVSQPWDVEE 
GAOLYTSHLrvrNPPTLrTLIKEKWRAITlJ^DFSKKEAFTNLVlAYLOCrorCIEKKLE 
SV0 LEVFC LNNLS ADQ LLJ- '1'I V ESCCMLALLESVR I LLAS KEEYAI-SWSVNS I SQVPL0 
TACRALTLN 

CPn_0035 5H46 49569 

CT33-> hypothetical protein

ARTTLEEWCSSUCPLPICTFPCATALYITHRRERKSEHOHWKRCOVFSSFFFRYPISSHL 
IRLRASCECFOORJ1PIPLCCLYWIACXTSRCHPECSAL1LIFIO<FLPRNPKOHU1^ 
W t ISLMLTPAPFLHrCPISCTPVZHHA«WCTYYCEALC IOTPCCKRAHHLSCOIUSESR 
LEIjKKVYELECTLKHT^O IVF1CSNACY1C E Z PRSRrY I MKEKC RCSSCHFI/IHRPPSSEVC 
PFA5SlXlX7TPLP0Nl*0LFR0KCI^t^AXSCWHFSt£ATT 

EVLT3LAC X FPMSLJVWR^ISVTLirFSVCFCCSC^LfiRLCACF ILCSIPrSPPSPTP 
VL^'FLATLC XIXPPPK I PSFLYTP*fTOFLSPI^YPIRYLAKrLAISL3AOLriVLPllS3 
YFCSLPLECLLYHL tVPFT ILP I IVPL IAT 1 1 LPCCCPXTEALIOCrL.«WPWLKIIPIIILK 
TLSFAPVPP%*«.TLA;;LILFF 10 tLRTWVSPYAS XSATT/RFIETL 

i.*Pli_im I*. SO'»K'< SI79s 

iTTTJH rr/jwirtwtr i.-.ii pror.««in 

AK:nJtfn."EU<KKMKKITlND;rf hT//ftSFFPFDVLC t EOLPKEMSWEWnAK I PRLPRCWm. 

w a^ncr^n t nFTLor**:: wis. i uiket.pz xcrffsllet t evy XYRLEKEP'rouwrrvF 

KDf3ft^iKViKri'( J JN'ri'JIMRLPf'LnDRHYEK FFH [ Hf XJFOKWEDO » f Pltff'ja-WJWK 
I J-■A>^.V/HNK^♦/A^^*k :v.:u; I FfFVrrEEPFAYOCFFFOPF.I RPUI^IJItWIXNKET.LE 
llh: :i JCT t EU J IU';»t: *YV (V'.FU.'Wl.Hr/IJI.'JECVYNE 

r:iii.uin7 *i I vnii '.an r . 

prfrtti it:; i<ti/^t4t«'.ti i i.>r I'ffttfin it|»r 

km t; i**ij(Ai wnT i v i vi'f: iMMKr-nrr/LESEKDrrooo i kei/jata: t/kn 

AA* i t l(VJ* PA< IV rVHI.FI t '.F.n 'Ui\ ^T/AT.KT I NAJC IHT* I t*UiM*J «Kt U7T (R.'TRA 
HR I LOK tODAFSSCFHEL

•.TtijmiH wwv sunt 

MCltt EECLTAVRO* P^WDRVQD I KD laffiVIGHLCCCHKSSLCCSDONL II rSttLT^S 

. .: ■ ; - 'i.i.rpKi /,>?■•!-:: .•r:->T^i>^"::r-r;-:i-::KTfKi.=. 

RKIKHVUJAAJCXJNCVPVSXCCaJU^ 

rccixxTEUi^wcrcsEvmiftiwMtiTS 

CPn_0039 54256 53963

CT339 hypothetical protein

ISItMGYAKItlCKEAKXMWFLDtEASXJJXRl^ 
CLOPEDPEVIESLFRAAnCLJUEZOKDQOISLKRStNPF 

CPn_0040 55673 54318

crxacrriJuiitJUCAucvKLsncEPOioc^ 

[?ICTVWTPVKW<mYIIDEVH^^ 

L3RO)!CHHl«RIPEXriLDCISXJiM 

LTPXSI^OTVXQALSrASQnSlJTI^ILOWJYATA^ 

aRp^EUssiKswFsaJwixETOTaQvsw 

SVEVKSSAS IKSAAVCTLLQT AWEPSCZUtO 

CPn_0041 55888 57342

No robust homolog present in Genebank/EMBL as of 11/7/98

cxylvhhsyp ppqhsvcs issTmgjtvtArrrLVLSvLLLi SGAi^TLcrpg.TAcysr 
aflTm.fiAT/^wscxLCiivicRgvsicvcmrPAvq 

0KEOKT0K:i^LP0eiJ»LnRYI0CArACU»LiaJUCra0GFl^^ 

WIAZrVEWILCOBCattJZrraQTRYICRDtJKRZDSLyTO 

tXKSAJlEVVDRFWtnCNIMCIAIffPDRKVYSVACT 

FCEY^WUCXJjGDEEXSAHAEORFQDIIOT 

RYOHRIT!UUlWY7tVAEH0IJHATl«Via>St^ 

r p nt .im .Mfv^CT .P PAO ggLflgt.ftAf.yP £1 AVSWfKAttRtVAS DLEKAHES I DKHYQSCVw 
EQELY 

CPn_0042 57346 58182

No robust homolog present in Genebank/EMBL as of 11/7/98
EEgEKOEAETRZNSTK IRSKEEVSEYLOOVPTOLCSCSXH LTAMtJl fA LSV Pf ffMrffTT g 
SIILSDWAmrEVlCRX>IEX*iLSRVEEIERMXJttf^ 
TRAfEFYFKESPAYLTSEBUSStOTL0BJU«E50W 

QGvsLiKmuvrsTyK»u'SYHsrRijn/pc^ 

DAFOAP0DCJC£CLVEEAOAUt£rCYtAYBniUCSK)a(H 
CPn_0043 564J2 60372

No robust homolog present in Genebank/EMBL as of 11/7/98

HHRTIKQVPLSWtJPPPPaiSVCa^rcLS)a r RVl^ 
VSLGVGUXSAU3VX.VISCPULX^ 

AXoiLcoi^EUQLOTDioKsasaiaajaauocDRC^^ 

rVElXXJVMDQESRYlJ^HEVQSIAHiaJVODVMIR 

AKSVVARFMKVnUJtRXtAIMFNXKAYGAAXNAr^ 

KRJUgLPtffiNNSAftAXORfRrVXDKWgDIJCgl^ 

UUnUOJKVKSKOLWTATKRVKEASV^ 

OUU^ECKRAOEPXEXLTALYPEVSVSVArETSREajCFNIf^ 

WTEOia<R£A£fIUVKCTKVRSNECVA£^ 

CYTXL5DAANRLRVLCEDI CDTLPRVBXIEtOilJlMAStPIitPIKOAJrTXAJVOYNRClCER 
LAKVEPYYXESPAYVNSEEW>QSLIX}ASQCIQRVPKGFXFRNGSMYI 

CPn_0044 60278 60778 

No robust homolog present in Genebank/EMBL as of 11/7/98

r AKSDCRVW I RU*S AYK ESOXVSSLETCACTYR ETUl£OVQOrrrOGVS L I KFTT.TT LS S 

TLKSXLSTOPLXANIPQttrirY^rrDDXOK^^ 

KEAKPUO&EYRlXQEERSNXEKRLIY^aCMAVARQRVQEF^SMEIPE 

CPn_0045 60961 62790

CT345 hypothetical protein 

CXYTYH PPOLPPDHSVUATSWQ PKLR I LT ITFLVU7/UXI SGALF LTLCVPGLAAGLS r 
CLGXGLSA4£GVLVVSSr^rLIfUUarcKVR 

tQEWSCUUCLKDLKYEOX^TEVOEXLRVrCFVRKDM^ 

rNgVQSISHIU^VPtWIGAWJ^ELCCYLPOT 
A*AFOENACWAKHAFX*AFGAl^ECVYIt^TE^ 

KSAJU^RFREVMJRWEIJtJtrrVFWVIUirc^ X PEKRRNXV 

MSHKIJWEATMRMKGAECTYSVARVAFEJCIXSRWW 

RER1JSELEAL YPEVSVSWCITRrnc FWXTAYC^ EER YOS WROOEDYWKE EENKEAE 
FRETGTKVRSPEEWEYXjO ilenuedcskqltiaev /VLCVELEATAEFEYTILSDAAN 
RLKVl£EDIEDILPRVEEIEIMU*IAELPrLPIKOAfTKAPLOY>CCKWlAJCVEPYCOE 
SVDYKSCFRV 

CPn_0046 «775 63263

No robust homolog present in Genebank/EMBL as of 11/7/98
CTF^WWDLONVYOECOKATCtXSEVSAYRDKLR^ 

KSKLS7DTL I AO I PCMXFYEEVYDC X DKARV0SRWLEX3ERY RKAKKOFQEHLXEGLFKE 
HQALKKAEY R LLftEKRHNXJ\K LL ICNK I EAAOCR V^E PrTOC 

i'lTiJMM ' Mi7 o!6« 

No robust homolog present in Genebank/EMBL as of 11/7/98
KHFHLKVT r V IKKKKVl-S I H.TMYllF01t I RKrLTTCAP/LriKr.U*KC^ELWFVYCCCPET 

KVKi//r::aiKwi, 

•'Hi.'HMH i.»M«7 tttflOt 

•V»|IK Mii >:<di:iitrv*!ti riypnthttt Let 1 IM r<rM>nn 

KKEt .PMKIJYNItAC JlKI jnrjWVHYKI.YTFW.'lK I VA I FTFAWI .KVLWPEYKAUEISRtS 
I .TAMUlF.'tU MIAJIKKYKkTAIt I :;EAF* iK\nfIIL1^.-K;::U.':KH*l*AOEVrDYWfKKAAO 

ki a-rmrvw srryw 'i jcou* i yppluskekktlc hi uir.m :nv r. aoiTCHlk truoeN 



y^vSSSa^igaoslyclvaaj^ 

f™S™«St^^ :ocicujxorscspn 

J11LU04/ l'J8*'' 

No robust homolog present in Genebank/EMBL as of 11/7/98
{SSr^rswTLxrvsLvPiJCLViuTOXMi^^ 

AT 

CPn_0050 66849 66499 

No robust homolog present in Genebank/EMBL as of 11/7/98

OCCCILLPiriJJ^LLISVLrOLIMl^rRI^rAIJ^SSI^^ 

cpn.0051 «w 67 1 11 . 

No robust homolog present in Genebank/EMBL as of 11/7/98
WYLIAJWIPWramiHK^PSSHAOOVSRSTVYPSM 
KSEOLKDOHRXPUrraCKHPTXS ILNVNRTSWLS XFYNCEftGF 

CPn_0052 68008 67304

hemC-Porphobilinogen Deaminase

imLSvr yspp t ^w iry FMUttASRHSiaJUttovmsc xsiJJtswYPiawroxjTTrrT 

05RrjaciPU{LVENSYFrTIXrrt^VHICCra 

IXVYADKrVKEW^PM^SLWlSAVlJtBXJ-POOOIL^ 

XVlJUUUSXJU{£i01AY8XEVPPYHAI4}SStAZ^ 

CPn_0053 69350 67986

^SSbtoStcwcgatapi^^ 

SlEXXNESRlFIOKAWDimflOC^An^LTXI^XIDPC ICItiTlZMTADttASO^YltVL 
YVOGEISVT0TSXJUUOuJ4XSSPl^Yl^E^^ 

LKgAPSWAflfcfRJVTYllitjXAJCSWX^rX XCHVTKSCE XA CPPVXJXLVPTV LYTCy 
o^jySjtlJlSVKNRJ^CPTKEI-L ILSMHADCLXEVSKPSCLTLOLAlUPl 1USMX X PIUS 

gcMXin^" MgpFAWPVRicrAcrpptmrsij^vLEWtAtMajTW 
ki TTpAA mr r Arj ^^ c ^ VMttrjJ '^^ egvCL ^ 

AXLPEOIXSSLPltXXRENJIu^QCVICTXiaAXlU^ 
CPn_0054 70089 69313

rnc-Ribonuclease III

TLsrrppixxpi^Kntixua^MHPPXDitAXCAja^frr^ 

OYtilGJOGEKIOSERCM^YAKLrES XLSAVYLX3CCLS PARXLTVPlXPPflC^XX^PUtt 

(»PI0^LI«miC0FKVlJWSTAVTIiAOGWSY^ 

AAOOAIXTYOTOJOWTKCV 

CPn_0055 70096 70590

CT296 hypothetical protein

CnaCYLXRimRSAXWKIJUaWHClSXlJENXXTXKI^ 

TvnfRENxre»PExrrvviuauuxrrAA£iv^ 

f ||j | ytfy!cr.Tmc PYP.^YEtJ3tCTTCLLLAFSSVCI PVI 

CPn_0056 W17 7J746 

mrsA-Phosphomannomutase

DLFCATLTFCTCXaJlSLMCItmnaNLFTIWtrPOOL^^ 
PJ<NSXErAOETA)WUt0CCTVnJ^Ym 

YKVYMASOCOVLPPLOOEIVAACSAVNEIl^PSIDHPNIHLICICrYEALYBJTrL^^ 
YPEANRISCRSI^ISYSPU«rrCISLVPHVlJU>CFLSVHLVEXOAICI^^ 
ECPEALTLSIEXWtfgCTtfIATOPOAPR\^A«XEIXW 
V^KTWLC2IU^V1CSI.VTTIKI^XAKHYKVDLINV^ 

G AEESYCCLYGTHVEDKnA IIASALI AEAALOOKLOClff LCDALI^ LYTTYCYFANlCrES 
V^SArrOEOEIRWOSMlXEISSANFrSGKYCTVEyrE^KOCrCrKUJIC^ 
TSKI^GnfFSCnCRVIIRPSCTEPKIKFYrEKSTKYPERVTOKEIOTOREAESFOHLOOFI 
FOFKEKFSNL 

CPn_0057 72913 7)554 

sodM-Superoxide Dismutase (Mn)

ILKRYVVMSFVPYSLPELPYOYOALEPVtSSEIMIlJ4H0IO(H0IYt>WtJ*AAXJt^^ 
TrXNU4ELXAUPALRFNC(X^XN^5UVETt>PXOOCC^ 
nJOCLIEVAACVOCSCWAWl^FCPAXOELVLOATANOBPLEPUratJL 
LOYKNVRHDN'LKAFFOIINWCHIENRFSEIISSIt 

CPn_0058 73627 74562

accD-AcCoA Carboxylase/Transferase Beta

XRWLVRI^SYOKPKtKVOKXKAXXSFSCWlJCCNHCHEMIHANEl^NYICCPKCSYMYR 

AIERVXUJ^KDSVTOPLYTOUCSODPLEFIDTDTYANRLEXARAWrESEC^ 

LHPVAIJkVMOFWFMACSMCAWCEXLTPilEEAI ETRLPVX XV5ASGCARMOESVPSLH0 

MVnCISAALAKUIEACUJnrXS^TMPTSOI^ASFAAX^ 

XCEX^PD0A0l»ErLLE>O(IDKr\^RJCELICTTU7rL£i3YFLA0C 

LKEIFLLTDOSE 

CPn_0059 74562 7^050

dut-dUTP Nucleotidohydrolase

XKKHTA^TNDNI ICNA I UCTVnrELa^ELPr/TTPCAAT.AOLRAN T EEP IALLKXJRA 
LtPTW KAEI PEOYEWVRPRSOLALKMO (TVUI.'»PCT(0.'!DYR(!E IRV ILINniOJTF I 
I EPKKP I A0VVU;PVV^TF^0KLAETARC::r7:FCI m"jc 

rPn_QO'.0 7S004 V p * r »3H 

ptsN-PTS IIA Protein

P.KLPET/EVLV 1 LEOAKHPSYCrjrCOCFSLFJLL^PRLVMFUCICI WROEtLOW.TTM VOA 
AOLLE6KOAFFOALVRRENIH?m ff»r r/A I PI ICKLESC'JIF F I A [C; t IrTO ! I LM V\ I D*. 
ALVRL'/FL ICCPEMA0AEYIJCLUTT[*TU"LJ1KESRHQfJhLOVNTXEEVMNVFVI » 
i.-t* mi*.! 7ssoi 7«o<i

ptsN-PTS IIA Protein + HTH DNA-Binding Domain

Sate- r?TXVKMOUCU>T/WLLIRCe^^ 

VC r LTTLT ACQDICHLNLWK rVHI^UURSFrXNyPKTOU^YVKEVCSaTW 



7«M 



77690 



tVWKP^TRKQAXlCEPRAWC^^ 

SS ARKRYCTPSSAAPSUTXTCIVWIWTItM 
USGASIOOSECTEOSUinJSOGASLPVLVRSOT^ 
VRQAiZARSLTKlCVARaGSVTSTtJIYDPe 
NCKOOKTrPSEDASOCEOOTCAGL^ 

SSETnJWPCSSCVSICIUCIHNSISVCTI(VVtVIAMIVCAI.I XANATESCTTSDPTPPTPTP 
CPn_0063 7U09 78267 

No robust homolog present in Genebank/EMBL as of 11/7/98
CPn_0064 78340 78576

No robust homolog present in Genebank/EMBL as of 11/7/98

LVmXQCSAG^YltSAPAElWPPOPrLAW^ 

LFtrVHUPLAASSYLUF 

CPn_0065 78882 80651

CT288 hypothetical protein 

YTJYYKYWrFKXNYKTDTPTHrKC PKLNP XKVN7NTFERNPKVARV10 ITAWUJIIWi 
SGIVUICTPUUPISMltCaUASOCWVOtrriATILOAWISY^^ 
ERPgjQ^j^SIJlJCEVMDXJWSVVKHLKK^ ^ 
MSn«DACLIKWYREELLICEQTOTOETR^^ 

surai^siMSRXHTrrrviiJtt^vv^^ 

LSVLVROILSNTTOVmODrnCDrVXNVDIEU/CrVTM 

seTTirroa^AitLJtOTswiSTiTSPOWTODPErsiJWKp^ 

fTj^ftlt TvrTvrrnTnrTmRTftr.f nooT.g^TucwRHPRciyYo^vrybLn'njRiQ 

KUX^iTNHUUAOCCrrOSLSDLVDIONRILSI XVEGDSDSRTEttFOt 
CPn_0066 80916 82655

No robust homolog present in Genebank/EMBL as of 11/7/98

<WYMANPTOSRPPSPEISim£LQEIJ«SSlITmSKrPP 

tCEDttCPUSSCEVVT^rvC ITl*^PE^^^MEVRVKf INCSCHI^ EC! IgAHNICDLRG 
EFVRFXWISGVCU&CfXGIRNRIPFlUXOTS^ 

GGLYL0VALDMSIY5KH ILCVG IGSSYYIOGNTlWKNYKVTI3JWTTU-I3U^^ 

LPYADSAEmJXPSVRCPSYOWWJBXEQCLIMIKNW^ AlrUDSSSIXALVVKLWDH 

STWTRLIBaORCBSQAVLEWPOPSHCRDIW 

rVTYAIVTCYSIKnJYriUXTNRPqCRRKrKVLBLAAT^ 

VMfflPPUSWFCTASrXTGSPlYVttT^MFTTSlASMjC^PWDW 

KLDSLBPSQNU.rmiaCLFMPLX ICm^^QVPRVVIRPNTrAVYDLNQTWCAWDS 

<^OAI<OTINniCHILLVTNTTTTVRS^ 

CPn_0067 82920 84053

No robust homolog present in Genebank/EMBL as of 11/7/98
KssarsYittPfwvmRVNssoAijeDcw 

UALTlJ«XLTSVPYIALOVFrAIVItXKXXFAI£SEW 

KNXtMEKEKEDP&aCtrrATOX PKRSAX^^ 

CPVTLFPtrrSEEEVLIRSVVCSYIAMXAOfflCVr^ 

QRWtfFLFTOXDLATFTLAYTRVNXXailAPFWK^^ 

CYYARIAFNQTOTLYHQLFNVEJQ^IYAI8<D103^^ 

OOEDREYPSRAAODOPMC 

CPn_0068 84909 84331

CT360 hypothetical protein

SFWIKKrFIYSLIf^SFSAPlJCICNEDVSSOSRimJPmJTQU^irTPIEBaCE 

IRNEL0AIS0G0KSSEEXCESCCTSO5ErU«5QCTWeSSNErVLDFn}^^ 

COSCQfVAQIlICFNREFDIRKRII^LJ(NRi2JIJU3COL^^ 

EODIKQTLMUJCK 

CPn_0069 85191 87086

No robust homolog present in Genebank/EMBL as of 11/7/98
LNFLYVYLLI RiU3IMTTPpPSRSSSPPPYIVia^DU»TNWSSRATPPPPCV3G£XP 
PYFSASNTVVTERGAPSLPSPOQIXSLPCYSROPPPGYrDETASITSRTSEEMFCTLVST 
LCCFANSERDWEDHEVNCX Y XASTSDTOl£AVCXXMtXTELftG£ 
£NTCWSRL£VSHTWA>rro>raRFFSRHWWCW 
YVTXIIJIPWIRGIJYHVWRVRGFWPSCU*^^ 

TFENAIRYGEROXVCSEQOaPETOWTSPLTSUX^^ IASR 

LMHCCROCRIXSNYMPCRSSHPFKrSMYVLVRU/TIAOZYIMSPYYSFQSNI} IVCLIF X S 

SAAVETVSYIFLTVTDSTCGRRYUrVPRLVCTC^^ 

NVRFIUrmCTTRWFFAWNLIUWmCUWCIC^^ 

YAIVFP3IV3:KLLTAlAHAim«lJUJ>PYW.IES^^ 

ULVINTCIYMLILPANLXFMVYSVRRYHItSRR 

CPn_0070 87399 87208

No robust homolog present in Genebank/EMBL as of 11/7/98
YKVCl^tLKNONPFSNCSRTYfiORPPKVS PHF£S X LRL0SVCFSS0GT1XX SFROTCUCR 

ISLYt 

r :Pn_«07 1 «80»i6 87599 

*T I2*> liypnr-.lwt teal protein 

(K:;LRXJ IlJCFrrrLOHARCUUCOMKI IKLiTCTFOrDHLYLKL><ENSJSRDAFDKKRML 
K»n,Wta.v:DLYLYEVY0O3 [LFFFTYTKALlCCGIASUPTBVYSCCrPST ILTCKP I F 

rtjHLTV(L;:nmjiii >EwLymrhko i avqylk ppqt 

•lllJHl/.! H'HM HS057 

*.T:^4 ltynnf lhfic.il protein 

I* IY W.TKT: rWKEKVU Lt YClXPYFFIIYRMSTPUISes: [SPyOOYVPOEI^CDRLSSSR 

::rn;wjc:HA:y;u:riv?:ppi;;ALVALTD«LVPW^ 

M I4.Y 1 1 JWAL t LVCtllfTVALTLT INUWU! ICVVFntFTATCLDKBWHRHVNSUWL. 
Ufllt; I iJXAiMTTHQ I LLATHtAS I ; !AL I Y AVPQAV3LV tGFIt IHNQLJ INTVYCARU3D 



EATYArOWtAHKItPIEHIC irmWINQf CLNALI EINRWCTSPA TANUA S 

^n^pyo^bc™ -3YU)LNr«;;rco: I araooc tKnjenao ikxepdr I 

IB3NN 

CP!U00'73 8715) 97S74 

A VT5T T*JL*WYAH P 



VEXiimJCrrVAL£tX9I^AXRSSCirfXCACrXSKINA 
CPn_0075 91081 91350

secE-preprotein translocase

FCFAXY7VCLVLRXS XTCLDG XTTFXJC 
CPn_0076 91334 91903

nusG-Transcriptional Antitermination

oprcsvKKYWYVWr^AQEWv^^ ILPI ENVMEVKICtH 

iwVIlCYTWPCfLLV»«LTDESWLYVKSTA^ 

gnft^QrEVGSKVlCIHDCVFVNFIOWSEVn^^ 

CXVAfGOSSE 

CPn_0077 91956 92435

rl11-L11 Ribosomal Protein

FrVSYPLTVEVSC^CVRFSMSVIClCVX KI X KL0 1 POCaCANPAPPICTALCAAflVHXICFat 

EyWU^TQiy PCDUJVmVYAliKl r i t iiwj pfvsslikictlnixsgctx pnrnxvgkl 
T^yf/EAIAXOWWX^OrVUiSAJOWVlCT 

CPn_0078 92453 93160

rl1-L1 Ribosomal Protein

SCSXIfn0«^XR<2lJ0VYX^SKSYSXJt£AXOXlJC0CPP 

OOTCAVFLPNGTCKTLRI LVFASCSNKVXEAVEACAfiFMGSX^VCXXXSWUriTVAVA 
TPOtggVaELCKVLCPRW LWPl PMUi V 1 1 W MCAISEUUtCKXCI^Al«ACVCWWVC 
O£rCS50XmXEAl£SAl,XKAXPFAMCGQYLVSmS 

CPn_0079 93170 93688

rl10-L10 Ribosomal Protein
ttOMKOExnilASVmXSAAOCFXLLRYIJtrrAAYS 

nu^t^axvccsDTXxsiiiTATSccxDPv^^ 

GASVlAVN0U»StJCEXJt00^UAAPKS0VVCXI^^ 
CPn_0080 93720 94121

rl7-L7/L12 Ribosomal Protein

ePTCFAVTlXDVPAZMCXICVIJCWREVTCLALX£AX£KrCCL PJCTVKnCTSKSQAXiyrVK 
KLOOAfiAKASFXCL 

CPn_0081 94219 98016

rpoB-RNA Polymerase Beta

FREXL^HX^SRRTRKLKCPEHVSVIOUCgDiroLPNLXCIQIKSYICOFLOI QrT * ITWW X 

(aJCEVFIUXFPXKSYI^TVtrirLSYNLCVTIOrSPCECXRra 

XXXXEVYHGTXPLKrDKCnrXXN2A£RVVVSCMiR3PGXNr80E^ 

RC5UL£AIFOXl«X^XHXDAX]CRIWKXXJaTFXlUajE^SOA^ 

OTALLVCRXXJu^XD£ASSLVYaXACCXLSTAHlJO»flJ^ 

RCJUCDPTDSYEAALKOnfRRIJlPCEPATlANARCTI 

LCFSXOOEAUOVmJUCEZVXC^XJCmtUJOmEXAC^ 

CRSaUkRKttrVRTRKNlJDrSStTIXTPCICWSAKDtASVU^ 

A£tTWCIUu^AU;P<3CU4RCPAC7CVRin^ 

NEFGFX CTPYRXVRDO XVTDCX CYHTACVEEECVIAO AS ASUJrifKKFTWVCWVRYACl 
AFEADTSTVTWIDVS PK0LV5XVTCLX P FLDOAANRAliC^NWROAVPtiJCTMPVVQ 
TCI2QlAAia3SCAIWAEEDCVVDFVDCYXVVVAAKHNPTl KRTYHLXXFLRSNSCTCXN 
QQPU^VCDVITrcDVIAXCPATTttGElJ^^ 

AYTSXYXCEFQ.TARI7rXX£X£CIT7mXPNVSX}EVLANLCS)CX IRXGACVKPOOXXACK 
TTPKSTTELAPCERIUtAX FCEKAAEJVKOASLTVPPCTtCVVMDWVFSRKIJRl^ItSroC 

OCTIERIEOmVDU^PNCEMYTVUCCI^DYETAUJRlXINYrri^ 

CVIROVKWASWUCt^DIOHAGRHOnCCVVSKIVPeAISfl^^ 

SRMlLCCVIXnflA'AAin'ACIYVlCrVWGCFPCORXWDmXEO^ 

E»FO«WXWXYMLKLSHLIAOKXHARSICPYSL\m»PUXaCAOM^ 

EAYGVAKKLQE I LTV7CSDCV5C RTR I Y ES IVKCENLLR SCTP ESTNVL I KSKQCtCLIVR 

PMWOA 

CPn_0082 97992 102221

rpoC-RNA polymerase Beta'

rriWYRTFKPEXOCXFCtKrFCPTKDWnXCCr/KKIIWICCira 

HAHXfXAVPIVH IWFFKTTPSR XQNVLQMTASOLERV XY YCEYW t DPGKTDLTKXOLLM 

DAQTREVVPCVCTDAFVAiaCCEAIYPLXKnEDLQSLLX 

UCX IECFVSSSNHrtMMVLKNt PV'/PPDUIPLVPtJOCItFATSDLMX^YRRVP iRWIM JC 
AI LRLKTPCVIVRNlXRHU>EAVDAlJT3NCnH^HPV^K^ 

NLLCKRVDYSCRSX'I I VCPOJCFNOCCLPKEHALELFIPF I XKRUCtOXSVYTtRSAWM 
IORRAPEVWDVLCC t IWTUPVIXNPAPTt Jlftl/i 10AFKPVL1BWA IR IHPLVCAAPNAD 
FO WtAWIVTU^XAOLEAKVLMMATON t Fl >.^7^CrVA [ P.1KHTTI i TLVY LMADPTYF 

pEEHunmc i fkoe t EVLRALNNrjfiF i wwF'iWRormnfj i ii (KRKtKVRtnootterr 

PORVI ftm I VPK ELJFONY.'5MI*.1FP I II //.VKKVT.I£ATVHFI JVXJCDUV tOATKA 
A [ JMT.I J(DWR I r-0 iKmi I LXOAYbTM I VRKOYODU ITBIERIIlStTI ^ I WTTV3EQLCD 
ALYVEt :;KOTRSKHNTLFLH I Dry *K:V^ ryU'JVI JinLHAKrM*AI IRTPITHNFRE 
';LTVLCY.*; I ^HUARKCLADTAUCTAn:* :YI.TI*P.LV0VAODV t (TRKIlinTMtl C ISA I 
Q(Jl ^EELLrUCDR I WJRTVAKDVYV JDK:>j J Ay: "«UDV t >i.S W)AF^ [DCVV. I KTtKtRS 

TLttrK;t*mtw:AiccYt 1 !utLA»a<i.K»«ii.^v'; : iAAa:;i';Knnvi.TMHmii/v;tAATS 
.WEI im;cGtLVY*Duwvu W3(Ni.viitrr'iAiiiw«irjrrtwKVKitLi.'rricsi E 
:;LKVPr*vEwvKii.vAorrppv:vw' i';i-vT:ijaitri icratmFmYKntvwmmjtw 
VP/LLC ITKA.1LGTESF [3AA5FX)inT1WLTOAACCSKTt)VLlfinCDIVr>CKMI PCC7C
rrrHKRIKOYLXKECEDLVFDPVSETCCVC 



CPnJJO-M 
v.US-VJlVl 



■:pnjioei 



102296 103312 



.\"T?r::;.::,r/.v.?:rKF 



jELU^WVCtRGNOX^^ 

KAKATL ISPFWCR1YCWWI AAYGDBCYSI DM Z«*2ir^S 

TKEOVUUOTLLTISnCLLDELKltSOHIVXIcnjJPA^^ 

eOMUTEXLAESXRXPACOrrOILrrAXTCFXfCOIAAECA 

CPn_0084 103356 103751

TSILKQESDLIS 

CPn_0085 104512 103766 

LEALQNENK 

CPn_0086 104898 105527 

atpE-ATP Synthase Subunit E
MiiuMiJttixaajcQiCDMJttiyiW 

EERAHQXX KOCEVALSOAOKRALEAUtOAVEWCI FRESLVEWJEHVTTDPEVST10*IQAL 

VOALEAQCVSGNLTAYIQOTVSPRAVNELI£^ 

WVLDLSSS AUXI rTRYLQKDPREMIFOGS 

CPn_0087 105540 106376 

CT309 hypothetical protein

SHEKirSinCVVVKTOYYrt^rLPTOl^ESVPlJ'SISDUJD^ 
RpnJPENFAFFWACKPXPFSPCEVTQENVDtKLSSQOWbUiXllIf El** KM LMNHKSSQD 
RU*HFSDLTlUmSYKOTHSSKTWDYTRrra 
DPVVLEVU©I0MPWELPEEFSDU)GWJ1J^^ 
OGNVXIJUtCA'nMPAXRNSlASVEKSREIXNHXEXAXKW 

CPn_0088 106352 108145 

CT288 hypothetical protein

SYWOOWVTvSECTACCMVIEAYGmJtVI^^ 
QEVKVCVFEBrCGAOlGALNnTSGHU^ 

iomiaisihnlwiytpvasvsdtu^^ 

ctynahtvvwuuuwogkecaptwqrwpiko^ 

kwi'fc'i pgproacinvi^khlskyaavdivilcacgeraceva/wi^ef^^idphtgk 

sxjmrtciicotssmpvaar£ssiylgwtia£yyrcjmm 

RIJDPGgEArPAYl^SRIAArYtRaSAI'ITIUJUSbU^ 

LAVVCAFCCHJKARJUARRYPSIDPLISVSKY^^ 

CSEJOaOWEVWEEWSMEMXYtJCAXLYBFC^^ 

RirtJAicpvFnsPonAj«rn£WSKXicrucLicr^ 

CPn_0089 108111 109466

CT289 hypothetical protein
LDCWKKOWYKWRJQ3MCTrrmTDIKGNLI 

DLWCVm^CVnXJTSCLSTCDHVTiaXSIWfEVTTO X 
EIATPTFNPVCRIVPRaf/RTNIPWDVn^ 

ADiwxccMcaTr^srrvEEsiao^FACKCVMriHiuw^ 

AVEEKKNVLVLLTDMTAFADAUCEI S ITMDQ I PANRCYPCSLYSDLALRYEKAVCI ADGC 
SrrUTVTTMPSOOrTHWPDOTWTTOrrUlNNRIDPTOLS^^ 
3DU*^IRIAV*DSRXATERKA*;Fia£NVro^ 
GWKXLAOSPTSEEVGXKAOLINKYWPXACLSK 

CPn_0090 1094)9 110080

atpD-ATP Synthase Subunit D
VUUCSMSVOVKLTOCPRLEXOKLAMjO^^ 

QAYERXYArAEIJSrPLCTKV^ICSrEtOSIDWDPXNIACVEWIVREVTLF PASY SU^ 
TPIWIXTMLSASKELVV1QCVMAEVSKERIJCXE»E£ELR^ 
XAVFLSOftSXTOVCCjVXMAKKK XEUIKARGDECV 

CPn_0091 110074 112053 

atpI-ATP Synthase Subunit I

'/RLWIHKYLFICRNKADFTSASRELCVVEPISiaCCriTTEOCKRn^ECLKVrOKLEA£YS 

^EALEFVKOESVSVEDtVSEVLTUIKEIKGLLETVKAIJUOSrVRVK 

RKTGISLRFFYItTHKIJNEt&EEOSPNVFYIJrrAWFDYYL^ 

VNELCVDLANLQRE I RKR3 DRICDLY AYRREVLRC LCNYWIEQRLHOAX ECCEDLFDCKV 
FAVAGWVIVDRI KELOSLCNRYOIYMERVPVOPDETI PTYLENKCVOfCEOLVOI YOTP 
AYSDKDPSTWOTAFVlJTSmVNDAC^rXFLMSSUJS^ 

A t LCLCC ICVCTTTTSFFt>15FSin'SVraEYSMTH , /l^LJO<AFinrLO>*l PKA YKELTMEY 

PStKAtRDPKATUJVTEIC^rESRYVvYDKFIDNIUlE^^ 

RY3r,ICWILFMSTCAYLWPIYl£TVSLIHYLFKVPYElJ3G0lCYY^ 

OH^WWIVEEt I WtOVF3DVl^LRIYAtJCUWAMHGATTNOMCARLPMLLC5 tVlLLCH 

T/tUtLSIHCnvniCUttiNFtEWWYSFDQQGRPLR^ 

CPn_0092 112121 112571

atpK-ATP Synthase Subunit K

l>7t JTJutKV::H 1 0M:**Wt rPALVLDLAMin^ tOCtlMAGV.VWAVMilR I DEO K*KLt(*M5A 

nr-;::«r: iyok i i juxm^wa i knctlspvcg t a t jl.t^gaallv:^ A-KCt:Kcr/:v: i qaya 
k::« t ywcYAAti : IVK:;f^.faWFALLLL 

•.ihJitm tu440 iiiois 

f rr :»i i hvi • ••»«*. ii.-.t t ptotein 

• .TAI.WyVKm J4LDl.Ht»YMn:M40RLCL3riLFHCLLLFlJ»YY>*::KLVn rt-TVLLAA I5V 

uitju x.*::Kr::ut^TEYV>:nEY:yuwvou3 ico:^:»OF/Yt^A)wvTwr.LP.';RMRKCLPVT 
i .yia/vyy* w wacr;rYicVN*yw*rRVYCuv;LEYr eu> ; : t trvHVALCsnHQRtvsRRii 



'".'•j»frfBri •^vi*:.rr. : *- ; :r:?T. !v:.wr:.:\; ■ ztwvtrvv-f 

£££mt«rW)LIKOt^YATY;kW 
ffjSttAQttAWPKATWKIWO^^ 

irTtiosciPiuwas^tou^cpEycLYSFcvviyriRLGirvTO ^ r mrar f 

KaMu^EMVEHXAaiJronFCOICANmLW 
CPn_0095 115956 UB790

pkn5-S/T protein Kinase

RFUlEAJtlAADUHPCWPVYTIYSDO)^^ 
DCrSVCAJTJXFKWCCTIEYWSrcitiUtt^PW 
DUi)IDVSKEEVLSS1WIPCRXVCTPtWlAPEIU^ 



FPYWU0raaXVLB»iaPSPQEVAJnfREIWFI4A^^^ 
ESKLKCSPWm^TTAIJWItSSSWiaJ^tLl^CT^ 

IU£XTI ( SiaOCli030TOXlXFTSENAI£^MYOGTCT%fU XKERTLSVSLVKNSXXXOKCS 



0DIiSOKrm^ALB0HNHSI^IJVtCTTVLIM»fY^ X 
fes0SXJWSOAVPtJAnJ«3CLYtJRALVl.YlWIAXSP^ 

Y T^Mr^ FyxLATEgfiRJfflCTAAJIJY^jSJCALVY/ORLQCYKrET If ST T 1*1 jrKYSQKPE 

ingJD3HVVYlttitESFYKltt«LALVFKILVIJIAIWI^ 

ti0PTVLELiuMI0mrXiY«S0TXPtILNSUWUW^ 

SSCIOIF^ESliEWKATEEXVErSPTOLCAriJ'AIOSIFlOCEDAEIU 

YIFt^AIOUttJJSOCEMFOALtWRSKWENFYTOYlJ^ 

DIYTEKOUCDEOKEtJVLYCCYlAIJOGAi^JUW 

|Q3AX^0ERJUUiJWF^YFHCt/aO{DEIu?LC0THYKL^ 

CPn_0096 U4347 U6837

CT296 hypothetical protein 

PTTLSIUlEfTWCSIJVYVSSXKVRNXJOWSXHrKSEEX^ 

AORXRYX STI^PTPTATTITTUPNPICVEEXHGWPTI AIKOKHFSHYSHAtVCSTTILrSM 
t ir^J TTJfflARBPimCgVIJLYSXBCVLSTIM^ 
OCPnCVJO^KPIYSTLTSOXPEDCSVDmOTLIKSWXAR 
CSVLSPEE Lm P ttlKIWlPPV TYTP L TOOX^SPHAlJaiCSl^^ 

. HLsiicEiccsriuacssYtYtrrrraALAxui^PNifrpwm 

VRLFXXmXlDOATYKVIACVWXCOK^^ 

r UAIXIU-Xl I ICrQa i 7OtW HVrPSICWCSPSLSI0EXLQCU03M«STt^ 

XLVIHEEMSlJu^XDXCPGACXFOCEVLFNCXPECriiOSSSLTAXY^^ 

ESREA^TSVOIITEATXHKLia^IRX^tJuUXCVIGVSGSCK^ 

OZNPIWUaT>CCICRlJHrrRDLPCR£OR^IPLTYIXAra 

AHFSFWPOCACIOCOSUnKTISOOCrrPIPCSXCOCKRYHSEVU^ 

TAY^OCTTISWPiaHDUHAirStJUJmJM^ 

TLYVUJEPTTOJmroiOAI^tVU^LTYLOrr^ 

GCYlIASCrPIOXXOLKmAJUkLAPYII^U)XPAAnCSEPPSSPK»XLXm 
KHIDUOJTOISLIAIACPCykSCIOiSLVrDILYASOflAYAEl^PYITO 
CEVKSI^PVISVRKCSSSKRSYKTIASAI^^Frff JTC^ 

TI IDSLUCSY^CDYVTITSPIPtCSDlXI rLQOgJKECFIlCLYSEnKLYD LDERLPL WLI 
EPArVIOimCVSPWlSSSLI^AISVAFSLSSEWIYISOIOXORJu^YSLCWiaJWa^ 
EITHOU^SDHPtt^LTCOCa^IIJaSLEEHKEKXAHYTPLErrSl^mSY M^ 
UJBINASOPlJEtXTTIOEFIJtrCRGSSEFPOO^^ 

CKCSCXIIOyAMYVRXrarrSXIiJIYOEIMTFUSFtirr XCTCOTRS X XQOUtflRlTrXSKV 

CLSYITt^R0DTl£IX2>IYftIJ{lWUCXXSX& 

LVAITOmaATOItSCSLIPKADKAIFl£PC^POaGFl>©SI7rEW 

CPKAPLS I SKANHTRCSDRTLKVNLS I HH IQNUCVSAFLHALVAlOGVSGSCICTSLLLfC 

FTKOAEIiXAKE7rTTFSDLWIDSHPXASS0ROTtSTYFDIAPSLJAFYASLT0AJtAl>II 

SSTTgSTim tQGO C S OCOCUTfOWDRAITfALEJWPCtTCSCFIlIQPW 

ELLHTPI ETVALRFPFIKXIOKPlJCAtAOIGLCYLP ICOXLSSLSVSDCTAIJCTAYFLY0 

TPrTPTLFt.IKlJ^l^PIKXOHLPEKLRSLlNSCHSVIYIDHDVKLLKfiADYLIElCP 

GSCKOCCKLLrSCSPKDrYASKDSLLKKYICNEELOS 

CPn_0097 124549 126006

pyk-Pyruvate Kinase

DSMITRTKI ICTXCPATTWPEMl^KIXiSACKWARLWFSHCSHrrHCOAXCFLKEl^EOK 
RVPLAXKLDTKCPEI RLONl POP X SVSOCQKLRLVSSDI 003ABOCVSLYPKGI FPFVPE 
CADVL I DDCY I HAVWSSEADSLELEFMNSGLLICSHK SL5 1 RCVDVALPFHTOCDIADLK 
FGVE»NMDWAASrymYC£DIETMRKCLADUWPI(MPI lAKIENRLCVENFSKXAKLAOC 
t HIARGDLS XEI^^PNLOKMMAKVSRETCHFCVTATOMLESMI RWLPTRAXVIOIA 
NA I YDCSSAVKLSCETASC AHPVAAVK I HRSV t LETEKNLSH DS F LKLDOSNSAWJV1PY 
LSAICt>CIOtAERA£»KALIVYTES«PMFLSKYRPKrPI 1AVTPSTSVYYRLAUHC 
VYPMLTOESDRAVWRMQAC IYC I EQCIUaiYOR ILVLSRGACMEETNNLTLT rVNOXLTC 
SET PET 

CPn.009«» 127494 ! ;!♦*»•» I 

No robust homolog present in Genebank/EMBL as of 11/7/98

l .VTIKKFHQ I KRT I LEA PLYY LV.Tfl t tAL , ;R1ITI'W.'*FI.TiL/»Knpr;f tAFYI tlJOYRKTAL 

TNLALAFPEKTFOERYK I ARQCLOML I TTLLIiU AI ECL'A W I OKU IT I VTJT.'TOIPICCFS 

:;KBvrnriEOL£ETf , KNi^EKcr;LaF'.'';ii<.wwKi.nKi. / itkwyh hafakaifu qrui k 
k iFAUfEVFKOK ivrrKi«ii^iF^\iirjt ;kk> ;r\rt :r^AUK;::Y*pYrLrt^;pArnTd 

TALI AYKTHPTV I AVNV! 1ROAKCFFT/ 1 ^Kl .7ANX:'-M WK! n/A f LMIJUlt* IFUEKGX A 
Wi'FJtAiMWtHKRWKRKtrTMVIKKKYPY".!! ll.WH ^iV:«;ilK: :KiyAIABCf:V?TTLMLAL 
I /lAUILEELORCKPr/.TLtOt J*MnOOI .* Al .1 tlf "/I A 1 1 hLTHNf 4»nr.YKUFHKT'»*AVY 
: :k l*Pf JW:UJ» I r>OAM XN; :i Ji I FY: :r i ii .k i ik k^n kk vr : :r n mi I ,JV r 

No robust homolog present in Genebank/EMBL as of 11/7/98

r/f -Arrrrr jc i:;HFAAKtiArw<«Mi u. : !vrrm > Ki > AAn .i .i i*v f f *KArfm -w irm isck 

f AWAK! I PADTVATKAf Jl! IKUiftSSSFtl . I AA: :K li-l -TV : I Kl 1 1 K F>11 .TRK 
129996 127141



T0U hyrwrnetical OfV.«m 

RToilPTFtLLCLETM I KFLXLF IRHWPRKWSICFA I Z fiinL\TOSV?ZTRTLT>JVFVR 
KHNLVSVOKEINrRIQIKSVDANDirVRLTOYHTDt 

YLHOXVSGPKEY I NALKEOCLCLTnJLNX I3FE£LERNRIA0C3KDEI IFPI P«MKK 
rPFWnTOU^FOAOFUlUJlJeirciFl/rtilt^^ 

- - • • ■ •■ ■ ■* ■ .•.-.r:.:.Tr:: .7 :*.vr.T-;rT *- :twa : -r'.rTTLFJ. 

I • ;.-r *:::tr :: mi «***•?•-'• ;^:j\:.rrA:.. ;Krur.:A2:v?;ywu 

TKTKETTKLYKKEW 

CPn_0101

ybbP family hypothetical protein

PSTLCNrSOVTTOGPSKTMPrOITYTfTTPliEI ILIWVWJmAKTFW TKAMPWr^ 
AFLTLFVUmKLHLP t ZRRDCHWNIAAIWF 1 1 FOPEIMAURIRT HCaCCTTIDPOE 
OrVEOLAASITOl^EBOICALVVLENXDSrrrn^FSSVXI^TrSEELLrr^t^SPL 
HWAVlLICOlIAYARVVLPlJOOTTOLSRSICrPJIIUALCXSORSD^ 
SLSRDCLLTRGVKXDRFKAVUtS ILSPKEWOUCPLFSWIWKR 

CPn_0102 130099 131466

cydA-Cytochrome Oxidase Subunit I

FY lOFWKFMnALXLSRIOFGLF ZTFHYXJVFLSHGI^MKLVZMinLYLVTICKOIYTIC*!^ 
WJCZFAiTFVLSVTOIHOIFSI^^ANFS^^ 

^F^KVSKKMHFFSTO<VAJJGAHMSAFVI ICANSWM0TPSCYO<N^mC1^PALTSF 
WCWrSPTTIDRFIKAVLCTWLSCVFLVI SVSAYYI*nOCRHHEFAK0QOCICTlCAVIVL 
\T£UCAI^ARGVAKNQPAXLAArEG I FKTEEYTP n4AFCYVlKEKERVlGI£IPGALS 
FLVKRNI KT PVTGLOO I PRDEWPNVQ AVTQLYHLH I KLVCVHVALTL *SWSAYRGWWAL 
KPFTLVILTFSVWEICNECC^AA£MCIttPW^^ 
FSLVTI ALLTLFITVLCKICI KKGPEEENDLTEFEVK 

CPn_0103 131465 1325U

cydB-Cytochrome oxidase Subunit II

MffilFMEI^TSLLPLAWYVlLCVAVFAYSFGDGFT)!*^^ 

PVWDCNEVWLVI WCCtfACFPACYATIiSIFYMPIim-VliYIPWKSIXntSK^^ 

WXIFWDIlFICSGTAISFFLGTrVt3^LGIJ>LSPOTSYAS 

3AFAIHCSCTAU«rSOSUIARlA00miLSSPWYVLmSASUSIP 

l^LLZALTSCCCVAAlcrSVSKKRYGYAFlYSTlJJ^^ 

SYTZYNSAVEITntJCSIAXZVLXGIJTirrrrW^^ 

CPn_0104 133854 132676

CT017 hypothetical protein
EXSMRMLQZSHUJALCTAXNSPAIYAADSOSVSFPE^ 

aCTI IRETSKGDLVAVtCESXDYYVI SAPFG ITGYVFRSFVIXNVVIBEQWNVRIjEPSTS 
APVLVRI£RCTOIOTASQEPHGR*I£WLPSQCVr^ 

AMDUNSAU^AH lELEKSLNEZOIXAXYiaCINLVCSEEFlCSVPGXOCLZQICALSIOOA 
YX^XSZXSOWTSIASSQCSTPKVSSSEVTTSUSIWZIUCa^^ 
RrWASMQOtaiOKSEALTOEAPYRAEOiaacOVLAQaEVYPHV^^ 
^YmtNUX^AAXJlVTVECI^RPI^FAPTAYYVVCIRXAS 

CPn_0105 134883 134039

CT016 hypothetical protein 

YVPFRXF5NQHPKLLI YCKXXEIHLQWPOTAK ZRFTPK Z AMKVKINDOLICX PPTI SARW 

SOIAFIElSOmENKIXOTJaia^IIX^ISII^LTOSIIDlATO 

RlPSKLGVCVIiWUMrrKr^ 

OSC»KLYIVTOT^PSI)0FSVYl£PPZGCTCGEPBCEHZICAVLYT 

CPn_0106 135073 136374

phoH-ATPase 

EKVRTONKKTMVIDTSVFIYDFEALFSFENrRX I Z PFPVZEELEAFCKFIUSSAKKASRA 
LSNI RIXUNAXTXVTDCVI^SCSEZJIZ EVAPI^NOCffUlCXZXTLELLiKZ I AJOIEPMVF 
VTKSI^RKVRAEALQIESRDYESKRFSFRSLYRCrRELOVSOEOI ENTYKNCYLOLPtCV 
V^SPNTYFTHSACENHFAJ^YYVSECKXIAZJCAMDKSVVCZKPUrrEORC 
VKLVTLICOACSCKTIIJkLAAAKHICVFT3ICrnfNKVLVSRPrVP« 
HWMOPIYDtMEVLF5ZNOWTO5EAU2ALHDA10CLEHEALmRCIU 
LTPHEIICTIISRACKCTICIVLTX^PTOIDSLYFD^SlCLTYLVCKF>WI>I.'/CHMFKrR 
TERSELAAAAATIL 

CPn_0107 137321 136392 

CT0S8 hypothetical protein

KKSPPPVTPKE I PTOPKPP I PQRPEVSFTPTDH I VPCS IEASP I LGKKPSPDSMVSPLSL 
FHmLLEKWTPVEEPFPWPPAEKNOK IFAWALNOSKLI FVSTSCN IAOPRLVTDShSMMI 
VNAAWTMSRDCACTNOVI-SAAVSVDSWCLSORPLN PERCCTPLNECECRACHWRJIACCS 
fnnGKQGKPKYLA0LLCPKAVDKHNK50AAFt^t04AYlJ^F5LA0TlX«\^FL0IPLZSS 
G I YAPPENRKK PNSEENKVRMRWZ HAVKC ALVAAHOEFCNEPOnDRRML r/LTOUCTPA 
rTOPKKKSKL 

CPn_0108 137PS7 137303

CT019 

KNLFHYKAI LMS IFNEEVF I ISKRHTPLCQTSTALRNTPLVNPLHRTNLOR I ASY IPIFS 
TF 1G IKTLKC I S3LOY SMVLKTCNFSSVCIcrLPCPE I YEELPKVRXEAWLEIFCIKALYY 
LVL3VIK Z IKLIVRYLCPCCRPPEPREPONPLTPTPLOMGOO IDAIFSTPT3PTCFKDPF 
LOOLLQEOKKKAPHL 

CPn_0109 LJB646 141783

ileS-Isoleucyl-tRNA Synthetase

RQKMTADEVCKN3FAK KEEOVUCFWXDNQ I FEKSLQNRQCKTLYSFYDCPPFATGLPHYG 
ItLUVTT IKDVN^RYATWDCYYVPRRPGVIXnCVWEYEVEia 
EECP.K [VFRYVHFi/CTY [NRIGRWV0FSSTWKTH0A5FT4E5VVW 
VVrTrrTAl/TTPl^FEASONYKEVODPSLWRMPU^ 

■ lirTLVYVR I0W:K.T;EX)WIl^7^RWF:^PEEFVILECFCCKDLVCRTYTPPfTFF0S 
KREETCAFKV [ AAjFVFXCECTCVVHMAPAFGEBDFLVCK ENHVPLVCPVOAICfclFTEEI P 
WVn KIIA WEI I KFCJCKECR I FYICTVKHRYPFrWRTOTrL [YKAVNCWFVAVEKIK 
I •KKLhAN.'i:'. OWVPEtl COti^FOKWLD JARDWA C^RHRYWtTTP I r IMK3A0X I LWHC I 
HRLEELT* rrOITOtllhUFIDOUl IVKWKmiR [ PYVFDCWFWaiAMPYAOrtHYPFENQK 
KTEEAFt'AOF I AEUl iO0TRnWFYTt.TV I :*A 1 1. FOR PAFRNA t VNC 1 1 LAECftMKKSKKI J4 
NY rt;i'K-/VU/TY*^UM,R[.YLUU**VVVKAEDU*F30Kf; I Ef .VMM I LLPLTriVLCFFNTY 

AKi.v;Fmit:5un 1 ltayte i cyw t UMVft :wi;kvi<e::m wyi ilnkaveppht idk.tn 

WY 1 1<(* -mm hVEAFmrrWKAAFtTrLYEVMVFirKV I APFVPR>ED [ YOKLKLEKEPES 

viurr^Kvwr/Ki[xPLEKRKirotRR[vx:u:i(::uiKotKrxvROPi>Nrrr/r^KORLS 



SDFOCtl*VDINCHAT01C:T7SwtOC 



CPn.0110 M'J 7 » 
lepB-Signal Peptidase I



14 182"> 




GOTff SofTOFlIoSYC RL t FPCTSMYOQ F FOHKEVWOOEPMClJtOPKLSPVSYAOLIOC 
Sy^ZtTEWARTSW^NP^^ 

T7fKSS£l^LTT3PFIVAOCCAY1CYHOFKIKrSCI^ 

s!£S^fcxiryxuc3:hplto^ 

FP14QCRLYIMDSPVF Z lO^PTLOKFVTSETEItOESSSETOPY IAFVQKCLrrn^kfFVE 

FlS^OvTKCHVLN^^rnWADSREFCTVF^ENl^ 

PTTI^VSCZAMTa^LICYVYYOKWWUT 

CPn_0111

CT021 hypothetical protein
OL^PmpWsTYTERILOmMKKW 

rooJiFiJurrcDVTVLSRCsoRTrvLVKSTrprrvwiEi ikfpciahkerpsueoasuxt 

VIHOLKPSOVT^rt^SEWFFSLKnmCSLEPVCK^ 

IKTKCDCENXP*«PX^FEnAFLTSISVNAW 

WVSIETPKCTSIVRAVOICHCAT^PYVYSLPOSICTO 

CPn_0112 144743 145093

gatB-(Pet112) Glu tRNA Gln Amidotransferase (B Subunit)

SHPFTOSWCnmACCRVWCIXKRY 

CPn_0113 145329 146405

prfA-Peptide Chain Releasing Factor (RF-1)

CFMOaCVAEYtMUAEvXIIUSHPElFSN^ 

AOCKOALUOaSPEttVVKLEEClllENXVXU^ 

CCCTAALFVCIX^MYm-YASSKtMC^^ 

CTHKVOKVPETETQC^/HTSArriAVLPEPSEE 

vtdsavrztklptovwtcooersokio^kdkamr: 

CSCORSERZRTWrSOMWTOTRICLTLYNU5»^^ 
CPn_0114 146371 147261

hemK-A/G specific methylase

VKPTTSYSNKEZ IQXAIOECTAYliOTYOVPLSDCEALY ZtM)LLEVSSRAXX/X9LVaiSCT 

MUffiYWOUJaJUKRCFTAYl^VSrUaJUJn^ 

EZC7Fr13ZCCC5GCLC^ZXXSCTllVEVVn^Z7VCPCA 

APYTRPADAFVCNPPYLSFNCZZHXDPEVItCYEWKALVOCSTCXl^ 

ST<WBWtXZCSSQC£5Zl«ZFS]«CZYX;RLH0DI^^ 

CPn_0115 147279 148622

ffh-Signal Recognition Particle GTPase

KZKSWIO^ZFSFlVSSRRXinraZSESZREVIUALLD^^ 

CEIIWMVrSPQCCFITtCIitEELVArX^PSREErTIOlCrPS Z Z t AiCCT.QflAfllCTTTAAKIA 

tJTVIIWiaCAKXVLVVPCKJOVAAVDOU 

NCJCmLOTAGJU/UWUKEELTAIOKV^ 

LTCVXUMTIXaARA^VFSZKKVUKPZKFECCCEItZ 

NTVKEHR£YlSE£EDA£I>SIQQ^WAAFTYnyrnC0MKAF 

SOXeXEDSEQGKXRTCAZ ll^KTTEERKELVXLCKSRMW IASGCGLTLGDVNOFIUCOHfi 

c^iocrrjQ3Msrcao<EOVT«KKsaGM><R 

CPn_0116 148592 148972

rs16-S16 Ribosomal Protein
EKNNmRXSVALlURXJIOOC!UtNKVmLVl^^ 
ERZrrVL£RCAQLS5KAEAl.VXQCAP<^SAlXSKOEAIUa 
AAXDATK 

CPn_0117 148983 150071

trmD-tRNA (guanine N-1)-Methyltransferase

TOQUDII^LFPGYTIXTlCnZLCmiKORIXTO 

HIJ^tAEPVTSAIRSVTUCENSKVIYl^PCCALLTAEKSRELAAASKLIIXCGKYXCIOEM 
IESEVDEEISIGDYVLT^CGIAALVLIOAV$RFIPCV^>G^^JESAERDSLE^CLIXDPOYT 
RPREFEGKrVPEVtl^OMKAISOWRLXOSERRTYERRPCX^YUrfLYKRSrDHKFOEITT 
TNRWFKCDKZSVVlXVNKLXfUUCNFYCKVTCLOAHSCEinC 

KKNrVTt^t^LDCACEECFCYLUyWEL/CGKIiEKOADEHAVWALAOT FSWK 
RKK 

CPn_0118 150075 150464

rl19-L19 Ribosomal Protein

KKENFRWYI KVNLUCELEJEOCRNDLPEFHWET IRLATKI SECCKERNWOCTWHARR 
GGC3CETVSLHRVAYC BOtEKS FLUIS PR IVS I E I VKRG KVARAJU.YYLRGKTGKAAXVX 
EFVCPR5SXX 

CPn_0119 150520 151164

rnhB-Ribonuclease HII

LMNTS tSEIORFLSH I AFEKELV5E0FSWAC IOEACRCPUACPWASAC ILPKCXVPTC 

VNOSKKL5PKORA0^DAL>CDPEATCI^tCVISVEP.I00VNILEAT^ 

PO t LLVOGL YLPHO t PCKK 1 (OGOAKSAS IAAASILAKEHRODLKLOLHRLYPEVOFDRK 

KGYnT.IUIVEAIRRYRP.-Jrx-HRKSFSP [KOMCAIV 

a*nJM20 iUl25 10I77B 

gmk-GMP Kinase

EJXmiKAN-/r;YC»tlK I UTSPFrPDWKCO'KLFT I SAPAT JVTttCTTLVRMLEOEFSSAF 
ALT I ra/TTRK PRECGV IX DiWH KVilHEEFC'R ( .1 •DROALLEWVFLFt ! ECYfTTOMLE ZERIW 
rU/lKIIAVAVTD IC»*-\LF IR::nMPW:; I F I Arr-VEELERRLA-TOWEEaWKERLEHSL 
tEIAAAN0Kt//Vt [NnUlX^AYPVtJ!::! KIAKUIRfKt. 

»:iti.«I2i i*.i7i. f i r.;:n».n 

f.TOtl livif.rtiff pt.ir. ; ii ( 

Ml (M IKKW*FTNKKLNKU': \:f'r::i.VtlVA t K'JAK I Y. \ AK' :r<VICl'WVA 1 bTLVlXORD; I 
OI'KFTKK WJTK. \ I TV*:KKK:*.KirrM: I0r*1t\*. lAYTVCOVK 
. . : .. vm- at. V; *<.'*-*•/: . i'-'n^Ai*"::'' • - mswsa I

LFCA^O0KLtA*»XJY P * * PE3AVA IWEHIJ PKSLENCNLOTMYARCLWHEtXLDVINEE 

FHutspRurrrvc 

CPn_0123 155975 153774

recD-Exodeoxyribonuclease V (Alpha Subunit)

SPS^FQIHSYBSPliYEYRGVTWYLTSI^^ 

ERI^EVSGISrTRCVSICKQLCEOXIUUCTliFia^ 

EDPFiUJUtEKECIGFrTADFIAKKlX*^ 

OVVAKlIJWTI7TPITLEEI3T0Il^ 

LJCHrLFSSRRIRSircattlAWNTCENI^IDlAOTR^ 

:TOAiuxFEOvm«x:iw*FTGiCAAic»^ 

IDCDLIIVDESCHMDtHlXJWriUCALPDmW 

VXIUJKiniOVHDSGIVrNAHRVNECElW 

KFVPOKYHIYTODIOVIAI»MiaCCTIfirVNL^^ 

IW*m«EVF^IGYVSTINr!^VWW<ECKHV^SFSn^ 

ESPCIIIP IKTSHFMMLYRNLLYTA tTRCKKLVI LVCTKKA I AI ATFJWRVOHRCTCIAE 

VLKELOnCXNYADL 

CPn_0124 156575 158068

No robust homolog present in Genebank/EMBL as of 11/7/98

irskortvaitllvlgilliasg: irwvAiPGi^AVAicuxc^Aurr«xiTOVL 

LIWDUJ^EOVEIICOARTRVNNSUX^OYVrn^ 
TNlXODIEEIFLTLRDIRMAUNEEmmiAKQCLAOVCTfit^ 
RQHLDrMDPRWSMITKKVKSIVVRriYVSTWKQIKSNI WDFGQIiUO CJiMYKTI EE 
VLYOSFOKCTNRAALLSBCna IHTSStUWEXDSiKKmiK KF rA SH t. P ir KKF RH.FL 
CLSE£DVXDFTGASGWDCSiaj>RKEVPtJ30CT^^ 

3DHLSVlWLSAAnJAUCVnJV0»raiREWJt« 
LEXEELPLVAARMStENS 

CPn_0125 158072 158605

No robust homolog present in Genebank/EMBL as of 11/7/98

KISSCAIIMSEVKPLFUaraFtaATORFONLIlt^ 

KOFVKRCIEOFEARCLCVLKEEIJkSLTBDFHOKAKAETSMLIECW 

WERWIWAERYRDCKGW^VOVEOKIMISSRW 

CPn_0126 158806 161085

No robust homolog present in Genebank/EMBL as of 11/7/98
' IXN^gYYQg^FFFSGAISSCCIXVSLCnrCI^LSVU^, 7 f T T AfiTifJ FKIQStgJlEVPK 
AP0LULEDASEItXJtVKASltSLA5LPXnS0USnR 
SRiamAAAONYMXSSX£EISEIl£E£EHHLIlJU:ESLgMIGKS^ 
HLSEWlPYlAVNDPRXailTEESWEWSHFIWWSAnCXAOILnafflffl 
ELLETTrYXSLJCilSYIlEXiXXSBWIIHIWPLFP 
EKTFTWII)EECAISY14DCWDFlllESIQNlCKSRVnRDYIST1QCIA 
PTTECKIDWjnAOIUFEROSOEPYrLEHrETKVRl^^ 

NAMDIJCESFEXX OXERVKYQKEOKLYWET IDRNEQELKEEXGEStiRIANlUUCGYRAGYDA 

KESIi^VDVPOCLVYSYYEXMEAVVRNRI^^ 
PEGHmiJCERELQETTLSCiaaJCVAQORT SFT FSRLSRR 

CPn_0127 162152 161130

ytfF-Cationic Amino Acid Transporter

£SFMFPSAWESRTRNm^lFHCn.VAn-YVCrVFVIPKFli:SFCDI^rVLTRYTrrCIF 
SLIACAJKNPSVIKKTPLYIWRKSUAXri^NIWfrCITW 

AVLYHSNTKOKELPYSLLF AX S SVI I TCVI LTKLSALMLPTAASPLYS I LEV I AVI LSTS 
LWVIYVIRlfcStl^KHPKLTPDTWSYLIGISAMICLW^ 

GSERLLFLLIXSAMCIFSSAKALIAW^^C^SLN^^PALLCAILIFEPIFGLVLTYLYSOSL 
PSLOEGICXFIJCLSSSLLCXNaJCRXVQKSLENSOVSSSNE 

CPn_0128 162262 163053

bpl1-Biotin Protein Ligase

EDRCRMLRNQVLVYC5ECVS PYYLRHT IR FLKYYSTQEGAFD ILRVDGNFL I KNPFWEET 
TRLL VF PGGADRPYH RVTJiGLCTAR I FQ YVSEGCNFU2 I CACAY FC S KM I YrY EP EGAP L 
OOAROLCFFPCTAICPAYRONFSYVSPSCVRVSPOLFSOFCLCYAMFNOGCFFECSECYP 
fr^IIESRYOOLPCKPASIVSRIVSKClAVl^roiEYLPHYCRWKENVOCT 
TTLORYCONLVORLROPAFSKAOC 

CPn_0129 163747 163064

similarity to CT036

DEOY ILSH IHMOmFVTSEPUJKTYOKLOEKHVKNLGI ASOVSLTOWNXTQYENNLrE 
TTTNEITYYFPWHNPDI LRSEWDP X SNQLYLX FKKFF X HYHNLFSTALERNOXLLIDS L 
NTCSSMP t ARQMEUAFUCVF EQU7YNED EYT I EPR DYFNR FVYKNSOT A PQIOSFGLLH 
CY EEMSYASNN t RNVLHtS IVLCSP ILYQL [TEF0TTKIHADDFOCL I 

CPn_0130 164251 163751

No robust homolog present in Genebank/EMBL as of 11/7/98
.t^MVKC:;:: I J 1 1 ENKKPACt .LPESKFAA [TKLSLAILCLFLCEAAC I LIALSnLLTNTLL [ 

rALTii:; i t vumusu, [f7Tce:;K.^KD£0KrK.: rFPKETP.iLOPWLLNPi jcnk com 

KTt.LLDPT:J I NLKNFJ.PFr:;FEEWKK IFUCCPOFLI K.'JU-AWWX ( LE 
fhijilll 1 1-444 1 It.'AHf] 

!••: :::urKmK; :i .. :i A m , iFFFT:*.AYVFP:t I CFi JXFmwiam::: :::fvy» : rra* 1 1 J<Ti VAO 
to/Km* w : iovi4-;Tr:wLrinu.vi*AF r Fr-ovti vfvi.t i au mm i ::i.vlfllxrsv 

U^IHVTiWI Mi'SKKi'-YAUtUMW^ I .IJVXkV/J 1 MX:!* I KVUMMl-;* Iht l'KDP*JOAA 

vi .i i „•;! vrKi ■: i tvi vkju .i ,i * ;i vkkw x y i or^/i+K ! - :n i kr v: :i .i .vh w :Arru>OLNEO 

• VHIIKtlNI :k('1,KKINKKAH WliUOt^IJCIIE rM^U.EKTOVCLDl-.a4::HyV.'?jAMP-JVYRY 

i .ut.*M.rr: :hi j« Ti n.t^a m ;nwitt *i atojpkm ai>:;dfijw -knvhw ;kf i jai;bk 



ALLKNFCClCrKDLKOFLV, - 

SrCFS^HTOTAX*:£D!OiACR*WCl^ 

a^?1^vIawa^vfSf>y^ 

i^llFi^P^roOFWKLLKDHFI)UCDF1CKR XATWrRKAYPEXRLPKKHCUSKS: 

mwivj "' :.. ""*' ' vy '• "•"*■"* 
CPn_0133 167349 166564

k^Iymf^^i^?vSyS^ 

I?I^LAWWAAEIEYADWSSISKLLTIC0L£ISE1.I IH^NrrXfTTDWCTICT 
WSLVvS^POKETPSNUtflDRAP^IRJ^Fl^ 

HTSSAkSpkSeWsI^^ IKPLSCOAHKHFYSSTPOrOOWJ* 
0 nfTPCTPTEEI XCF XRCLT FH 

CPn_0134 1«9»» i« 7 <« 7

FADYWtSRTTMAAJtNII^^ 

TTScm^xxximHEJw^oMW 

GAOTmUMIDKAWVVVOEXJCKISK 
SXSxT^EAJCFTTVLDVVEGMIirN^ 

^^JvLoawI*^^i"EEXECiy^ 

CD1A ILTGCO LVSIEJGKKI^nTL.AHLCKAXKVI VTKEDTT I VBCLCNK PD IQARCIU I 

SlEBSTlSrTSieaClOTLAJCLSW 

CXLPGOTAWRCXFTIZAn.^^ 

5Ju«SANECYDAIJtOAYTWinACILDPT^^ 

K5SSAPANPSACKDY 

CPn_0135 1«W48 169143

groES-10KDa Chaperonin

KStSTTI^ IKPI^iXffllLVigtEEEgATAROC I XUOTAKKKQPRAEVLVI ^t iUJt! D OCT 
LWTCWrailliOCYACOEITIOOEEYVlWSSEXMAVLK 

CPn_0136 171419 169569

gg^ffSffEALPTOTOVPPIW^^ 

CTSHVOXXtlPESI4£LUKItrSVERiaDOLYIYAKUTO 

FSOEIWOPAilAl^EElWAAlAK^ 

j£iranU^SSUDAEXPFGXAnSST«EEKPLSK^ 

YOrWfrTAJaUCJCVOAmJEAKW^S^ 

RYTKUOCEAlJaJCEfKrYDVYAPISOTTSJCNY 

LSHBWVDRYlNICHKItS»YSSCOT 

OPYHIwrfPIJXA£XASTrNEMLI><EAI^XSro 

aWt^iiUaaeoctj^teot^tygkloketycx^ 

OYATCIIAAX^FA£KXX.TOEKIAlJXYWFXJCSC^FPLNIlJ^^ 

ArrnaaDLLSSLtSED 

CPn_0137 172263 171502

ybaX-ACR family

VCSMNVACU^KLTTU^iaFOnYGPrCLOVCDPOTPVKXIAV^ 
AJIVLIVHHCXPWCGMPYPITGMIHJa«Olii™^ 
NWHDUCPFCSSLPYU^^SF^PIDIDSFIDU-SOYYOAI^^ 
GAYRELSSAATSQVDC F ritWFX)EPAWSTAlXSNINFtAFCHTATEICVCPKSlAEHXA8E 
FPISTTFIDTANPF 

CPn_0138 176094 172700 

•hemL-Glutamate-1-semialdehyde-2,1-aminomutase

TNSRX^LAIKXX}LIflN»flCLTKRNSMLMCSNOI^^ 

VTPPIVSSAOCOIFtimCREriOrCOQWC^XHCHSHPKIVKAIOICrAIJCCTS^ 
EEILFATKLLSSLKLKEHK IRFVS SCTEATMTA VRLARC ITNRS 1 1 IKrXOCYHSHACTL 
LCCXSTTEeriDMLTSLIKr PSPHSLLISLPYKNSOILHHVHEALCPOVACI IPtPICAN 
HC3IVLPXAErXJ»XXEX£KRFGSl^IMDEVVTCn^Af^GAOOIF7n^PD 
GLPAAALVGHRSXUHWECTXFQASTMSCnm^NATCHAAlOX^OSEOrYOm^O^ 
LTrsnEEEIRSOCFIVSLWOCTKFSLFrrESAPTNrOEAJWSIT/EXrOm 
VYLSPS PLEANT I SSAHTEENLTYAON X X IDSLIKXFOSSAORFF 

CPn_0139 174686 174093

SPT1Wltumn«IPYARLEKCSLLVASPOXNC<^AR^IUjCEHS 
FEISDDIFTFEKVSNHNIRiTMOCPLOAWO^LIJ^SCSErPEOTLEICPSVYLOCDLPFL 
OEIASSESCPEXNtXrFCYSCWACOLEJCEIT^NIWLAK^DYVFYSEPEDt^ 
UMKYA3LSTVP0NLLLN 

CPn_0140 175140 174673

PRSNOOKXf^SI^KELLEETPL'/liNFYKLVSFC^/ACHILCTEEKKFAIYCHVSMOOA 
FOGAXTTEOISPORPFAHDLLJJFVrSCFDIOVLRWINDYKWWFYTRLFLEOKI^EFLW 
VtFVDARPSDSIPLALTHKIPILCVKSVFDAWPYEE 

CPn_0141 175817 175110

rpiA-Ribose-5-P Isomerase A

HS55AVEKDLMLH EKXCLAH EAATQVT3CM X LCLC5CSTAK EF Z FALA1 IR (OTCSXAVHA 
lASSOJf^ALAKOLA I PLLNPEKFSSI^LTVTX^EVDPOLRMIKCOOC^ IFREKILLRA 
AXR5I I L7DESKLVFVLGK FRVPLEI5RF0R5A 1 1 EE I RH LCY ECEWRLOOTGDLF ITDS 
:^IYLIFr;Pt«YPNPEKPUJaXOIH^/IEVnP/XElCVEVWSSNSOCLI^KKYSV . 

CPi».»l4^ I.V121 I7SHU. 

No robust homolog present in Genebank/EMBL as of 11/7/98
fM-'IYHK "F/JTLEKFI 1FK E LOLLTTKNr; I U/PC.'JH FE tnUVnilDNAIOK I R:!YPIJ(PIAEN 
V INTl.-PFOLKtOYrXCV*;JKRFPFLYr: VIP I XUWIMVTfm 

<:riij)Mi i:vm7 iv.2M 

l*nTIM(T.*;LKnrLK:iIFr^"VYV:FLRI > EllLFKTPE."LKDrj" [^LUOL/K^tFUIAEODLIKK 

'jkaa* a*^K rra :efhr.\twi iylfmw if i k;-a mnpATBr»VFFX/ ;erah t ddtaltdk vjvr. 

IHIlWMIFKFVKAliEPKFTTAK'rfTItPAIWyl'XKV^IFPItfirEVrRKFYfTNOELirorVA 
. ;VHKV I l-W.Yt>AtX 4 KYU , H JWTHO U.71iCK*A;r.Wff:iliCKf ;U0Dl,IOOYIiINMUV IAD
P POCLWTIl J P/CRGNYKSKPTAiZTSYOFTAXPLf . A"0CTY LXrOM CR3SDF3Pl*TFI
XOtTV:U*XVTSrn>TLOnC0CV IAR t IIC*A0YU»lXnL5;JPOCaFA3CEICNW.TCi; 
GOWAjrVAI.VXEISEEVWK 



.'ErUJ^rTEUi t AltTXAAttEAKRYVtEICCaCrtTV 



CPn_0144



177142 



mow 



r^FNwS^EA^^ 
NpnujrrAVKDAUiREPTvvEfiEVDPK^ 

AANLLK P ALAFGTLHC I CATTLNEYOKY I EKDAALERRFOP r FVTE ^*f^YT H^rnr 

FXhTSNIGSPEXJraSKJCGSELTKEAILSWSPVtJe^^ 

arVKXVGIOMRRIAOR^ 
oKALLKCDIKPOTS I ELTMAXEVLVFKKVETPS 

CPn_0145 180717 162369

ncaactiwlnx^^ 

VISRBLXWETCOOT^PYKliSLCAYOVL^^ 

tLTLG^CFSPttlSLIOKI^ICLSLAS^ 

RAlWSNCLDILAIWHAEWRYLDOSIOri^ 

EtTOI^FtJSvTtBUCFTWCLC?/SSLYr 

TTAflGRHLPTASYCDCLDLEIlLOWJTPEDnGL 

YUra£L0EU^rV0ILGGKKX£GXSLIGKS^ 

PGSNY£EIASYEraJCKAMKSSMPCCB»RJU^ 

SLHLRLCXILCDRHEYTKALKYTI UERlilEDQCriJatnHRSrALrYEVKKI ISKVAPQK 
ANTLLLKESER 

CPn_0146 182595 183095

No robust homolog present in Genebank/EMBL as of 11/7/98

I IVGISKSSSE\ATTOTVHCLCFXl^XSVVPna^LSDAPinACS ILVLTLCLCXLVCC 

IAXTCWCVPGVII/*»ICAIVl^SlJttSIJWLWCaX^ 

OCPSIUU^SaCUCOCSPRASTPSCUELOArXQAVTOAXOaMSDO 

CPn_0147 183213 183671 

No robust homolog present in Genebank/EMBL as of 11/7/98
HGGPHAVC5IXIAVTCAATSVGCVrCSR£AIPA7NTEEW 
CLGLWLAGCCPiaiAACAITHIJJGVAIJJ^ILrrLR 
SATFPLEXjGVAGEAGRGGGS PLTQUStHSGMSS 



CPn_0148 183122 185702



pkn1-S/T Protein Kinase

CKWWRVSSMESBCDIGAieFWDVRXLYRKGOS^ 

SOPPKEATWJWVKLAJCLOTroiLSIENVSESKSRCnJSrrOEDD 

LraEIVDIVSOlASUJ3YVH5£GtJWEE»ttJW 

ItSOriSDEENP^SKXKEHVtXHTSEXa03GR£XnYAr^^ 

FSDTXYDWFUSSCUCFMESRAJCEUTLXfUaCTlCEELOri^^ 

SSCSlJQAVIjeVG£TXVSHC<3KESAEHL£F^ 

HSFFXI3VHPVTNIX)FIRYI.ErCGSEGDmN^ 

GVTWVGASGYA£WtCKRIJTEA£WCIAA£aaVAAXAYPCCE£XEJCSRAN^ 

YPFNPYGLYI»tAG*l\nrE1^DWYGYDrrTIW 

LRCAKRHRNNPGAVNSTYCFRCAXNIN 

CPn_0149 185706 187700

dnlJ-DNA Ligase

ERFIOCEENSGJu^ YLALCREI£Z)KCYSYYVU1RPKXSDY/EYC 

WSPSTRLQDRPSCrrFSVWKKEPMLSIANSYSKEZl^EFFSHVEKSUn'SPlWWnJCtD 

GIAVAIRY£OR^VOAI^RCWKC<jEI)rTSHIOTIWLPlJC»PEIlAPEFIEVMCVrF^ 

STFOIINEXCWlXmFANFRKAAGGTLKU^POEVA*^ 

NLQiRCLEWCFPVSCKPRIjCSTPEEVISVIJCTIETZRASLPMEIDCAVIKVDSLASOR^ 

ATGKHYRttALAY KYAP EEAETLLED I LVQVCRTCVtTPV AJCLT PVLLSGS LVS RA5LYN E 

DEIHRKOIR IGOT\OTAKGGEVIPKWRVCREXJlPEGSEVi^PEFCPVCH^^ 

VSVRCVNPErVAGAIEKIllFFVGRGAI/IIDKIi^ 

M0IPGXRERSAJWXUSXEOAJCHN^UlRFLVAI£IPLXCXGVATVl>CHFCT^VXSAT 
FEELLSLEC IGEKVAHAI AEYFSDSTHLN EIKKMODLGVCIS PYHKSGSTCFGXAFVTTG 
r-£OHSRU>ArrAIRKCCGKVCSSVSKQTDYVV>OJN^ 
LIHLE 

CPn_0150 187759 192444

CT147 hypothetical protein 

tiY^KmsYNCPYFiSFFVuxwNMASssNNSTOKXPS^^ 

N3LTPEAGTSR SVff SORKHFTXVUlVSI^EMINNOtWCYSRYKT I ILIATLVTVAITCI V 
PISMVTCIPMWVPCLIUFCAO^Ari^RLOSKCKErHUryRAYOIYROOlXSO 
K37XYKYSITWKPKKGFVGIUA/ENIJIP0U4KNK^ 
WVSGVNSVBJORLASLIMSVrafflXUIDVeSR 

DISRDI LAICGYCHNVCVEAKXAI DQYKXWYLNSSTF IAWNPOLPAIAOSYLLEQQRHLD 
YAAKI FODLSALTTAHCTGCALrnf J1ST.T fYYOQLI E3KGVGEKI IASXKOKHLOLAH0D 
::OttEHUCKWSNLYHVFSITIKEFTECKLEONEW 
!T/AT1l3EXKLAOYU£ICOREPFLT«MKAIATC^ 

ERLEXJHLftREOWTiA t UJKNEDEVl>UCSTMEAOt^rKDLVCTWECKYOEFKKNXLSKVL 
V-^cm:^KLLNR(^VU{AES2TODLVLifVDRMSECUCKTI EEIDCNLFOVTPEELSLL 
AHtT/Oi;! *N£tPL t VOECNRL0EAI SnECVSOCLKUUCUlIRDEK INKNX ES5RKNLVA 
I Af'JAH; mitN t ft K.'CLAPL IQRNRACLDN t LONXY LFHCS2 1 RN I HALDTETLVAT3SNM 
F-AMIITPTWN I YTM.UMXtOSKPAPAPKEHPDL^a.PEEVOOAVAEW.CCTHRLHKO 
VI,rr<Ri'MJt JCNM I:'«L0Xu* INKtOtAKAIVLC IVAVLFCVLitA IF ICON IL.1LLILSCVG 
I .LLTOVi 'II. I PUK ( :»KL*KEFEKQVLETA0.1l*irATK : LfSEFNNKOLNRLAr.LOONLNLE 
< :ptrTWAKNIV::OI &UrTKEK::tJCDLTK&FRKD:;nU/JKR IKRRFKBGUJOEAPWRPT 
1 1 "Jf I Hi lAKVFAKU IRELO ILOKQKECI J t RGOAl//*vEPHXt-"LEK JKYWlEKAJtAAAHT 
KKTHKUtUIWUMCNNtTYVR tOrtFFRTL tOEKLCFOTVOE ItJWKEAKELHELAAI IYG 
NT*J X; VKWiAKKOFK fcJJVU 1 [ Af .'KOQI iXLEAYUr/TA^Oi It jCKI ICHQA5FRERI LLN P 
f ■ SAK I k IWWlTIA ! R EWLKTLCUr/LTP FVR Fil.'.TF-TPUrTJYNy 1 LKVREQLFDl EORL 
VtM^V; :m »YAAVOAALAAYVRKl IETTL I VlffYOl/i'AW V JKVTTLMff OLHAVEELV 

K^.vh-n*i«inrt;i v f MinviuvuirjiLftocDs^^r; t idwkklfeu*nw;hnpkdpec 



CXOEOYACMOCXEDUUGOKFCObJICi 
AA 

CPn 0151 :>2,i25 

SScwlsP^K^^vxscENF«xia 

£E2i5fl£^ 

IClJ'XXRPORYXCYKTTnTKLHELXSYLLRIFASncrS 
CPn_0152 19W74 194318 

J^OCDSECVAEEVPI ETYLRDAOT XLETVOEHPOLNAYRI^ ISCFSLjXHI ATttAltlW 
PRO^ICAtSWPIAD3GIlXXELYES^^ 
LIRICDHWANSLmPYIUtOOCIDCrLVSRTCCTm^ 
APXUHXLCQXVSlfFORTL 

CPn_0153 195430 197892

leuS-Leucyl tRNA Synthetase

V^WYKRARCFSvUlPK^rcUAlW 

JSS^tWroawLYtxxaAYMACsuv^ 
L£MTntLimijTCn*viAPEHPtaisxvsEiTO 

TCVPTCNYAKHP ITCWliP'VWX StWVVl^fOTGVVMCrWAKOeROREFADff Sl^XHIVX 

VFYn!cWOTEPFiatLINCXa.VlASSYRXre 
mSjeSKUgGVOPCVLXEEYGAOALIW 

TSStVOOIEI»raVlJWia.VTmXTEHXEKMSl^IPSSFMEF^ 

AVWOEPIAPHISEEl^UWPCIDOAJWPOXDESYLVACrVTF^ 

MCttlKEEvULSRSVVAKYUN^ 

CPn_0154 197174 199202

gseA-KDO Transferase

TOTCIW4lJUWHRIFKCFTriJWVC^ 
ICBIPLVWFHUSVCEVRIXLPVXJXFCEZrTGH^ 
SXIMFSIIIKSVVAiaJIPSLVWSECDCWWiEEAXRICAm 
KflJQuXn^YTSPVlttFUJQDtVOKORrLStXSt PEWaQVTC^IICIY VAAOTALHL« gr 

MaD MJtt J pTCTI^ OT " 5g>0 ^ ni>g> ^ p ^^ ^ 
LlttPVGl*BRGAWrSYVPVWA/DgXCIAICQLWACDLArW r FBPiaCC »gJLgL QCI 
vnirGPWTT^Q ?FT iAQPT T I ^-"yr ^pt t ww njjacgv»JA3fVO>aKVTVK 
ACTASFtJRTWRALKSYIPLYKNS 

CPn_0155 199697 199488

No robust homolog present in Genebank/EMBL as of 11/7/98

NSLSFCVPTX^^XSLXPXEiMRhXtJHKnOiSSSICFSMOTOIRTrTK^^ 

YFLRENINFN 

CPn_0156 200147 199770

No robust homolog present in Genebank/EMBL as of 11/7/98
lX3C0iaiJWfl«WIV^EZPCRSAfTCRTAPTPNKYPIA 
ryFYRAATPOSDHPDOCCriLLiPIJCElJ^FFYCDLRESmTCrn^ 
LFIRDE 

CPn_0157 200753 200298

No robust homolog present in Genebank/EMBL as of 11/7/98
FSFVTYKEAU04lY0FSPCASPWA5U<A0lil5YFCLCXnvrRI I5LRPSCLXLAKXE 
KAWSTAEKX UC ILSFILFPLVLIAIAX R YLLYNKFNXOLDRAVFFI PTEXTKAEELI I A 
KN PALVKEAALTVS PLFY SLPKXYO LKKVCT P 

CPn_0158 201463 200894

No robust homolog present in Genebank/EMBL as of 11/7/98

PPKITLSIN IOtXLEDLDTDS I PWPKLYLSEDFOFAYYPESKAX I DTVAXLEJ04MPGEEF 
Cl^SKKILARYUX0l^tXPCt>lFPTffriDGCRESFLIEFSHETKKPTV*4AFIYFYYYH 
SrCPKLEPtDrKOACCEVKNRUJ^LiC/RPOACAONICRiOCPYCP ICFLIVWEZNYCSV 
UCDHCrXXDN 

CPn_0159 201811 201467

No robust homolog present in Genebank/EMBL as of 11/7/98
CCFQQETATR I F5HTPSCFSLATEOC/0VSTAEKVI K I LAL I FFP 1 1 t*I AJLAIRYFLHRK 
FDRXCFVIPCDTPKEtXLI LAANPQLVEKAAREA/H PGFFALPTKYOSMY IOTSKC 

CPn_0160 203794 202127

pfkA-Fructose-6-P Phosphotransferase

TVELLSLNKSYFEtCR LRYRPEILTiXETIRSKH IOETSSPPiPPPELOKHl PNtTRX PE 
VS IYTEQETSSK VLK ICVLLSGCOAPOOMNW IGL FDAURVFNPKTRLFCF IKCPLJLTR 
CLYXOU3 rSV I YOYYNMCnFOMLSSSPEK I ICTEEOXKN t UTfVKOLICLrxai.1 IGCNNSN 
TOTAHLARTLAHNCKTGV ICVPKT IDTOLKNCWI ETSLCntT:«RTYSeHICNLAKOAX, 
rJUOCYHIIF I R LMO0QA2 YTTLECOLGTLPN I At. I SEU IATRK I ilLKOtSEOULGLVRRY 
K3CWf/rnVL t PEEL IEH I FT7TP.KL I JLAtGOZr. I EK t UK LSPETLKTFHLFPK 
OlAWUU JU»WirWRV::K [ATEEULAVMVKKF.I EK I KMWEF!l:n/r3lFFTTYEARACrP 
aNFTOr^n t ALC I I.IAI^LVROKTrr/M IT INNI AJS^EW>:ATP!.YKPS«LO«CCTE 

•rwrtnTOTPicnpAwtH j w«Dr^::/FJX.vi'Fpnpi/jYFt;KEFi. t noRPtTUJrfOCT 

MJJPPVALYtrrdRKRjti* 

(t»f*pii'rttx1 .icy Iti. «ii-.t »ii.*:;»t l>iuuly> 

MR:?»T3RK0Ei'tL I*A WLtiKIIEOI'Tt«-"W.'n J iWFTTF^;) XJrtTUP/NPPYP IVtLLHC 

!A:.ijrr»iSKR:wvRiAuEt.TnuiiAAU'vi>i.i; anax;B;Kiiin^:i.fKYKu»*t»'E! ieyt 

li:;U Jl ( OOERLA I Fi::;:;U17TLAUyrM'FFNK IKAI JVVWAIT rillKJ-MAAK^NAPEVI
TKV/UITVJOffLNPOrYTOrUCIOlVKCUl *U>P tLYHJCEODliVSrWHTTL
rrCArANCOKPITILTYPOVOHATPFAESSALSOtTOWUCRELTSCC 

CPn_0162 205870 204803

No robust homolog present in Genebank/EMBL as of 11/7/98
fvytlyn:oswt»:mklys issovxrrpwiPoueKvnsru^aNRiicvvsrvicTO 

<v4 t ;r.CTAWYKLm , /PV,XKKLOAMtaNPlXVEAAUnT^ X *SSH 

. . *i -nr. • i jl rrw-r . 'x.\yicpv t-.w .vr-?rrv r r^-J? 

■ ■■'■■M- i;;^:^rr^::KAL:*iim>:pi:r;,~: ^Tr/;v.^ir!T-'.i!rTiwr?vPTRO r » 
pixcbt^FXiuowEi/nmrpicpsocpoNPNTOTO 

CPn_0163 205831 206394

No robust homolog present in Genebank/EMBL as of 11/7/98

roc*mciKacoi:iec:siiHTPTPATPLCTO^ 

1 ireyxju^lvttypkegoiuaspiolr^ 

TYirTFADTSTfIL3IJlSYOWiSPSDnrWCIwrCSIDOPVOAVISn^ 
DPLLCTtffi 

CPn_0164 206444 206998 

No robust homolog present in Genebank/EMBL as of 11/7/98

LCFlKIYIKIIFSmCOlifrRSTICSSOSirSRSl^KWVOTtJCtt^ 

AFLTLIVGGAL1 ALAOOCVLSFPLCLILCSVLVLFSS IYI.VSCCl^FTLKE>ffKrCSVK5 

•KINIWTEKORNKDIEKAI^PDlJTCENiaWVGNRSAIWl^ 

KYTYL 

CPn_0165 206983 207582

No robust homolog present in Genebank/EMBL as of 11/7/98

NVUlWWV^IDKVDPCSEIDtWCVVSCy^ 

RSKVQEOARTVSOEDAPtJrLTRSYTOtnYLTPUtACPRDL 

KHP CYYARI ATWESVCVYMLFPI EWJTIO^YVtXSDy SIOZOPOC^AII^rvyTLDECKDr 
LICHXXrriJLICwjf TLVFCT 

CPn_0166 207594 207962

No robust homolog present in Genebank/EMBL as of 11/7/98

NCUtOYWCSDSIMStSXNRSIWXASTPFPIKI.^^ 

LWLTVAGILPtLPVLILEt XLITVLV1LFCLVLEPYU EKPSKlKELPKVOELSWBrD 
STL 

CPn_0167 208309 207977 

No robust homolog present in Genebank/EMBL as of 11/7/98

NLWSHFPK»TMl*FCWIUJUtPnj«SIl«CI£M^ 
XTVEEI^AIORUTKISWCTLyTmPILPAHKSFSlCSDIGKRrPISPXK 

CPn_0168 208716 208417

No robust homolog present in Genebank/EMBL as of 11/7/98

SYTNUOTUNPmFFNFOTPCYYAiUJUVI^ 
LKSILSTVQ ItiJEIcrCFTOFIATHXDTTriCROCAD I PCS 

CPn_0169 209537 208710

No robust homolog present in Genebank/EMBL as of 11/7/98
SFHIEmCDOMenraBrSOPLVKEUTOPLRHL^ 
TALKCACZLSrU^VUIVlAnnCAtFI^ 
KDLeWEHPELFGtNItt£t*OJR£AMQV^^ 



VSHKDAPLFCPTKSVYRCfffLTPLRAESPRYI INKAI 
CPn_0170 211098 210025

No robust homolog present in Genebank/EMBL as of 11/7/98

NVWawiIRGBCrNTCTVIAFVLSKS^^ 

EAI DQ^PKADlHVKLFCTITPQIJWII^Vra^ 

FRNFQD ICHEKDPDLSVLQYNI LNYXJF11SFDKVKATVOGXRFFPCO XCttEEXJLLLX FNNY 
LOOCLDDTXVYTEVQOttXRX^HVLYPSLPEKHARMXFYOXL^ 
FNKrTAPOINTOEPAOEWQWWCVDSTFPCLFVCIOSACSESAP^ 
DSGFGCEAHAGEGIETRTIFSSAKWPEGLIEIT1OTFSSIJCTO 

CPn_0171 212444 211149

•guaA-GMP Synthase

XIKX^AIWHlJfflFXLOFCSOYT^naJUCOVWaFVYCTO 
SGGPHSVYENKAPHLDPEIYKICIPILAICYGHOUIAM^ 
CELF1WIVDCTSLrJTEIRKSHRntfVTTIPEGFllVIA^SQ«^ 

evsdstptcnk:letfvoe icsapti>nplyxqodlvskiootvi evfdevaqsldvowl 
acgtxysdviessrsgkaseviicshhnvgglpknijc^ 

SSYiLDPJIPFPGPCa.TIRVICElLJEYl>ILRRADLI^^ 

rKSVSVKGDCRSVr^TIALWlVESTOrKrcRVtt 

I5DXPPATXEWE 

CPn_0172 213237 212440

•impD-Inosine 5'-monophosphate dehydrogenase (COOH-terminal region only)

APIGJUkIGICPLOIjRAHHLVEACANVLVIOTAHAHSKCVr0TVLEIKSOFP0XSL\A/CN 
LVTAEAAVSLAEIGVDAVICVCICPGSICTrRIVSCA/CTO 

OCR tIWS(2J\An(Al/>Il\DCVMLGSU>GTDEAPGD I VS IDEKLFXRYRGMCS LCAMKQG 
n ADR YFOTOCOKKLVPOCVECLVAyXCSVWOVLYO I LCC I RSOCYVCAETLXDLKTKAS 
r/RXTESCRAESH IHNIYKVOPTLNY 

CPn_0173 214041 213715

No robust homolog present in Genebank/EMBL as of 11/7/98

T I FDLI7K t DSY KHQQGFMDFSVFPDR FVESTSPS P t ED I DAICTLVSNCCHYCSRCLF I F 
l-LLSI I t'-TlV/CTSCETASLVFCrLSLIVLVtXIIECRNRECCRRIS 

'.Ht.0174 214215 21472-I 

No robust homolog present in Genebank/EMBL as of 11/7/98
K I F [WF7KK tV I L34IKTT rCNSPSPALNPELSLI P PPTLVSfJ ITOT-LAYT I PA0CRRS 
TLft ( ILL [Ft I I t/JLAT 1 1 3TF I V t FFLNGLNLLLT P." 1 1 "-(X I rVCI-LFLCHCLYFM 
: :;UL£VXVOLU;>XLJOAt:En EE£Y XQE I EALRCAPRAE^rTfcL'IPSTWL 

':r*ci_n iv. jm«»h. j 10275 

No robust homolog present in Genebank/EMBL as of 11/7/98
UJUCF^rMPGKEUItCTlODTrT^ 

F I Fra^mcrm^KVA I r r LP^YHPICFYLCF t DROOVWHYEVLOWFLKTVAACI tENS 



FLTOSK3PCLL5EVKEALF — - 
CPtuOP* - jr***"* fi 5^*5>" 

n^dpkSSe^^ 

XSvDOEniYwncsxfl^ 



. . ..^n.vmr." „ 

UaCLWTPCOLF XRJCSCI ftUQKirrj JKYLA . . JM-*> . . , 



CPn_0177 217513 216608

No robust homolog present in Genebank/EMBL as of 11/7/98
WCRTiyrKSKFIFLISEESIOCQPHSLirSSVaCtrT rSJ <i<V-NQK?&*iiHHimiUt 
SHoS^LWPSAFirrTgiFSEEHNDPYWAJCn^ 

SSsCMClttlCUafTtEOFVIAWUlEW 
SiSLFW(^LVWESTOSDSIJ*AFCt^ 

<3 p ■ . 

CPn_0178 218052 217789

No robust homolog present in Genebank/EMBL as of 11/7/98

VKB^U)rLVORJ<VEJU)POTKRHCTVSCKreBESIDAKTTTO 

ES IUCQLLAIjC X ITCYTNREREVWVYLD 

CPn_0179 218550 218056

No robust homolog present in Genebank/EMBL as of 11/7/98
PKIWTTKrmiEATSVPieFWWI^ 
KXMIANVOTlAKTPtlYSCIFESIGFnJlOT 
LPRSLKKXSUCFKSSLirnCLSYAIJJLSAPIOUXCaCPNL 

CPn_0180 218963 218355

No robust homolog present in Genebank/EMBL as of 11/7/98
TSUOaUXlCYKPVFIQrrcVWlTYPWIDl^^ 

cujvYimwYSvrrrcviwYTOLRr^^ 

£jUu3CXWIEVp]rVVGLDIMXU5ia^XJC0P0F0SXT 
LISLILLCKOAVLESFOE3OTSS 

CPn_0181 219175 218777 

No robust homolog present in Genebank/EMBL as of 11/7/98
VHELTKXDCTI/YYFFIOCFHKIJYWYSLNSHHEXP^ 
DDISR£IYC\nuUYXRFWIVSIWLSRIfWJCWIiRYCTLI^^ 
GLXRFSR1JUCSVY 

CPn_0182 220704 219334

accC-Biotin Carboxylase 

RCIMaCVLtANRCEIAVRIIIUlCHDLCt^TVA\n^^ 

SYXJUSNXXJUCEXTGAMVHPGWGFl^ENACT^ 

AXSXJUCIUKCWXICSSGXXECXSIX^IAEXXGFPWXIUVACaQGGM 

IUrSAAlUEAXAGFMn^XE3CniMPPJCLEIOVXOt^^ 

ICrTTSPILNAEIRVKVGJ^VDLAJlSAGYTSVCTmT^^ 

ITlIVICIDLVKEOXKWttf3Oa-PWK0IWXErSGHII0CWN^ 

UFASFSXRVXXOTSGYAIFfWOSKX^ 

IPFWFKtJ)HPKrL£SH1fDINYIt)NIXAOGNSFTlCEF 

CPn_0183 221207 220695

accB-Biotin Carboxyl Carrier Protein

FUUO*DiJC01EKLJUAK3tfO0aVAlIWEG 

QERPIPTDPiq03TX U.n lLKi>LUl 1 1 HSGDFISSFLVCTrYCSPA7PSP3FV1CPGDIV 

CPn_0184 221814 221221

efp-Elongation Factor P

GWKXKFXXCEEKXMVUSOl^VGKFXSTXDCLYKVTS^KVAGra 
WIlTWFKATOEVKEAOFETCriJYLYt^ESYLFl^LGNYC^ PQEIMCEMFLFLXA 
GVWSAKVyWATSVELPHFLEUO^KniFPGOSlil^^ 
ICOVXEIUTRTCEYIORV- _ 

CPn_0185 222457 221765 

rpe/araD-Ribulose-P Epimerase

AEVKKOESVLV^PSDCADLTCLtT^JUCKX^ACSDriHIOIM^ 
INRSTOl^tXVHAMYNPFXFXESFVRSOWXXVHFEASEH^ 
FSPTOIErLPSFUTCW\AOJ<SVYPGFTC«SFLPfn'IEKIA^ 
EVDCOIDC«SAFLCR£UCVu?XL\rrASYXFEAi>SLAH^ 

CPn_0186 222878 224068

•similarity to Cps IncA

PIKDKIU1SSPVNNTPSAPNIPIPAPTTPGIPTTKPRSSFIEKVIIVAICYXLFAIAATSG 
ALGTXLCLSGALTPGIG IALLVI FfVSHVLLCLILXOS ISGGEERRLREEVSRFTf tMQR 
LWXTTTLETEVKOLKAAKOOLTLEX EAFRNENGNUCTTAEDLEEOVSXUEOLEALCRX 
I^LXOANACOAOEXSSELKKUSCWDSXVATEOXMrSXOAI.KNaXGOE^ 
B0 XOALOAEXLGMHNOSTALOKSVENUWttt^ 

KLAOHETSUSORXCAMLAOEONUlEnVTALEXHXOEAOXAESEFIACVROT 
PTTPVVEGDE3QEEDEGGTPPVS0PSSPVDRATCOC0 

CPn_0187 224218 225045

predicted methylase

N^LTYTRTLrMHSKFLSRRXKNS.'JHKECTrWD'TIA.SoYffK fVOOKdHYYMRETILPOLLP 
JLTl£CKSSVl^tCCGCOFLEPALPKECnYLr ( IOI:^^PLIAl-AKKMRrVhCIIOFWA01*S 
KRLEFVEPTLFSHAVA IL5 L0NMEFPCFA t RNTATLLEPLOQ FF I Vl/nmtFR I PRASSW 
I lYOEf IKKA [ SRH IDRYL3PNKI P tMAI I V IOK D" P.'JT LHFI I FP L.TA4F KELCSI F/TLVSGL 
EEW^j(CTrnnKRAKAENL*TRKEFPlJ-*l M t jr. t K I K 

iTU2 iiypothtit w.Ai prntoin 

k-rmr 1 mfrk lfpfiikkktcvkori j<nm :u»ai ior:iFviajoiKA;;KKAi.'vi-'5YYnu. 

•P -VI* t LVFFLftUX'HtLfTNLMWKEWr.l I KFI V/KK P 1 7A I VFAAY1IATV- ^tfO-VUVt^T 

w>T>w;tLHLL^uay;LNkiFPT:a#Tt'u:(j'M.v::Yh-/iTLv::i«iKi ivi>2**iyito 

I MP rV/AXLF^LJI.WALYFt ^rvi-YI.U.YLAI J >HVA I UKT::Al.t :rTLC IG 

vFvKAPr.<;uvs 1 FNN^FTYt^i .valp::fi u j.y i rm r yi i« s ui.tp r lowsf rt
SKMKWUTUCiABRIK

oiU't 22*391 229825 



Cr 1 F tYYLJYOYLTDDT ; ET. 




SIENJC 

CPn_0190 229901 231274

No robust homolog present in Genebank/EMBL as of 11/7/98

EPLSKraETHKKVAT^ 
SKOTLERYVSSOLTI^ 

SSSSSySYEWW 

S P»CSRFSLOUCQTNQniJSPVDrY IVXAAHSCHWSGF 
CPn_0191 232039 231314

glnQ-ABC Amino Acid Transporter ATPase

rEUJOXDIEEVAKNVTOWOOTKORVAIVRSLCMDICH^ 
AO 

CPn_0192 232643 231984

glnP-ABC Amino Acid Transporter Permease

WESAMVUMCYOIlWIYPOVTOItja^ 
VSKELKPHEMYLICASLmifrTSFSC ISRLSEXRRSYDN 

CPn_0193 233144 232686

klhgvfhS^ 

AGEROARYSLPSSTEICTTTWLVIJIXRHNASLIVIRTVP^ 
UACn7rimPIDECRL?LLKVSIANLLOVTLO 

CPn_0194 233162 234241

gcp-O-Sialoglycoprotein Endopeptidase

EVPKTIKCNVFTSWFTMLTtCLCSSCDCrACAIVNBOKO HANI IAS0DIKASY0CVVPE 
IJ^RAHUaFPCVINXALQQANI^EDKD^ 

LICfVNKVEAKtYAAYMAAO^rPALCLWStWHTAAPT XeiPTSYKUOCrRDOAlCCT 
FDKVCRFLCLPY PAGP Li EKLALEGS EDSYP F 5 P AXVPNYDF S FSGLKTAVLYAI KGNNS 
SPRSPAPEISLEXQRDIAASFQKAACTTIAQKLFTI IKETSCRSILIGGCVAtNElTPRSA 
tOTACNLPVYFPPAKl^DNAAKIACIjQGE^ 

CPn_0195 234172 235785

oppA-Oligopeptide Binding Protein

YSWSYMRXrSVCICITILLSLSVyiWXESSHSSTSRCElAINIimEPRSUJPRCW 
LSEISLVKH IYECLV0ENNt^CNIEPAIAEDYSl^SDCU.TYTnClJCSArwS>O)Pl.TAE0 
PI ESWKQVATQEVSG IYAFALNPIKNVRKIOECHLS IDHFCVHSPNESTLWTLESPTSti 
FLKXJJUJVFFPVHIttQOTLOSXSLPIASGArYP^ 
7ITIHFIPDAOT , AAJ<LFTOCK1/*<X3PPMG 
NlNKFPLNNMKUtEAl^SALDKEALVSTirLCRAKTADHLL 

AYAWU^EAliEWITAW)LEHtin»irPVSSSASSLLVOLtREOWKEStCFAIPIVCKE 
FAlXOADLSSGNFStATOCWrAOFADPMATLTirAYPSCVPPYAlNHICDFl^ItONIEOE 
QDWKRSELVSOASLYLETFH 1 1 EP I YHDA/QFAMhOCXI^NLGVSPTCWDPRYAKEN 

CPn_0196 235906 237519

oppA-Oligopeptide Binding Protein

KLKSYSKERSFKLRFFAVF ISTLWLIT3CCSPSOSSKCIFWNMKa4PKSLDPCKTRLIA 

DOTti(PJtt.YECLVEEHSOtCEIKPAlAESYTISEOCTTlYTFKIICNIU«CMCOPLTAODFV 

SSVOCEIUCEIUSSWLYAI^PIKNARArFDDTESPE^ 

Hf^TmFFPVHETLRWSTSFEEKPITCCAFRPVSLTC 

KU\TOFI3NA}/rAAtLFKHKKU»WPP^ 

N t QKKPWNNAK LRKALSLA IDKDMLTAVYXLAEPTDHILHPRLYPCTYPER KRONER I 
U^W^EALDELOMTREDLEKETLTFSTFSFSYCR ICQHLREQWKKVLKFTtPXVGQE 
FFT tOTJin.a>IY:XTm*frAAFlDrMdYLM I FANPOGX SPYI IUJDSHFOTIAIK1TQE 
IIKKHLPJIOLt I EALDYLQ ICH X LEPU'I I rNLR I AU 1KN I KKFNLFVRRTtTOFRF t EKL 

t:n,jii>i .M7si2 .;**Hn: 

• Ml mm*"!'' t'l»* Miiultrut >f «in 

KHYMtK::iJIUCFIJe:KK 1 KV I PKMF: '.RW tTLKLLF ttLTtr:CT:HIUIK&:U IPtHDDPV 
AKlil'hVAKIWMUt.-: ( ACIJ-FCl ILTHFTHRESNDLELA lAtiRYTV: lEOFVlJYTFF tKDSAL 

w.itrrrr it.<:kd t ftriAWF.YAoEnspn to t fot:lnf:;tp:;sna tt 1 1 hdhpnpofpkllafp 

AKAI l-TPfWPKt JWrVTLVEYFPTJIN IHLKKMPW/YDYIICVS INSIKIXI I PDIYTAIH 
U WWyVTJWVt fclWIOi: I I*Wra4IKOiV^*lfYYTYFA/BnAFVflx:LNTKSrot/IDLO'<RltPLA 
IV fliKK : 1 1 KKM4> rroOPAfTL::RiTAl\tPNOYK WKPLTtVKKLVLTYPriD ILRCO* t A 

i: i i.m*KA* : : ij4 . r ui ;i jjYm j*vnkrkvowa t ATtjTT*v.\YYr»iANt.i :;rkoklu)nf 



^S^^TTE^ l^-^' I **** 1 KNAOJCIX7L3 IC AXDClTLVTTLEOPT P 



nrns^LYFEETSPS IHTU-JVXKN&JA I HNAOK ICAKCCUr-VTTLWPTF 

■ ■ • "-^l; s /l^^:^ 



CPn_0199 241018 241983

SJtoala^Swioltyssv^^ 
SSrErTrviw 

DU0SX XDPOXRYAMCKEMCWC 

CPn_0200 241996 242868

g££x^^ ' 

JSSpcsRmcTimjciWAimj«L^^ 

DFtMSTTElSaPWP:iIliI.VirHKCLLPUl^ 
CPn_0201 242810 243715

oppD-Oligopeptide Transport ATPase

SSSl^P^lSSEmAIJ^0AOVlJlIL»IX0(X3^ 
CL 

CPn.0202 2««2 2^4500 

ilSSptwitthmdpkxprawc'^^ 

TTTOCLLDAIPirSLXSTEISPSEXYXLOVASK , 

CPn_0203 244966 245802

No robust homolog present in Genebank/EMBL as of 11/7/98

XVPLPOWOOCETSQWmFSPTITOFSLriLEIUJrOT 

ixSoSc!^^aaIKIX^LLPLVXIA^IlJlYFX4^^ 

SRPOAVnU^VMXfiPAITSIPWCYOMRXITrPICXmAPSILFPICIUI^^ 

NlJUa^fTi^PEDW^ICSIETO 

FNPSGICYFSElSTAKXHrHOKCRVCPIRSSSPXMCtX 

CPn_0204 245C91 246002

No robust homolog present in Genebank/EMBL as of 11/7/98
PRDtAWTTIOIIClf SKDPFSSARSIWANPFFOTHHECN IK IKCMGYOI PTRUtKlfilSFSS 
YlCIKPyPYfyUUXFVYWtSQFlCSAI^DHCILOKQTrrrYT^ 

CPn_0205 246073 246327

No robust homolog present in Genebank/EMBL as of 11/7/98

lEBSIKCYCSASATRNPPOLLLKPTtTOELCXLTVATOP^L^ 

YfUUCOXLOUOfYKSFTILNTSPLCS 

CPn_0206 246346 247161 

CT203 hypothetical protein 

IVDRRSPACYXS XNSOAXCVSLti<DXSHIL£DLAYDEGILPREAXE AAIVKOM 0ITPTl*L 
H X LHDATORVPE XVNOGSYQGH LYAHYUAOFRESRALPLI I KLF AFEDOTPHA IACDVL 
TEOLPRXLASVCNDDSLXXELX ETPX INPYVKAAAISCLV^VCAGKIPROCVIRYrAXL 
UrfRIXKOPSFAWDKLIACICTLYPCEI^PISKATOCLVIJTC 
CSCIHTLCSSTEI.IKOTL£OIE1CHLEDFPIEP 

CPn_0207 247209 248617

ybhI/sodiT1-Oxoglutarate/Malate Translocator

VNKKKRFLSLl,FtTAVl^XWFSPHPASINSNAWLFAlFTTTIMSX IFQPVPK2AIAI X 
GXffrUXTOTLtlXOGLiGFHNPIAWLVFLSFS IAXCIIKTCU3ERIAYPTVSALOJCSPL 
GL5VCLVITDPFLAPA I T ATARAOG t LYPVVTSLS DS FGSSAEKGTODLIGSFl.I KVAY 
OSSVITSAMFtTAMAGNFLVAAl^H\^Sl^WVLWAi<AAI IPCLPSLFLMPI ILYKLYP 
PK ITCCEEAIRSAKLRLKEHCPUUCEEXT IU4IFTLLWLWTFCOLLCXSATTAALIGLS 
LLXLTNILOWOKDi/X ANTTAWETF XWFnALXlOIASFUIOLCF X PLVGDSAAA1VSCLSWK 
IGFPLLTLI YFYSHYLFA^NTAHXGAK^P: FLAVS XSU7TWP I F AA LTLAF ASNLFGCLT 
HYraPAPLYFGSHL\T^VE^SCFAiiIVNIVIWIGIGSU«XALCLI 

CPTi_0208 .NS935 250602 

pfkA-Fructose-6-P Phosphotransferase

SVAV t UOIPLYVDLCT I XiiYSPPLPKETQEAASL I AVI»tyr3H5KPVVPGVrPtfParYH 
LPYUFVOnn«SAnrrPLK\\,WMFS(riPAW*fillNVXOCLFN^ 

l.TffftKS t O ITEEFLJKFRN.-S; f, FNC t f rPCRKK I W PEAK EAirLKTAEALDLOCLV X IGOO 
i *t;ffTATA I LAEYFAKHRnCT;; I 'JTUVrT I O ItiLQl fTFLXiLTFnFDTATK FYSS tISMISR 
ttAL3tltAltYMFlKIJ*W;iA;JII lALB-'Ai/milfl t AL [';EE!ASKM.PlJCTt f HKtCSVIA 

uftAAMEKYYiTViLirFi'i tkk rrc[ iin.tTRi Ku:rrF.OKi:;nt^rEWRLUOTAPi 

r ( I Jf DRUAIWIIVYVin! MTfePU .1 (II.VlWinWYFKIVPFNACUiFLOYBORflOWTK 

ci jiiTY' 7t: :tsi'fr : 1 1 .vi*,ni i j t ret j rr t a :» a< wmfw i.r a t i-wimi^nncoOAOtTrLQ 

IT t FKYLVD «XrrAKRKVKI.YI'F tWAI JSlKTiPM H'Ujt CTIiWlt:UJNFPPLTLLLIO« 

^wiPlvi:IEI^^^TTY 

• :iti_ojm*' .".('('/j .:*.i::7;i 

No robust homolog present in Genebank/EMBL as of 11/7/98

m: mm I hi jcmety; :i*:rr!-:: ^rxm n .y i mkki j*: :y k:;i-v y kthv i a rn w I aybqn t
MLUMTVK liT'/UFPRSLLRTTSLWVRP

02in :52«^ :su4'j 

No robust homolog present in Genebank/EMBL as of 11/7/98
Y0KL><EREP.r^rriRDCO<ATITrMLVELEALJCRErAKlJCD0KPTSreEITSLYCCLDK 
LEFVLLCLOvSK FLKATEDEOVLFTSQKA I DAWIAIATXARDVLGLGC ICA I YQTIEFLO 
AYL3KVNRRAFC IAJE IHFLKTAI ROUiAYYLLCFRWPLCKX EEFVtWSNDCVEIAIOUCL 
TTFEKETKEUJS:LLnEEHAMCKCS [QDLQRKLCC I : rELKOVSLTCrSTTPSOECVOKO 

•. :.;.kvuj.:.. :tt:-kv-:.; -.--"■-r:nKrr-::j-:.rj":;K < ,-TKSL3P.\: 



No robust homolog present in Genebank/EMBL as of 11/7/98

. TMLTOBYllDOOFTCATKWt x-T I IKLJrVRDKIXSLTNP I SE WSETSSSIKDSVUISL 

riLcsxi/xARLtfrruawDPLora^ 

KLVICFCX ' '• : "' ' ' ' ' — 

,-Pn 0324 26)402 2t?)'»74 

No robust homolog present in Genebank/EMBL as of 11/7/98
YTFKNPKKNKKMK FNS 1 1 FLENTKHYPCrFRECFVPOftHSLKEASOWLiSTTITXIRSX:. 

....... «... ■ . - n . •.—,%».— .Trr»' v r* ' 

i>n_U«'^ JnJcMc Jo4i4i 

No robust homolog present in Genebank/EMBL as of 11/7/98

NsmKFUjrricNAXNsomraPNLTn&EFi^ 

USXPIPGLAJ«JVALCI£IVSLIlCIAIANIGTI£X^JttlCC^^ 

OTPTALPWOACEn-EIC^VSATPILLPKJOmEEl^AXV^ 

LXDWUCQEESW»EAIUCKIVAE£A1*SRXRICCC*A^ 

CPn_0226 264545 264967

No robust homolog present in Genebank/EMBL as of 11/7/98

A X FNRJCRHPYYANTLEF XOCTQSI^CPITCYCFVRHKYKCCIXX ECASKDWDFXEPPSTWX 

RTIJAAJPILGSVXGLGJU^SIWSIREFOCSOEYTCSIFWfri^AVLEIl^^ 

lATFXKAMPaJOCVATFLFYS 

CPn_0227 265467 265009 

dsbB-Disulfide Bond Oxidoreductase
lOERFVirVSOCiaJCEIMHINFIRSYALYFAHAISCAC^ 
XCLFTC.TVXLGISAYR£X)SSXiaYXlJaAVb5LCXSIYGVFl4EXPTO 
KIFIJTrVTIP>USVVAFGArvCU.VLT10nrRC 

CPn_0228 266242 265412 

dsbG-Disulfide Bond Chaperone

VKDRADrUILKEXFSCSIUaCENArEFYVFCSX KOLTKSSLRGPLNXICI LVLCTANFFXV 
CFCTLXKXICKrXl«PPKAHXPTNAIQfFPTIGNPYAPXNX7VFEEPSCSACAU ULVf VLL 
K»IYIOTEISFTIJPVCFn«$KPAAOAU£IYHHDPR0 
CSWttTPEVLTlOAKnjCINSGRSVHPKCLEOCIASG^ 
PTAWGDYLIEDPTnfEIERAIOHIROLOAVECOHDO 

CPn_0229 266163 267560 

CT178 hypothetical protein

NSKAFSFUtX E0EN7SFXFKKSAI*SF^fNTANLTXSTFTF I LLUIJUQCEOCXJtFMDKET 
LDJIYRHFRYRFLKLN I LP ATT CZl f f rrSPWTLNYTQVDV I FSDRLCSCLLI FLAIASLT 
|QlSUi«n/iAPU3IW\mXACVACRSPTIFANim.ICFAILJkV^ 
KFSYNPSA0C3tI^VtJUXMa£ARn.TASSLSX 

VLSLACSERRWKTRPK IVI ATALAITCVT ILTLLPI XLKOUtYtXMLCLCLTIEPALAW 
FAYOrrRATUnriSOFLCRIKMaTIUlSFFCSLVirm 



No robust homolog present in Genebank/EMBL as of 11/7/98



v«;:r:;.:\wke:--: 



CPn_0211 252765 252463

No robust homolog present in Genebank/EMBL as of 11/7/98
ECVMSYPDISNVQASS IOSALLHKTSDQIQQKRCFXQSTFVI LAVSLVI IGSLFLLACVA 
I LTVTSKCVLSLVFCVU: r^OiLLAOCVCLL'/EEAKSLZ. 

CPn_0212 254011 252888

No robust homolog present in Genebank/EMBL as of 11/7/98

EI^YGVWSIYSEIISFSELTCCKHSIJTFGPIETASIRIH^^ 

LOCVFLCVTST^rLlXWSMXX^LIXJSiaJU^JCFKm 

:XQH0OQXAQLAREUl^Xia7rXXRCPSARL£7VLEGSICnTO 

AQEONS XLEQCKEALLFRRKSAOEX FTCKLYDRJW^FWRSYR ga^C YSEDT/SiaC Ai^S^ 

Y ICI7VFBCT APHFLKEAYAMCRTAWaJOWKVCVEM 

EIET0U3maJTSDSE3VI£EY0XHCIRVTMLHAIJM 

KAVEDCXETFEIXQHCVVHTICTLELEIAQLtVDIIJ^ 

CPn_0213 254345 254190

No robust homolog present in Genebank/EMBL as of 11/7/98
I LWFSRVI FSNTNQ IC X PRLELILPLWKXENDPFCFLFSKVECTT IILNIK 

CPn_0214 255768 254446

No robust homolog present in Genebank/EMBL as of 11/7/98
FljGLKEXSYERFTYOtf PPAPHPQKTOSKrciASHVS^^ 

TSCCVLLGAALP IIT I C^'/I^VALI VFlOOHJCTRODUTrYI^L^^ KXKXX PND I 

SELRmEJCLQSt^FHTICDFSI^OELOGKFINO^^ 

RNFTTFCEgVHGIQSNIFDLHEIXSSLYT^Tra 

XI KGLFIRI-TSRl^KIJTVKAOERW^ 

ARI^MGRTESXX^»OWEEALI0>QCXI)PEia^Ht^Lr^ 

LISffTVTSCLTXZECXKRKRAA^TCaiUIXVRlCXA 

VIKSLELELXHXIKDXVrEET 

CPn_0215 257039 255759

No robust homolog present in Genebank/EMBL as of 11/7/98

ltsskkovmssaiawctpspspopsstlgvhppkykl 

vnaifsfsvl7^g&ggagvfi^»lijjlgliffvsykr^ 

LRXEXMEVOEWSlirilJEWDFXEWAOKXSOFATPlX^ 

WAIATEXJMrVCFUtrUUOfCimaOCEin^^ 

MEOECYiaaOWEXL£VrPBmttEXXEV^^ 

TVFSEEELO£AUlKAXA£LIJ}XQViaCSVVEIK^CEPTLXOYHL 

TFSSEOEKVZ^EYEAXXARIRJCTlJCNnCLSOVR^ 

AHDDFLD 

CPn_0216 257623 257174 

No robust homolog present in Genebank/EMBL as of 11/7/98
NXAjtneipvnroxQfTOXPEXTreLRXiOTW 

flAAWriTT/yT.T rAT-P7 J^<?rgr tt t t ^rm/pffln gTVTnfr^vT-ror.YyrTTT pctt> 
CKSAISERLSXXEEKLRIAEXFRYSOSVFV 

CPn_0217 257881 258579

ypdP 

PKCCIOJCCffLSVNELlFGFgTFSVVVIXJWFASRG 

WCFEVTSADVYVICIiTCT^AREHYlKNDINDA*^ PSPN 

DSSOEHFIALP^STPRIWASLmiFVOIVDIKlJTMRVFSirXY^ 

LIDTI X FSFLGLYGLVSNLCDVMI FAKLVKC IV ITLAI PTLTVTXAVI-DRRSS 

CPn_0218 259064 258582

No robust homolog present in Genebank/EMBL as of 11/7/98
I FLSXKVFFESYEDFAWASWPKSlJUU*Vr^RYrVt^EUCETPYR I KDFKXTP XHHRLY 
RStJ'IICTlCCIIRLIEAHSCPIHPR^KMir/RFEVljOAVIEILCLCVXILVFOIICCFLA 
FLVAI XL5lXLYCNStrrrV0NtSlTfERKIXCIGEAVNFtA 

CPn_0219 259348 260472

tgt-Queuine tRNA Ribosyl Transferase
OSSLALKmXK0SIOCS0AittC0IETS>sWIOTFAr/PVATKQVLIC^ 
TYHIIIitFGPEAVAKLGGUtOFKIRQAPr ITOSGCFOIFSLAYGSVAEEXKSCGKKKCKS 
SLVKrrDEGAWFXSYRDCRiaFUPELSVOAQKDLCADI I IPLOELLPFHTDO EYFITSC 
SRTTVWEKRSLEYHRXDPRHOSKYCVIKCCtXP^ 

L0EHSEWKITTSFI^KERPVHLLCICOLPSIYAM\^FCID5FBSS^'PTXAARHCLILSK 

acpikicookysoosstiopscscltcl^tsrayu^fkvrepnaaiwas ihnlhkmo 
ovmkexreaxlkde: 

CPn_0220 260660 261236

No robust homolog present in Genebank/EMBL as of 11/7/98
r^SFUCKKCrrYMSKESIRSYSEISTPTPIFRTITSKECVAYKWLRiPAXDCILRNRVS 
LKGALLRJ I PFYCSFLCAKRI HSAWSAW>APCTT!^/HYL\«3CXXUXXC^An^CKV^ 
TAUtFLFSKASSK IK0KKWREKARNLAAKDTVC5 1 KEFCSV0LT3CFTRCFRLRNRWEE 
OAGENOTVREirV 

aTi.0221 261621 2620M 

No robust homolog present in Genebank/EMBL as of 11/7/98

TjA I JtYKYRXC 1 0HVNRYKC.»AEFSA0KYY0ttlL\Tir7/KRNLRnL«\I\'ENEVCLFEOi>IL 

LEiiVMA." triMi*::t LCU»ui5wrrrODPKD£K x ; : r t ntTALC I letu;lci i vllik it 

.'T ILL! LFTPiriXi:YFWrj\AYSDFHP t 
ritual ^,3474 Ji.JM4J 

. ;kkfi.k tirfEKii»KLNAra.TgpEr/nNRwvLMri*: j";np* :mo iakvw::yrcvhea5Lye 

KT* 'FLTL'H'PnKMLIVYP^LVKrJrUJLFLKf'.UlKMI :iPHK r RYFFiX vvvctklorphyhl 



FSAAILFSPVLFHIPVtSPKFI*PI IVICLI LI ILSICKRRJtTKRXL 

CPn_0230 268277 267576 

CT179 hypothetical protein

ttnOULXYM5SQPl.VTTSSSL5RYVVLTCXE3CVACYiaCAm 
IFCFVLCSILLCAPLEriASILYWILPWIiPSILV7VU<VLPLNIYAYSHHXQ^^ 
rTQSNYraiYOTCEXEXXTPKXKAi^YIESOVW 
ESUQKDELXOKALE3lA10»IYMN1QNOREXR£^^ 

CPn_0231 268996 269253

tauB-ABC Transport ATPase (Nitrate/Fe)

K^F^lODPCFSmJQMXIJOeSCO^lV^ASrOKSKTZri XLCSSCVCmLFIlLL 
ACFLPLQECELLWNGS PLNRXIWAY>!OOKEAiJJVRTALKN>fTLSTELC INTSKNALSNE 
RLEEI X KN77JLC0LLDRYPDELSOC0R0R IALAAOCLSLK P I LLLDEPFSSLIft/LLKEOL 



CPn_0232 270134 269232

Similarity to 5'-Methylthioadenosine/S-Adenosylhomocysteine
Nucleosidase

KKFUOlRrLFLILSSLPLVAFSAI^FTILEEDCOSPl^RVSIIFAIJK^PVSrnMCPIP 
WF^KSKXn^RXYYSCOSFCKYFWSAlJ^NXVSSAWACNMILKKRVZ^ 
SR50D5R/CSVLV5KCYXNYT3AX7iniPFTXRFTIPOXKXSVFATSD^REAILJtCCEEFXS 
THKQEX EEIiiCTHCYLKSTTKrEHTLMEGLVATCES FAHSRNYTLS LQKLYPE X HGFD5V 
SCAVSOVCYEYS I PCLGVN ILLPHPLES RSNEDWKHLQS EA5KI YMDTLLXSVLXELCSS 
H 

CPn_0233 270439 270248

No robust homolog present in Genebank/EMBL as of 11/7/98
EKAJtTMFLCKVLLFLLRXSRRSYVQE IGXFFHLETPOLKXVLCAFVSTFIVVSIDVSLXN 
KCOS 

CPn_0234 271246 270548

CTlfll hypothetical protein

FXHX^0SCKKAI^tWSXLAF>1PIPCHC > /EAXSCFLCXVKGWFSKKEX0EEAftILPVKOS 
l^WKRYT7VT5SXFSVEFPCEPOHSCOIVEVPOSEITIRYtn"YVTTmPC»n^ 

ypekvdisrpelnloecfsckmoalpe^vlfmoaroiochioklefwivcedw 
svkhtiyovfwyknknpoalokeyeafsosfkitkireprt: pssvkkkvsl 

CPn_0235 271395 272177

kdsB-deoxyoctulonosic Acid Synthetase 

VF^YLLMKPEE^ECLCirrvLPMlWN*~RYPCKPLAKnCK3LI0RTYTNAS0SSLLDKI 
WATOOOHI I DHVTDFOTYAVKT.'; PT^ ICTERTCEVAR K*/F PKAE 1 1 VN (OGOEPCLNS 
EWDALVOKLRC5PEAELVTM/ALTTW'EEILTEKKVXCVnDr;ECRALYF.TllCPIPFlLK 
KATPV/LH U'V^AFKRE^LFRYLOMn.TrPLCOAEOLEOI^FLEJICOK IKVCIVOAKSPSV 

OYPO*tAKVeOYtTi:u»fc\YF 

t'Cit.l^V. J72LIH ^7 17m; 

i*m; '.TP iVnrtiut.i-i.' 

:;kt i yi impfk\* i flttv wvr::;LCKCLT.\A;:t^L [ lgrq n Lf fvahlk ldpylnvopotwp 

msXI YVTOr»weTOLDLT4IYIIRF::::AAL::i(IL'^AT.Tf Vj lYAnvi KRERfrJOYLCSTVO 
V) MimiEI TOVIUJAAKElKI'MfLI VKiryrri^DrKl-PFLEAlRUFRYaiSEDCLNIH 
KTYVnLOAADEVK.IKrTOII.^^TIJ-; i':t I VUA lUCRSCrpI.TOGVK.'tK tilLFCNVPNR 
AVWVtDVKHTIYGMWMLACCK tAIIF;';FJCl.KtATWFJII^DWKVl.VNOia?W.PItVK I 
* VWWYVOHRnAYIC: 1 FEALTHAAIJ* ( / aiAAC I [ P I OAFX.ENLTMEL.'V :DA*H.VpiX;rC 
mmr.¥ i aaakfcreog r pYmicuacvLwr. *-i>a«AKSi^MDP»wKPrvY



•:pnj)23 



273141 274214 



-:.*r:r — ....xrrf: 



tPfV_0239 2742X0 275838 

gdltarwIpalyhltk^^ 
wEortxw^YHRSEr»c#crrsua3siJ3JLn^^ 

LLTMEpmroADEiwtacmLORisprsE^rvRoavtacrvQc^^ 

KDSRVETYVAIJnVIWPRWLCVTryiA^CKRI>KJCSTOIS^ i nOCSPtt CJAAfiEC^C 
PIENIXLIIRIOPDEGVAIJCrNCIWPGTNNIVK 

ricrortrTODEVM^swKi^TP^ 

RPL 

CPn_0239 275863 276672 

devB-Glucose-6-P Dehydrogenase (DevB family)
it5ISKTNKIETHATLIN^^ 

SSSCTDIWUtlW 

RSPLrwXISPESYCXADFENXSSXYXMDXL 

CPn_0240 277861 276698

No robust homolog present in Genebank/EMBL as of 11/7/98
unrrHVFSPSseswKANSvvRSN^^ 

RPTDWefTOFKAAONI«aJWroiLIHCrSO^^ 

LC PTLC ALVYCAYKVYTLGXMTYS LNKAKAKVLRH PACNVTHRAAGVAT IRSSB^VXAC 

XXYKSAMXGSLWLIASLALIALTAGIVLVlTFVAK^Ar^ 

SUJCUnAIVraCAKXOttCVGKLTW^ 

YGHLFSNEEVAOLVOCGAPGGGSRPSQHYOCSW 

FtlACTFTAFTNPSAPPPFWAYBTIYG 

CPn_0241 279372 278203

No robust homolog present in Genebank/EMBL as of 11/7/98

irLVKFMSAttSLSSSKEASIASWtWRDW^ 

TAIIODIJaSWVETECEHHFOVVSNISlJWXYORrraira 

LXTOirJUVlJTVLAIVroirfWILCYSAYKIYOLTIW 

HRSGAVAAAAASQSTITUCiCvTWJSTLIFr^^ 

VKrW^IOCCAAOGTCIlXOTGTlJASWSVOKSOE^ 

LPITPGTKKVLTOS IJUWQOFFSW3EYRDXESIVP12niOTTPPPSYETl*FHEEtMDCSSN 
VIPReSPPAYSriDSSNSPFPSSSPPPYYF 

CPn_0242 27997$ 279487

No robust homolog present in Genebank/EMBL as of 11/7/98
KSLWCSLYOrSOKPTWLMACSirnWSQC0VTOEI«K)CrAa.VVm«^ 
WSVVLLIUCVLFUJ^rPFKIXSASSALICERVSLCSHrK^ 
ICTAX HGF I XSKRTSEDARLSSAIVrMOAPI LQ1ACMSGLIKP 

CPn_0243 280609 280133

No robust homolog present in Genebank/EMBL as of 11/7/98
INYKYLVFLLKTVKCR I IMACS ICYKlJQ<ANEPI»WASKVALVADIIiYPnU\^CAW 
FAVlJWVKLLrt^IKrLVNrCIAACKSRPLPSCKENTOCt^PXDKPCPSIW^XLVLIP 
I IGTLIYST I ITVOSOTNHLRYF I ISPAYCVCSTAI INW 

CPn_0244 280906 281556

adk-Adenylate Kinase

GAFEVTKCSV7I rMCPPGSCICTOSCm-AWlICLPHISTCDLLRAI IREGTPJCLKAKAY 
LPKCAFVPSDFVWEILJCEXUJSQACSKOCIIDCrPRTIJOAHUJSF^ 
EISEOEILXRVCSRFLCPSCSRIYNTSOCKTECPDCKVPLIRRSDDTPEI IKERLTXYOE 
RTAPVIAYYDSLCKLCRVSSENKEDLVFEDXIJCCIYK 

CPn_0245 281627 282499

ydhO-Polysaccharide Hydrolase-Invasin Repeat Family

TCOKE X MKHYWFSPSAI3FFSK0CAI ET0VI^ERVLVKGSTCYAYS01»FKKE LLWKPY P 

CHSFRSTLVPCTPEFH IHPNVSWSVUAFtOPVCI PLPFCTLLHVNSONTVIFPKOILNH 

fflmWGS(nTOCDPRHLRRLNWFAEU.lKOADl^ 

GFINlLYOAOCYNWRNAAIXXfAlCHWISSFENlJSGCLIFLYPKEEKRISHVMlJtOIJSS 
Tt. t HASCOGXKVEYF t t»EODGKFlI)STYLFFTOINQRORAFFG I PRKRKAFX.

CPn_0246 282955 282551

rs9-S9 Ribosomal Protein 

WAKSTIOESVATGRRKOAVSSVRLRPGSCXXDVNGK^ 

eOOSOYOLIIRVSCOCIOOOVIATTuXUkRALtXENEENROOLKSOT 

YCHKKARKSFOFSKR 

CPn_0247 283430 282969

rl13-L13 Ribosomal Protein

[^YnMfcJCRKOTKTTIVXSSFTTKSVYWDAAC 

W.;W VI V tNAEXVRLTnAKXCOK I YRYYTCY t SGM& E t PFENKHARK PNY 1 1 EHA I KCMM 
I 'KTHIiJKKCUK.'XR I VKODJYETFESOKPI LCD I 

'TnJI34»* i«4451 JSICSO 

yiWyUiA AW Transporter ATP.it:** 

l«: : I 't • TYI V: :« Vfc YATF. IFRSPAi :KK;:RKMA(?LRflFK.'?FLAK*X-L t EAKNtr-TCT IOOON 
I : ; 1 1 .Tl A/: : I .: :i J IAf a T I ^ tT> u\: * 'J* aiTTt JJIU/TTLOV r.'XUlfJLRFFtiKCCKNQOLA 
NKktU 1 1 • *.l *V PjNKYI .UOI7IVLXNVI «PAL I ARKI 1 t '.X* I'S PVYTRALELLDLVHLEDXV 
M*vt - :K I . a > :W y«VA I ARA L I NETA I U>Cf.r*JQ ILDEET: >EQ 1 1 INtXLEOASALCC t L 
! vti WK li I a :v < :: : it u wu W.K LFPtW;: 



•TIM iivt-' ( w.il pcinun 



BBBBSBBti 

vv.* •»:,:. rAPP.*^»*!tr.r"T* *■" " 

rl33-L33 Ribosomal Protein

iiisMAWREXIWJ^ESSOHYWrmnCRI^^ 

CPn_0251 286036 287559

^p^pfI^^isf^^ 
i^SwcviBEVKtcxiijacYEsnrcxii^^ 

^SSSFXCaCWlI^IFLYSOtAWAAZWOOIPEXVXEW^ 

ITCTACYOLVORVviatSONOXLIXrSICA 

CPn_0252 288112 287576

CT144 hypothetical protein (frame-shift with 0253?)

^gJStSSoSSxTR^ 
JJ^JSSyYRWHOWLCSW 

CPn_0253 288474 287950

cSA hypothetical protein (frame-shift with 0253?)

rCOCWJfiSlTrrQXITXSXPTFVRFNIESINLTO 

OTOOYVPYSYWTWie^W 

CPn_0254 289268 288459

Ni^xwNUcsi^rcviwFPrpsNrvrvp 

CPn_0255 290183 289329

VSAmTOTTSffXWNraOT^ 
SuOTDrKSQ^P^EWYOSW^ 

^WPWDPMASDRYIUCITYLSNtSCS^ 

CPn_0256 291282 290398

ESsRiiisNrT^^ 

CC^^LTIOiaJINIRITSTTJSMVFtXniUa^ 

5SSlvtSSSkvpio^ 

TTSRCSSYYCTRPLOVVCVTYY 

CPn_0257 292136 291267

51!aiu^xan^ IKDQWLflflOATLNVDCNVDI ENFLCTRSUCV 

ADTrTSPCETTVOCCI^AESSOFTCATTl^^ 

YMyYIWTTCOALNLYTYYSSSOPTTVCKP I ETVCONPNPETYR ISASAKIYBAVTRFFY X 

OFwSciYOVTIO^RRI^ 
TGTmTEIVATPPHDYPWUTXmCIJI^ 

CPn_0258 292534 292133

CT142 hypothetical protein (frame-shift with 0259?)

CFSFCRLCSKFEK ITLCCNTAlOt-LAACTY I LTFTICKRVCWNNChOCSXRLFUJIY 

GfnOjCCSTVYSCCCTSTKrn^AVYRW^ 

NYSTTLLYFACDKV 

CPn_0259 293031 292441

CT142 hypothetical protein (frame-shift with 0259?)

lYFVFKRKl^llYF XEKX1TWM0ONNECYFKl4)STVtX3OL LASN IQTFDKOAKC ISSTETF 

3VMKATFKEKVS ATCLTSASTYKLN ATCPAPSS IT XOMKNNRLSNPALPKNPCDPVPAN 

YVRSPQYFFCAKPIECTFMFCGSSRYLPriCDCSfrfTLYOSSKAflDVFRFVn^^ 

KLCCT0PYNFLLOEPIS 

CPn_0260 294090 293548 

secA-Protein Translocase Subunit

AYtDFSKRSCVEEDHVSKK tNRia)trPCCSNKKYKC^CLKKEEQTARYTTlCKFKFSAEV 
LSASCOCCACONCTXLFgRLSOSLTSEOXAAVCXFHOrrniKEVHSKKAL^^ 
VTEXLOOKNFE I LNTCENLAP PKESTATLN0OTNFVC EOF I PTOED FR t SENSQXP PVCE 
0 

CPn_n2f.l 2 2^0 U 

vfJ.y)-P^*Liiot* .lutMrt.uni lv ATP^St? 

YSFNHPF IVFM:7TLLUIPPWMKAf ;KR I E." LV'RKALYTIfTMtANI I KK IWAI^rWCDSLTL 
IXHLKA t .TWRFf'nLDU IAVN fXIKV ry.^ACVMK PY LTH t 'IDQtf. I PPRT I PSPYAPET P 
ErrW-T^ARRRLLFOAAKE lilA^A IA f* 3IIIR PQLV9TAI1 JtLV IKAKFAI WUVUWIIF 
rr/T t LRPL I FTFF.FV IRKFAK VXf »FAfcWRCI W.:t.li.*:r AW.'.I JCU A*KVFPLA«IINIA 
LAIOQCT^^KI 

I. [FN INKEVKVf LVLMNXR L.K t ( LTNMS ; ! T AK> Si: U rt*V*IAtiKAN [ittifYI AATOAAj: : 
• TK. WAI.:LN0WCASPVAYPJPVKEAWAW5P: . .tLSLRTLTESVSroLVISGIMCC
tot I' -WW** IGAAKQALVOC tPCMALSQONHtSrrCOOICAPEXLKALVrYLLSOPrP 

-EEUSCMLEMI XSV3P tF.*^PXCUfnXEFOrr0eilF1<ASLLS5CLrrKIF 

CPn.OSiJ 276174 297136 

yrrtu hypotnetic-ii protein 

:?rAL^RKLRVT(PP5LAJC/AFP<3FRMSHGPRPTKF1!FPLYF 

••■ : ::■ :^"'n,;tt^:.; r:v.T..">:^rr.T^r-. :"r "/:y M T-.v 

•■ ;,:.:t.v:.i '*fr:i r-/rr^; , «-.vi^--".'r. : •„% : : r.*v ;piKv::T0f7r£::.'; 

: 1 1 NKKKCYTVCQ I ILFVNFT I FAL3G rVYKNWHTA FVS FLTYC I ATKVHOKVILGLEET 
KSVTI rTSSPRXLtXHILKETWIO^TYIKAECCTSCEPlU^ 
EDPSAFIAIENLHEVINCRRT 

CPn_0264 297730 297155 

ubiD-Phenylacrylate Decarboxylase 

MKRYWUI SCASGVILAVKLX KELVNAJCKOVEVI I S PSC WOLYYELCCOSFBALFStrN 
LEYZKTK5X0AIESSLASCSCPVEATI X IPCSKTTVAAISIGIJUKLLJUIVAIJVA1JCERR 
PULVFRCTPUrrXHtAnXKLSXSGATXFFFIfFMW 
PSDLTKQWSNPE 

CPn_0265 298632 297730

ubiA-Benzoate Octaphenyltransferase

KIirVRUVTFl^VNmSIFSIIJT^ASTWAl^INEISONl^FKECFRISVFGAIA^ 
FARTTG IWNCC IDRF IDKKIWirrSKI^PAW.VSUff AWXT-SLFCSFLFLFLCKILRI F 
3I£IASL7LMIWPYMKRVTFTCHWGX^VY7^WLM^ 

SVCMVI AAND IIYAI EOTEFDREBSLJlSVPAHYGEXKA\^IAJnrtiU<VSYl-AY I FSGFVG 

SUHCEPYCTMXFLWILKVVRMrSNYSiaCI^ 
R 

CPn_0266 299181 299876

No robust homolog present in Genebank/EMBL as of 11/7/98
IMAI£EXWWPSQQXASSTSCrrSKXNODR)CrFACTv^ 
Un<SVPt^IL^AVTVGAVIJITGLTILVRXS^ 
SKFSVTCSTTSXVLCKAl^IGAWSVTIT/^ 

LADQZCSCSADSOSNIVGXC£PKAA0C0KWyi(MA\A/nSEDCIPTAXRLTPQC 

CPn_0267 300122 300910

No robust homolog present in Genebank/EMBL as of 11/7/98
VS rMSUaCTNALLNOPEPAVCUJAWDPKY INQDRKTTACTVTLLV1ATLMILTTCVTVLL 
AMt^LSV^VSTZXCrrSVTTLCTALrXXCLVXX^XKXSlJUaOYQ 
IPIWDMVHXIJSCLPSWieSPSPEASTPVSlO^ 

TT^fWlJUrVGFAajCTALF^3CLAC2JmiSI^A0C IKVLYLTYYLSSALEERNETVKDO 
RNEU7TYLTCECR00KREKALLE 

CPn_0268 300914 301318

No robust homolog present in Genebank/EMBL as of 11/7/98
IWAUI2(S0C0SSSTSTWEI*«SPVTNWW 
. NAOAOTKTTSTPWENCIDOLNPUi»PNEae^ 
OEEEQCSQNNEDLXG 

CPn_0269 302468 301476 

Dipeptidase 

VAFRCVMTXIMtOTXSHPHFCRlODPAVRCSPEOtXSOm^^ 
QNSW5LP>CrPDXGU£y£££DG5SS0mL^XISXENASALCDCTAKmX^ 

EDXLDmDXLPNIAVXASKSNFRSVUMUQCV^^ 

LCDLEKHVLHAENLG X LSS IVLCSDFFYANi23ZNFFTNECSSA£AH PVLNQL I HRI FSKG 
KAESILSSRAEXFLIC0VIVEOVNPKITDVKL 

CPn_0270 303343 302468 

ywlC-SUA5 Superfamily-related Protein
SXFCTIVPOXICAOXTFSLPCVMSAXHOGKWAIJ'rtmOT 

EPSKAFALYVNS I ED I ENISGY PLS PTAJUOAOIJ'PGAITXVVKHRNPRFPKETLAFRIV 
DHSVVRCXVDHCCTLXCTSANL5EFTSALTA0EIFADFAI3KDLC XFWPCSHCt£5TVVA 
SOPLYIYRECLXSP^SaEmACTEAXXFHRTSHAFSl«XICXriV^E0LV5FL5CSLDF 
KGWEHPKWWTYTRXJlEAUaaffPSIVriYD^ 

CPn_0271 303628 304362

Lysophospholipase Esterase

KLMTWSFFRRKXCNIEAI ECPGNPQDPI I ILCKCYCSLAONLTFFPSICSFSKLJlPTWt 
FPNCIUTJ^FRGSRACFPUMXLOTI£R1YAN3V^ 

EUUOffiPYNEIIIGCFSOCAILATKLVLTSONPYAGAXIFAGAJlUTfOCWEBCUOOCA 

OWFLQSKGYEOEILPyHLGAHLNDLUJICUm 

OPARfl 

CPn_0272 305272 304340 

dnaX-DNA Pol III Gamma and Tau
FNRQSMrMTWVKHLEEENQCafEALLRlW 

LLSSSPGSEHKVSQK I HPO X YOFFPESKCRIJtSI OLPRGXKKQ ZY X SPFEANYK X Y X X HE 
ADRKTLAAX SAFUCVFEEPPKHAV X X LTTAKVQRLPKT X XSR5LS X F I ERCEK X LCSXET 
FSYLFRYACCEI PVTEVS0 X IKESSETOKtTVLPJJKVTORFMEVlXELYTlORYTLNLCLICAS 
ALfOTDWEXUXJlXPlXICVLUVESACPSLNNSSSAA^^ 
V3VSPG0CL5N 

CPn_0273 305853 305227

tmk-Thymidylate Kinase

• IZ rvrr/IE^ECOnC5Sl>KAUTOLVA0DRKVU.TREP0nCI.ICE^UUX.ILEPPHLX 
LJftCCELFLFLCSRAOHIOEVX I PALRDGY X V I CERFHDST I T/OG I AECLGAEFVADLC 
wK'/WPTPFLPNFVtLU>IPAOlCWRKHROKVFOKFDtKPI^/)«RIRECFLSX-ASADP 
::RYLVLIW*E:;LA:;LIOKVMLHTOLaLCT 

■'li,J)274 lOtflfifl 10m2 

gyrA-DNA Gyrase Subunit A

^rrt IWIIKDni IVPKNt.EEEMKE^YLRY'^tWI I SPALPO [ROCUtK l':X)RRVLYAKKOL 
: :UU * iAHlRIOTAK fJlOTSCDYH PI K*ESVtY PTLVPKAONWAMRYP LVDCOTNFGS IDCD 
1 1 *AAHR YTEAkLTI V JVMYLMEOLOKDTVDI VPNYOETFHEPWFPKKFPNLLOC5SR I A 
V JMATTI I l«HfNU;fcIL I E*TU.LA*WPOASVOEILQVMFOPOFPTnC £ {CGCEGIRSAYTT 
< W »C I K7P A(4 U IVKOfEOKHREfl 1 1 XTEMPYNVNKflPLI EQ t AHLVNEKTLAC I SOVROE 
::UKTiairWUEIKW;ES:;C I UNRLYKI^MJ\n , FrWWLAL0KNLPRTK5lHR>(X5AWl 
HIIUPEV 1 HPfcTRYEUfKAETRAHVLECYIJtALiSCLOALVKT t RE3CNXEHAKERX t ESFC 



PTEPOALAXLCJIL'rtLrrL • £*KEYREi*LNK tAYYW-SCECL-.TS: rRNtLCCL 
UCHHKVARRTTlEfTACOIRDrCDI ITNEJVt rTISCODYVXRMPVKVFKEORROCHCVT 

PGmJUILMIKKFDWACrtTlJfflCW)^^ 

HIVSOEOCvHlJTHLCWVRFPHEXVllPlf^ XVT PCSV 
LXVCOOCFCKR3 LVEDF* rTTmGCVCVRS I L rNERNCWLCA I PVTPW DSIL UCSgOQA 
XR I>«ODVP'/>CRJT-yT/P LWUCECCALVSH EXl^SNISJCCT/^SCSErECSCrVSLR 

kmdpkeknymAJa i- -u^l^vrct wKYiiam; iTJUutL-.wvu^i :dewum4w 

RIDVRXLtCGG IV rVDfCRC I P I EWgRESAXQGarVSAIJVVLTVU UOaCf LUi JSTyV 
SCCLICJVt^rSCVNAIJEXLVATVroiaCCTOKErSRC IPVTPLQWSVraWOTXXVfTP 

DPKzrsTCTrws ixjoniMnjfuncrtTvrwcainvs i umi f kujuiosfvitlw 

OfJCCLFSEPXYlLUlHV\,iJU J EXEFEAALOWWSCYSlX\nfSTAW 

TAXVniVXinYXWWNtJUa#BaALTC£X3IMa 

SVAQCWCSAtTIfTroTOIARMXVDKVFVAAOAi^^ 

ICTIXAAli^:CAEWFItt«S10^RYllRlIXKn>AlNDC^ 
WIMPPLYKVSX10XDF1tYXLSe>3Mt5YlIiCLCnm 

XUVPF1WTUKKAI PFS EFXJMYKECXgYPLYYl^ATOMQOCRYIJf SD CIKTT AI^ O 
EETWCnCIIttYKVAVFVOIONOLKCYGXJISSYLIPOIWaVICKlM 
EVINYIJOAAUCCXEXORrYCt£D0MD0XJ<X7TTTWEr^^ 
MCTEVPPRJl£FIESHALSIRIJWU)I 

CPn_0276 311140 310793

CT191 hypothetical protein 
DMFXJOtKHtGCSOVQNXKrASPIKHAJQfYXJQ 
ICa(SOAIGF^ILL\^VYNSSLYAUJCOTPO^IKSLYOVAS)^ 

CPn_0277 312003 311404

No robust homolog present in Genebank/EMBL as of 11/7/98

NXSirwmEacE^iiotfxJWFraixjiixNTOtfA^ 

MIVICCSPJSSTYArr)IErVAOEJaJDCSTHSKSST^VNL«AH^ 
OTALVLOVaiKLIKCUOVlO/CtFta^WVVAIICAIAir^^ 
PXCVKTLI SPDXPYPTWYV 

CPn_0278 1UB84 312060

Conserved outer membrane lipoprotein

RDSHPOQ^LLVaJFVISSCHKniAQrfXXRIVASPTPKAn T F^TOTTAKPIjSXKlXXL 

PVtXTHlIPNRI-UJHCOVDAKYlW^F^ 

5LEMJCSOnaTIMPvt3RI1IAORAlJ<XXEEOCC^^ 

LI^AJLiVCSLPDVDAAVIPGNFAIAA^PKXIBLJ^U^^ 

P104XKlJ01O^SPSVQHFTtmcmMXI3Cr00NS 

CPn_0279 313546 312875

Possible ABC Transporter Permease Protein

WXlOSDUOXLLKETV^YMVCTAriTSCAXGGKX^ 

MILSFLTAX PFAXXIV^IJTITRWIVCTSLCPTAS XVPLTICAXPPWTIVVX»nO«Al. 

NYtXSAVAIAXPKmXX^XLUESmUFSIJCSLVWXSCSTTJICr^^ 

OYCrWWSVTTSVLVXTtVUESVTlItCTF^^ 

CPn_0280 314593 313550

dppf-Dipeptide Transporter ATPase 
IRGEAWLVSEBHSPIISVOWSKKLGOl^ 
L&riXKPTSCSXSVAGFXMStJTCIxrSRRMFSiaCVAYXSOW 
KHSDtSKSEVlXCVYtm^Ft^YTmMIUYPQJLSGCOKO 

TSAtDPKSTEWX XFRf JQT iNQERC ITXVLVS HEX DWXX I CSHVLVKHQCAVIIlj3TTEE 
IJl^ENSXTNn^TlEOXinAAX^SSCYTAfnCZV^JUJIFSKXLAXOCI XSlCVXTTaLVS 
INXXAMXNX^KSPKIXXXVlJStVtaiUOXAXmjEX^ 

CPn_0281 315033 316103

dhnA-Predicted 1,6-Fructose diphosphate Aldolase (dehydrin family)

XSXJUWTLMl^XKDXLCNDnENlI^OCimXT^ 

LQTKF^RlJkNSCYI^rLPVDCCIE^SACASFAXWPIYro 

YCTL^I^RKYAHKIPrKIJCIJWhniXSYPTlCYKO 

rrSNEEIVAVSNAFAKARSLCl^TVLVOflJUffArVAfCVDYHTA 

ADIWOXLPTCOCCFICAINFCXTDERVYSEI^SNHPIDLCRYCVIJ^^ 

SCXKDrrCAARrAVXNKPJUX9CLXX£RKAFQRPUEC 

CPn_0282 316084 317529

xasA/gadC-Amino Acid Transporter

ILIl£5LOTSXXVm<SHSKPTKPLCTrTVCWLSL*^ 

AVICFHX PYAX,ISAElJkSFKPOGIYIWAR£lAlJCXV^FFAIW>O ll 'FKWflVYPAVXJkFXA 

ST IVYK XNPELAHNK\*Y XATVXLACFWILTFFNFtXIITSSAtJ'SS ICVX XCTUPCVILV 

SLAWXF55NPXAX5L»IO<lXPNF5NVSSL\nXACM[XALCCL£A^ 

NY PKAVFICAIATLT: UXC5LSIAIVI PK£EISLVSGL7KTFTUTDICY?n^*mJIW 

VKTIAC."LCELNA*#4FACTKGLFISTCt4I)ClJPXFKKVtI5KI^^ 

LFLCLDSA^LVYWILTAl^VOHYlJ^XCUXACPIlAXKEPRAORtYSVPCK 

a X LCXl^AFAtWSFLPPltELAOISECSiaCYTrFLLLAfSLNCLIPFC tYFTKKRLSK 

K5 

CPn_0283 318581 317532

No robust homolog present in Genebank/EMBL as of 11/7/98
URRLCYFQDL XKNAVAK I XSFRXSPPNPVKLLX KFAKFJiUWSS IAPLYEVLLEILLAPC 
EEIIXWSLOP«WLKtfMU)PKKH5TLCIEIS3CTAETI ESCSLCLIS WLLLSCLCLUS 
!JHDR30AVKI X00FCrOFS5EEV0NFVEORNI LTPFUIHLFECOCVALUCXJXRLOLIV 
PNALYPEPPP3CVV; J I NSEtVAK DAEOQQEOFNKTX EACK ECLKK LVLPALC tTC I PQLL 
RARRFKQGAE t LMA t PRKKMKCJN PF t FLEALLE-^EEF" t .T/^KYUCUJtr II ItMDKLUIA 
I YI/5YF7TJL ICO "F. [ ETFrRRANIHPEAFQAA I00OHX^FLFM(MU-P 

CPflJI^HA U •IDS* UMSM 

No robust homolog present in Genebank/EMBL as of 11/7/98

KLI MM f T*A IVV W 1 1 M* CWNMT^r/' 1 1 »t«Kl IT *LT< [' t T*/t. I LA 1 I ATtK' VI iYF» M I |:J 

vi rrFVf/;MLi ru^A aTayi jywts; I forrrvF.': iT::r: wff::pkn j i. meed;: 
v:;AiOEi*UNFT»At«»^RnwiT^::HFiflE;/wft*Fj:i'KEnf:HT::K 1 1. 

oitiJwh'i i.iiM.di » i -ici*. t 

No robust homolog present in Genebank/EMBL as of 11/7/98
KKl>TiLFFFt ANKrrrA: ti iM • i y h Ft v>: :f: :t •: ; f/r I u ;i .i a r ; n/i J tU * twfa( m i 'ir/i , 
AAPUILLVW^AA3VCSK<AIV3LMCLlfKDCKPi JNEEXIDPTIOXEIICDPESXJCPV
PVEC05LPKERKTVSFWUC IPS tVEDDFKPYVIGSTFYHONXVYSKP IAERMQSLEXEIT 
Tl,IVWPIttLEE33XS3KLLRGVtSEIIlNt^LPRF 
SSDU.ILLLTKPEPll»rnWl.IAHLKSLrTTICWCLTPKHOtt 
tEKmYDPt^TOEXJWHlXAOnVOPtLSrOSSOCRErRALrPSmOElJ 

vvpAiNssrwvTjmJLSvxjociijcjaArcExrpsPSSWNr^ 

SNQ05V I tONPFUtlELLHCMPKCCTfUCCLLCKAII PMSNWAALfRPKtJCMl£SC IARK 
KELIt I 1 AEHt/WPFKETTQA IASCKI LDLLLOHL FDF 

mgtE-Mg++ Transporter (CBS Domain)

SCR£SKCKD<WB3WWEElCLOTAJ^SOII>a5SRTSHI^ 
OtiXmEYNPXDUYAVSCXJ>SESJUILYIWI^rc 

sevcALZZ9QMpPD»vwwxpcmiYnizixuosiacAijaR^^ 

FFArLHETnnaWSACXRSMCII^TTlL^^ IHPPOCSLieOtM 
NQI DOCVWa^TOE£VVDLVEItYKIAAl«PVVDEgff l^CAITY 

I*»tSO*WCVOCOTLVRSMATCn^FCRRRETinCM 

IHSHI tPPL IAGCXKFUTN 

CPn_0287 324230 322089

No robust homolog present in Genebank/EMBL as of 11/7/98

IWCMXRSFLPFISSIWALKKLStOrcFSCPEX^^ 

LLWfTNNHPKVVKRVirfCVSYCIJCHKSMSIFID^TYI 

TCINTELYSQTGOOCFliEVVDNPIlLIEOLlJtMHPOLIWRI/WEHFW 

ASVYOAVCRSFIELYHKHLEX^DLACaOCCLAXJu^^ 

KSU^DUattHFSXXIFI^FSRDGDTtAYQNyRYSYALASVKI^ 

YQTVOAFPNt£GL»fVUG£LLXRSGWUl^^ 

C I AX LCLYLEEPNLFKDSRHRL I SAMRTTPGNSALVHALGVVOLCSALYTTJEIJSHFASAI 

SCFQSCLEWDLDATGWCKlJOAYTSWGIiaQC^^ 

RGXJtfJtCLAgATXPEAYKgirt^ESIJJIY^RA^ 

HYDEAYTLLTKVDLTLSSSRVKL ILAAVLiGKCRLLODTDPAZEART ILJEPLVEVYllXJE 

NfXLLLOCVH^LFWKMQWClJGXLA^ 

KVIRSAOYCVRITIJUCWLNDPYIJWLR^ 

CPn_0288 325785 324571

CT288 hypothetical protein

XSmREPlTOFECRAKFYNVIKSCrNLTSTNESIJlPISPItAOT^ 
HRSDTI^\/SVCKVNXYIW<UVPXT^ 

XLXPTXLLTGGHYILHRUnaCVD^SCnCXPPFSRKCM^XSSSfm^ 
LDISnjAtt^IAAVYOCPPlAFW^FlCFGIPCAMP^ 

RM lYEHrFCKHUEDDRTX YKCVARFH7R ST A A FLKAP FYASAQgGAFYSLLDPLAGRV 

LHGSWERDWHDWVIIJU^SXJWEAHSXJRT^^ 

KCEXVSOUfPSXQLPElUtCIXTSGRyFHXSVXPOSGNDSAlQlFXV 

CPn_0289 325797 326996

CT289 hypothetical protein 
KFNRXl«X0R5KyiQOQaXl4XSXLVCl£LSS^ 

VTCAlJEISTITSIONFWrKVTUOTULYYTliTTVIAAS XCX a ££FFLXJlP0MXTQOA£ArT 

TXCHP&GYLDVLSITO»ENIFTCPniOGNVISJU^ 

FTSIFUOAI^LmJ»IAHU3rSVIUTCnJCDOSNLTW 

ILIXXWKVSPIJCVXICAMSPALVTAFTSKSSAATLPLTWFTATrTyiTXI 

VXNMGCAAF ILXTVX*FVATSNGMX 1 SPUtSLGWX F X ATLAAIGNACVPfOCSfTLTLSLL 

TSKNVPl^IU;LILPrnVICMIEraJJVWSDCCVVSIJW 

CPn_0290 327027 328523 

Na -dependent Transporter 

RSALTJCnOOttSFSSIUXFIFSMIGIAVGAGNIWRrPRV^ 
IPLIIIEI^IQCLTiaCAPIGALIICrAGKKFAWAGGFI^ 

YAVSGKIHLCNOFAKLWTSHY0SS I PLWAKLTS LGLAYLVI RKG IVHCIEKCNKIU PAT 
FtCrXALIXfUVTUVaVQGIXOLFSOlKK^ 

GF ASKKTGWSNCALTAICNNLVSL IfCI IIFSTCA5U3IUnTQL0DGACASSICITTI 
YLPCJTRLPCCIYLTTI^SSIFFtAFSMAAL£SMISKIJlI^0TLA£3^IKpyiSETLA 
TIIAJTTC£IPSAI^LTFFSNQI7rVV*^^ 

RLNKAFDY I IKY LLP! ECI UXCWYFYECLFPENGOWWNPISLYSLCSLVLQWSLCLX X L 
WKFWCOLYLRFSRYNHEIL 

CPn_0291 328658 329194

incB-Inclusion Membrane Protein B
SKHMSAPIPTPOEI^ITCLNVOYOOySEL^ 

EIYSWTAAALILSASCKPCSCPSCSTECSVTA/^SPCKFKKVLAVVLTIIALIAIAVLIA 
CIIAACGGFTLU£AI^YTIGACVS£J>IIASTSVAI.ICr^ 

CPn_0292 329201 329836

incC-Inclusion Membrane Protein C
VKKTOJSDFNTSPtPFOSSCDASFU^PO^PSTSESOLTO 

OORDRLPTASIIUJN^^PTOCACAPFOPCPADOHHHPIPPPWPAOirrEITTIRSELO 
LHRSTLCQSTKCARTGVLWTAXLMTXSLIAI XIX IIAVLCFTCVLPOVAIXMOCETNLI 
WANV5GSIICFIALICTLCLXLTNXNTPUPAS 

CPn_0293 329940 332723

CT234 hypothetical protein 

VWSMORVLRLLFMJtKCEEKRAFlfFPLLSLVWC XGCYCTLSLAECLF XEKt£SAELPK XY 
XLCVLSSL X LYNLFKXH X SATALPLI PVS L>5 ILCNFYLILSS IFA I DPPRSPLFF 
YWVIWSLTI LSYTSFWCrVDOFrNLOCCKRHFCIFNAI I FLCOAICSG I IA3LVHTICI 
0C ILIIJTAALVLTFPIVFYVSKSLlCSLSDOHDI^IOTCHPPPl^KALKLCr^OXYTFYL 
LCFYFLM0UAI ATEFNYLKIFEIOFASKEEFELVAH ICKCSLWI SIjCKMCFALFAYS R X 
VKRLCVNNI [LFATLC FLSLFLPWTP1CTTL5 I AVLAMWR ECVTYALDONNUQLL I YGVP 
NX I n*j tB [WESF I Er tCKLVWSL 1 CFL3SOOYVFCL I ISL I AT I LVCLVRrJYYAKA I L 
KNL'iAOALOLTR.TMOrW t K!*MTVKQKROVELFLLAHLKH PSERK0TFAFQHLLMLASR3V 
LP^LLMmKI^LFtKUCTIEMVKSSLWAXDFLTXELLXRWrStFPHFAIATAXKLYFAE 
NDLLU I Til I AEDLYDTVODRLLAA tLTVRROEAYGPYnDLAtMCRUCELLNSOOPEDtVMC 
LT ILn-EKNtViFP ILLOFLNTKNEOrL I VTCKALl IT WHANl IK rYCrELLKRLROCSHN 
DEAyOYUJCT V.: I AU) I:;rVKDLLHTT:X?U(NrJRJCYA^I*.;KLaEVAPAFU3VLT0E 
' .TKMPCR I LAAKALiK lUNWLUCWtAYK IVKJKAwKALFY.OYIhl IY ICKOT PTYNL3LLA 
WPU«^r^AKVNFWU:UJa lUISWEMSCVURALTCKNOK IK/^AI^LEWITOSHLFSL 
I ja-PVTOK »e.-Y:TEKYYFKa IV t PLTUtELLNMMEMT»P:: JLNKt.TAOOUCEELSYCOPDF 
y^Wrri YNOtaiEOKHTEKETLICH^ I 



CPn.WH 



I » 1077 I 1 *S»2 



cAKP-Deoenawt Prat- Xinjs* A^^li^^?™ fefv „ 

IRNFFKKLtORAFULKKT I IWSLOKDLiLTIAOKTET I !FKPC#3NVFS :Z3?CTSTyiZ 

ECPSVALSrLELTMOSlXFREP "™" ■ * 

CPn.029? 33386* !3J«7 

acpP-Acyl Carrier Protein

AKSLEDDVt A I r/ECLGVDPXEVNEN5ST lEDUiAOSLCLTEUKTLEEXFAFEXIIIDA 

; .try* : • • »*y pr a r* 
cptuo 2 m i j-s • v.* j j 4 . : ; 

CT296 hypothetical protein 

XX PIRGHICKDITLVGXKWTtX^rcrC^ 

TOa»raFARVDVS»4aGVXOCVOXrUD01XXDX^ 

ISTNLTSLYTrCSSVXRfWXIUOlSCSXINVASXVAXIGSAJOTNYAAAIUtt IAFTXSLA 

KEVAAIWRWCLAICFIETXKTSVlJOilJCAEV^ 

SSYKTAOTLWDGCLTY 

CPn_0297 335724 334774

fabD-Malonyl Acyl Carrier Transacylase

SRSMK UW I K WCH YAnJTOOCSQTVCMSOOLYMEYPEVREIJXyAWDU^SLTSIHri 
GPEDU-METVHSGLA X YLKSMAWKVLSORSS X0PSLVSCLSLSEYTAX.VA5DRXSVLDC 
LELVWCROQLWItACWOSPGWAAI J AT PSEVIEENITSLCQCIVIAKYHAPKQLWACI 
A£JWIXJAXEI^RI2^CT0CAVPJJCVSCAnn7l>CVA^ 

VCKSLVVTXOIREajUtaKrSPTUfrOSCY^ XCXSXP 
ITSLCTFAOIEKFLSEV 

CPn_0298 336742 335717

fabH-Oxoacyl Carrier Protein Synthase III
YTSrFLYOTSVNIOMOCAAXUATCSYLP 

C^OEYTSLHSAXAAEXAXAHAGLSKDOIXXTX XFSTAAmXPYSSCAIJWAHLCXXDVFT 
FTXOAACTCYXYCl^VAWkYVrSCTYNHVIilAADI^ 

VRJ«CrAAXifSIAIACX0E£SXX»fFVWANERXXOAIJUCR^ 
ASSVCI ALDELVHTES XKLDITTliLVAFCaa^VCAVVIJCOV 

CPn_0299 336726 337415

recR-Recombination Protein

RXXLVYYSESLYSKIJ^PRPEaO«lHlTCTRYPDYUiaiFriJ»ja^IGT^ 
FELIShTOEOLKXLCaMnMVASERSHCPirrTtJCESXEA 

IQTVPrLtRSICVFXraYHVttBLUPXTm XLAXSkATtE 
a^AX^XJCOELOHFSVNXSIUJUjSl^XC^FDYVDSCT^^ 

CPn_0300 337768 340152

yaeT-Omp85 Analog

GRX^OCLXJIRlOTXLQXSXXALXOTPLTtJSTtXVK^^ 
tTOjrniSCUaJ^OtJJFDEDIJUlJUCEYnSVEPKV^ 

D^KINDSPCCKIKOLTTSCISRSEXSDXQEFIQTICOHi, 1 1 1SN> 1UACLYHPDXVEOO 
STJUTNYLKNICYAEArWSHYmiCKJWIU.YKDIDRCSRrt^^ 
EKQSOVCPHDLYCITOIWDCWHKXKOTYAXYOYIWrll^^ 
SPYKVGUKITGimmCSI7VTU{ETSIJT>^ 

GTYOHA^TTYILNraLICYCXFYRCSOTSLHEXRKFI^ 

VIgPRTPTTGIROCVTFEVSClXiJI tH> f KLSLNSS XYRXLTRXCXLKXRCCAOrXI>YS 
rrTTAIQVPVSERFPlCCETrVXCYXSFIXGPKYSATEP0C<71^ 
ISAFVPIJISGFVCLOEYKXSXJCOLR^SACFCXRFUVWO^^ 
XOVSORPTPALCaCBCF 

CPn_0301 340163 340762

(OmpH-Like Outer Membrane Protein)

IKDl^KEIFVVFRXCFVYTFSIPXLVOVIMra 

LEESOUaCXErm^AKKQOrsna^AEKXEEELTSIYNXXADEDYM^ 

EDl^YlUYOSOYYOSINOSNNnailOKLIOEVKIAAESWlCEiaXAXLI^ 

GTOKTTEIIAILNESFKKON 

CPn_0302 340766 341866 

lpxD-UDP Glucosamine N-Acyltransferase

SKrXEFSY(SEAPVYTLKOLAELU7\^VD^XETPISG\^DXSOAOPHHX 

LKtmCAGAX ILSRSOAMOHAHLKKNTLITNESPSLTFOKC X ELFI EPVTSCFPC IMPTAV 

IHPTARIEIOTVTIEPYWISOHAHIGSmTriGACSVIGAHSNn^ 

HGtlRVVVOPGAVIjCSCCFGYXTmFC^KPIJCNUWIVCSt^ 

IHECTKIENOVOVAHHVEIGKHS I IVAOAC I AGSTKTGEXVI IOCOTCITGMISIAOHVI 

HXAOTCVTKSITSPGXYC<^ARPYOETHRLIAKXRNU>1CTXERX^ 

AEIPSEI 

CPn_0303 342982 341921

CT303 hypothetical protein

REOKGLKHHDVSRK XNRHTQFYVDS 1 0CVI KNFOHKPS EOKSROHEELEEKLLTXTXH XV 
ASAOETONSKTOJW^YYLKICTOWLPFKNEEIXOrK El^AJILTSKOXK IA0LFFYSPGCSS 
DWVErTEVICHL^IGUXTV^LCCGLFTXrc 

TYRNXStXNCOSKSELGKELGOVUCOHGVAFTLI FKEtVDIOLLNYVKLIOCUCRSGNXO 
AR I YDNOVPTLPSVSSSP X ALRYSIANT X RCLALHVDFS5LK F I S PS I LSNTEHTAXALM 
iWGECFIFSNLDETMLCMK IVMOULRTCK tEPElLNKNIWCXLMXKRRVRSLYI 

CPn_0)04 .141091 144 

pdhA/odpA-Pyruvate Dehydrogenase Alpha

D0XPLPKRLPYKKVMOr*.<^PYN I K& TEKrTPVRR [ LOLYCPArtC I KFLXOMVLt RERA 
RCEEAYLECLVl^FYIi:nrAi9EAVATAAIAJm:L0Pt^.^RniAU\XUJ4XPLOCIAA 
£LirJCET*XAUlRtra;KH»nPNFK;r;R ; t Vf ?V I PLAACAAFT I KYOTOKNRVStCPIC 
ir»AVAt' IVHIFTLNFVIIIJlOLI'LMLt M*>^ rTfTtXnAVAK^M' I AC3QtlT£YOXRAV 

iw iPM j- rtt j *;F«KAY»VMVt«Tj;ivi.vw:i t ;:;hfri at: I ::DrNi .YM.*ncr*JtXLncR 

WtVMFIWLI RI.FVI.'rFJvEFyN IflOW-TCTAVI .F>F.*aiAKL.'jr; DP: '.VTTI.FJTIVVA 

'.Pn.Oiit'. M4I4^ HM ;7 

(*IWJ/'*«lpl*'iV( uv.ir Duftylrt'i'*!.!::" tt*<r.i 

« KESMI'KI IKTI •tfIKKAl.it I l/EER , ;f' W W ' 1 1 / XFJ-Tft : W| r *JVVKVTr* a J JUCWr;PItRV 
I UAPIUKAAK;;; : 1 1 f If 1AAU '* II J'P ( I KKM:*« li TIP/AI Jiy 1 1 ;;i lAAKMUfW I IKF.'IVP I 
VKBTTtlflAAAOVta.VIUnil'VFJri.YANIMi.I | I AI*!>f1('Y(WK'U J J{;:AI MNNMtVIiFLEN 

KifEYWiKfiEVPTEKYi .vp i* :kai ihvohy ji.ti i rr.*HV.; I tkkai-::i <AKKftwr*tL;; r E r 
(0LRTtKPL0t3TIL35VRKT3RCtVIEBGHyr/ jEXIALrTEHVFDSLBAPPLlIVC

9ketpmpy:ik i leoatwwnr i Lorr I exvmr 

CPn_0306 345134 346431

pdhC-Dihydrolipoamide Acetyltransferase

E I LRHECEIC IV [CTP I AVLSTEAKEPfNLEELLPKT EPSNLEA3PKCS3EEVS PATTPOA 
AOATFTAVTFKrEPPLJriPLVFKHVCTTWl^ 

■r ; i : " * : a :i • rr, ■'. *f L :/r»<vr/iiKOir..'^ r aapu A/*r r .; : r* ifwpocw 
* : j ::t .ir f .u -v.* : ; r ! - ' : r r • • ivp.v :ai ju /st;: : r w rac r/w .r :*/? rrr ic : J x A 

VAlPTOIITPIIRCAlJWtNUjHIiAEIICSLAJJCARffWUJOTE 

ErTAIWPPOMILAVCSVTEOALVIJXIEITICSTaW 

KILEAPAVLLLN 

CPn_0307 348998 346515

glgP-Glycogen Phosphorylase

NGClVEOFSSrD»nCVSVDSHiauaiJW.YlJVV0SPESASPRDr™ 

wxjoc»cyyio*>vicrvyyi^kiki^^ 

SDAGLCNXUStUMCYUlSMATIAVPAyaYCIRY^ 

NPWEICWEVLYPVRrTCPVIKYTOSIUSKOVADLV^ PCYGNDTVNSL 

RLWWSPKFEFSYFIWNYXQAXESXJa^ENXSK^^ 

ATXQOIIRRrrKTHXCUXaADKWVOUaRHPAXISXAZI^ 

VXFKrrNHTXIfCALERHPLDLrSKIXPRHXJEXXYEXI&nM 

IVEEXWOKRIl«ANI>VVCSAKVICVSSrKS0LIlCrmJ KU * EJ f PEW INVTM3VTPR 

RWXALCNPRLSKLLNETXCORYX ^DI^HI^LIRSF^g^Cl^^S^nCC mJNlCQ D LTSRI 

YNEVCEXVDPNSLriCTIKRIKEYXROUfl«IJIVIY\^^ 

APGYVKAXLIXKLXHSVADVVtCDSRVIOXtJCVLFLPN^ 

GMEASCTOMCTAUCALTICrKDGANIEMAEHICKQW 

ICOKOTKIRQVUaiEOGFTNSKDKDlJTCPrVHRI^^ 

t^KEPOSmaSIYNTAOCFrSSDIlUOIJYAROXWKVFTKSCSeaBN 

CPn_0308 349213 349596

No robust homolog present in Genebank/EMBL as of 11/7/98

FTTQEMNWTVAQTPQTTQPQPSVSHIUirHRyCSHVTTKPILVST (71 X I ASTiTTtGLVI A 
SCTm£t£XGXVIJUOXVIJCIALVXAFWXROFl^^ 

LEtsnrssK 

CPn_0309 350977 349595 

CT309 hypothetical protein 
nauHEEFXxiAEKExaniTVDiaajtsiJc^ 

SCLVNNNNKP XHVHVTSVDKAAFY VUJUJMUUEAi'A X F'lWYCSVNPEKTrSNrLVTPEN 

DXj>rirvLOEFTKsroEMQGvrFNPiY^ 

rTEHLVSAlRSQQQXFItSrntHXUtFIEOtEVrSGKSATQEli f tfr* NSLKSESXUV 

VSSSYAFVX&VAVBllUUSJtFEWWAJPiraL^^ 

IYJtf£SHVKTIiKALNIIJUCIWMr^ 

NVAOrTGVS0ESIU3lS0SHEYVLl>ROVAKrTC^^ 

LXEQKX EENSHOXHHAXQ0X8XNUI5LK1C5LEFTP8CEMX X 

CPn_0310 353472 351049

60IM-60kDa Inner Membrane Protein
YTOLLSLXFRVYQMNKRTtXrVSUGIAFVtXOXFTGY 
AVESVCLSVASWtOTW*3*XHXIOTAVKV^^ 

FTjmHiwuVYROQCSSFiiPTtmacm 

KDSTirCTALVTWRSCSPnPI/XYDSRZIJa^ 
YVtrNDYMOIIVSttSGSIECIllLrTASnB^TV^ 

r prrrfniraeTrevvPT Tjprer.T ctxtctt r or rvu» uwwcfT iTtn/ir PVPVT <rvTP 

HSIOUSLDRSVQmia^ENPESCTrVmAXTLTKETEXMf^^ 

TXKVKVIIQQaCGSUnCVn.PICVXEPXMf»^ 

LYXSCSTAPTIg^AISPITOLY mK y W YLT L lW 

DKTITOEKQENPCYLDSISFRSVFAFITAF?AALIJTXMKFF1&^ 

KUXYFUlAWSIHSKRWCU^mOTIOTKYTCra 

LLIOLFFI^AHFDLIJCSSFtliUUSFIPGWIDNLrAPWLF 

LCIWTLOOKVTSIiOaCCPVrDOOKOOCTVl^ 

VVOOWTNlUUJSKHIJOiEVVUMaCHR 

CPn_0311 354453 353575

CT311 hypothetical protein 
»«AIHAVIYWDRSKIVVGFTP«SUILTW^^ 

FSJCSOLRVAI^FTXYSXLFXVPCARLAYVXFYCWSFYUJHPEEX 10 XWHGGLSSKQGVL 
WFS DP K)CVQCVPVHPA^YECISYWS^ 

RFTAEYVKSHOGXVlAEICtXTXCOXUXPl^LFaVALLX ICSLXARRHR5H X 

CPn_0312 354518 354976

CT101 hypothetical protein 

CTMARN IKYFLILTPG XLMISAGMXLllJtATAI ALCP1*5 SFFTYCLLSMVSWGLAS LXH R 
fU^XTIRJ03U:i^EFFSOKITVaAyiK0TFISRRrLWVIMIArSLVUlRyiSNP<^ 
FVIRATVCYALIKTAIAYFSKLQNAtJfENPEGN 

CPn_0313 354957 355355

acpS-Acyl -carrier Protein Synthase 

WKILKE13ANSMEIIHIGTDIXEISRXREAIATHGNRIXNRIFTEAE0KYCLDCT0PIPS 
FACRFACKEAVAJCALCTCICaWAWOIEVPXVSHCPEVlXPSHVYAKXCI SKVILS ISH 
CK EY AT AT A I ALA 

CPn_0314 356285 355353

trxB-Thioredoxin Reductase

MI HSR LI I IC5GPSCYTAA IYASRALLHPUXECFF3C I5CCQLMTTTEVENFPGFPEC t 
LCPKLMNNMK EQAVRFCTKTLAOD t ISVDFSVRPF X LKSKEETYSCOAC t lATGASAKRL 
E I TOACNOEfHJICCVTACAVCOCASP IFKNKOLYV fPTCDSALEEALYLTRWSHVYWI t 
RROKU^iKAMEARAONNEKITFLWNSE I VK lOTS I VR5VD tKNVOTCEXTTREAACVF 
KA ICIWttmJrUXIOLTLOESCYXVTEWTrSKTSVFCWAACDN^^ 
I AALOAFJIFLG 

t::l ill Hil««:i*L t i Protwin 

MtKUAEYTl* ^ :KK 1 1 SXt I ECLTEDVAEFKDLLVTAHP tT^:XEE:WE iofov t LICfHW 
I'lNKDPVWlAffiLK^BtVIMtlFTIOSSBILV^^ 

•."njWKY i laAiM 'kb i vwx) XTRicvicaa. t vdi» wjvflr:w t ijnkk i KNLbOYvGKve 

KKK I U 1WKRRN I WimHELLEAER IftKKAEL t EC IS K ttWAIVVKN ITCFOVFLOLD 



KYPPCKRVLCK tVKLLPYC* * EECI ECL IH I ^O^^J^'Z^S^?^^ 
L5I0K0ffiKI3LClJCQTERNW*DNtEEKYPICLHVNAEIWtW 

VISCV^mrCATgEXjO^Tff^IgVCStl^DKr^AlCtfPI XULlWWtfXIXOPCN 

KKVSUmcnJVWYOCOSRTEUFKOSOCPKERXXm 

cpn Q3is nmm kqxzi 

nusA-N Utilization Protein A

. .. ^.v. - .... . — "wrr Fr *: r r " \ r vamt i aaitt: :vs,\k: ^.n 
''I'V-jf * kk ~ : v:-: r *. n r ,r ?*ar eti*i ' * 1 .» i • a.>YMnvr r vji N KiR : A,v-i 

AAftui i JUKiRliAEkOV t VttVKllKV'KLTLJwVVKKKAKUJNLX XOLCXVEAXLTritFYF 

KTEXKKI<^KXYAU<YEVUESE>CCAE\-ILSRSliA£JVKOLT:a£VPEL^ 

RCACYima>VR5S0PrTDPVC^rvCMJ^R\nCNIIR£lJffi 

LYP I EXQKXAX X.0OICVIAXVVNOAIJY ATVXCXRG INAJU.I S H ILCYIXXVQRKSCTllXL 

LEIQRIX3I>XFDSPHLJXPUMECIS)aVTQ>CJ3tAQfOT 

ELAYKXLeOVSKYCCSXVOEXPEXEO 

CPn_0317 360045 362750

infB-Initiation Factor-2



VSXAfiFEPELEVVDEVCDESPEVKWA£\nj»EOPVT^ro 

KXKSXFCPTClWXNKXXAIffFXAPAXEZXVArACSKSTKP^ 

KX)FNFAfnSPASCP1CRIWaaaiLTCnWRSlUC50E5l^ 

RVYJCPKIQm>EASI0RJ^IKISLPI7A7a3IJVAI2flCI^ 

SrrAVQFICLETKTIDIOY^EODJa^LSJIITIMU^IOOT 

OCTTLIDSUUCSNVAATEAGMTOO*^^ 

OrVVLWAGr^IKKITLEAIEKAXAAXJIArWAINICCt^ 

TVLXOM^UajCEALVrKlX > 0 KVK 1KHNEHNELXXEAC PSI PVLrTCLSDX PKAgDPfF 

WIWETTARX)IIEAR£AGOORFAlX»KKRPNrtiSMLO^^ 

ISTCIESEXVl^ILTTiSWEISESDIRlJUUkSKAVLICnfrciESW 

rrVIYHAIDAIKriKrSIXCTIAEEJCDECSAEIICEJF^ 

VRVIJaJKKIJflCTL5SlJaiVKEI^E^^ 

KL 

CPn_0318 362704 363126

rbfA-Ribosome Binding Factor A

VMSYNVKKISIXHXNrKLICYQfrENRJIXKRVNALXjQEAXAXV 

RVSt<5KDUiSARVYV5VMF1fEMIXEEAX£AI 

irSPQDYIENLLHOIOEKEXS 

CPn_0319 363133 363879

truB-tRNA Pseudouridine Synthase

TIFTan^IIOJMIMUAVElJCECILLVDKPOCRTSr^ 

FATOVMVKUCRXFT^X^XIXFEOKErXAIAKLCrm 

LSAArY7CCEIOOU>PKFSAKKVCC10a.YEYAJUCI^IE 

FWSCSXXTTYXRSXAKEZXnt&OCGAYUSQlJUUJ 

LRQAHGN5L 

CPn_0320 363824 364783

ribF-FAD Synthase

TTPISIFUTTEMPKEIAYSXTSSFSVDSVTVCFFOOCHLGHSML^ 

KOOtLILCY USC XG J CE OQSWrEAXJTICKPLS I WIKIPPYRHrKIVVSSKAIROFLEAC 
NLECAHRFI^PYAI5CKXTECSCXGCSLGFATXNX^REC5UPLSVYACEIBI(UI I IUJ 
CVWJLCTAFTFSRSSLYAEAH X F5 FAENLYCK CVS X X PRXFLJIEEXXF05KETU1IAXCX 
XULXMODWFAXCSFNYECTA 

CPn_0321 365900 364767

ychF-GTP Binding Protein

YS10XH\aXFXFRCX^HnCCIVCXfl^KSCl^NALTC^^ 

VireRIXAIJOCISNSQIUIYADMKrVDIAGLV^ 

CF0DPC^rrH^KVNFVEDXEVXMXXUFSDT55AJCN7K^ 

TXXAHLEKCLPUtrULT^EOWAIJCinrPPLTmPMFYXAN^ 

VAAKENSKWFICmXEUXVSI^IEERLEFlKSXAIJXSSLKRX. 

TCPQESRA>rrVVT^CSSAWEAACEIHTDIOTOriRAEVrTFEDHIECOCRAAARn<CyiHI 

Bsnotwooscmesiiot 

CPn_0322 366231 367328

yscU-Yops Translocation Protein U
SNLOlSKSEICTEXATPlCRXJtXlAIUCXCOVAXSOOFPSANn'FX^ 

rLVSXLSOAPTRHl)PVTTI^YLKNCLMLI LTAS LPLXgAVAVVCVIVCFLIVC Pl FS'l I V 

ntroiKKFNPIENIKOKFlUinXIELIICSItJCIFGAALILYITlJCSICVSLI 

XTAOIFKEt FYKAVTSICI FFX,IVAILDLVY0»WNFAKEl^EXFEVK0EFia7raaMPEI 

KCRJWOIACEIAYEDSSSOVXKASTWSNPXDrAVAICYKPEKYKAPWI IAK3XNLRAXR 

XLDCA£KYCXPIMRNVPtJU«)LU)ECXElJCFIPESTYCAICEIlX^ 

N0PDHL 

CPn_0323 367322 369460

lcrD-Low Calcium Response D
• SFIM^fKXLNFVSRTLOGX^ , ALIMXNK3S0LILAIJ«O1C^AaJ4I I IPLPPPXVDLNXTXNL 
3ISWLJJ<VALYIPSAL0I^WPSLIXITTMFRLCINI3SSR0ILLKAYACHVI0AFCDr 
WGCNYWCF 1 1 FL 1 1 T I IQF X WTKCAERVAEVAAR FR LDAMPCXQHA IDADLRAONXD 
ATOARDXRAOIOKESELYCAMOCAMKFtKCDVtACIVtSLlNXVQCLTtCVAMOOLAC 
AAHVYTLL5 ICDCLVSOIPSIXIALTAC INnTRVSSOtCTTKLClCEXSTQLVKEPRALLLA 
OAATLOVCFFKCFPLMSFS ILALIFVALC I LLLTKX5AAGKK00CSCA5TTVCAAC0GAA 
TVC0NPCX)Y3LTLP\'ILELCK0L3KL XOKKTKSCOSFVDtJH X PKMRQAtYOOXCXRYPOX 
HVRTDSPSLEGYOYM ILLNEVPYVRGK t PPHHVLTNEVECNLSR YNLPF ITYKNAACLPS 
rUrfVTIEDAKA ILEKAAIKYWTPLEVI XLHL^T^FF1(K.*;.'>QCFLC X0EVRSM IEFMERSFPOL 
VXEVTRL I PL0KLTE I FKRLVQEO 13 1 KOI JPT t LEJLnEWAOTEKDTVLLTT^VRSSUa. 
Y I :;FKF.*30CO5A X3V\*LLDPE XEEM t RCA I YTJX^N LALOP05VNLI LKSHRlffXTPT 
P«X JQPPVLLTA IDVRRYVRKLt ETEFPD t AV r .r/OEI I.PE in IQPLCR 10 IF 

i , rfi_Oi24 ih«M*,R ItdHHH 

i"TJS4 hyvwthoric.il protein 

V WAJ mRI{HAAJGCTCCLCCTtyr/T ILAAV W^KADAAETA^AT^EnSEMNM 100*^00CT 

NI*AAATR*rKKKEEKFCTL£.^RKF.GEA/^(AKrK\ , l-lTl*J^r'CfrOlJu3K^ 

IHiLFUA IUOUA.1HEOI LALVyEKIKLl-At^: rCAl JJYLV^TTPPJ VIKUC EAL I0ARMTWT 

tV^JKTA KUKNtLKAlJOEYADOUIVr.'Ili Jl .l^':l;/^.E*/TYOTrPr:D0LU»L0ORrrY0O 

HA I V^.*:F(*!K04ATELKHO:P , r /l-Mjl iff tAffl! SjAVbTi rTDYKF-SHVP I LLCCU 
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AFCIVTr.:OLflfVKVAE.TYHK: IriWtrPTASKV. /WLIGDOVDSVrTVLKLrrSALR 

CPnJ»25 now J7U49 

CT325 hypothetical protein
mAKWOY£C:/.^5UmtfrrLAPOW 

CTTLPENVFR EH I PXAALSVNCSFSS S I ICC I LCYCEVTOOLYLSD I LSWm/CacLfEYL 
KLrSWAKlWMKLRTOILPDLH'/Lnrrn/A 

■in.: • W-H^ 

ndiQ-Clucjnocr.in3tere*e 

LiacoxoGrsviotxpuiDrcnwsF^ 

OWnCSTPSVSYTOVKZICKWAFIJUYYOKCCK^^ 
AIlWiWOTXWWKSLTWBlFfWKJC^^ 

vixjcoou>iLis?cDScrwynir7YrsssRS\^UJ , roLm 
rwwtEWJtvAONryswwjHiicrrRLwiwosscRCW 

ASSWJIOTlCIIFOOWnTWHtSICCra 

THDStrrrAowwuispKEAKorAKnjfLFroicrL™ 

UU£PDLVSKNLOR£*imTCTISTO*SYRVW^ 
CPn_0327 372927 373211

rl28-L28 Ribosomal Protein

RZHWWMSRKCPLTCKRPRRGYSYTIJtGXJUa^ 
eEHRftlOJUSASAUWXOKLCLCXVUItAXSlOIF 

CPn_0328 373220 374992

CT085 hypothetical protein

IJCYREIFMSFlJUWISIJTlSOKOLIDVrAPVSPNLEL^ 
STfVLTNU'CriCHRVISIJ^QJtf 

ARfR^PFVSMSSVNIXHU'LLTSWPmSa^TLPLVrrES PTLTTPNLQfYRVORFNQ 

(mttilFOIOKGQGMHLYEWiaoaNL*^^ 

QGAKZI.YKlRHDHfHPLLYCAXriLV(SPAO)(W 

iYHRxiwrrpATvvGKPYomrriC2Jia/:t«^ptJ^vMp^ 

VVKEP^ESLTTAIAZI^^LTXrLHVTSOEVFUm X 

Fsriwc7nzirna , sijacsKorFH3iGJCAj 

TSUTOCIK5LUfKFt)UC5WFLZZUU»lJt^ 
ATKRPNYNFPFVZDA1JCKPSYPKSVEVDPSTIW 

CPn_0329 375085 376146

Phospholipase D Superfamily (leader (33) peptide)

KHKKRQCTKlJaCVIISTLILVCIfA W^ 

AZEHADETIFIJtZYNLSEPKIQOSLTROAOAJOOCVTZYTO^ 

FAflRJ&lTOXAiaXOXKIHWI^ANYT^^ 

WX7TGICmrLi^DRIUAJOA\a-EXIQTAOICriCfVAMFALTKSEI ZOALHOAKORCZHVD1 

IIDRSHSKLTrKOliWLNXNlCDFVSZNTAFCT^^ 

DE^IIU2JLTK0WIOJimWKIJLMCKSE3fPT^ 

CPn_0330 376930 376202

CT083 hypothetical protein

ttSZtKLlASFOT'SVlJSRFQDLHVYRFKESLlOU^ 

ATNEAXIUJaiAIGDR£0m3C0RimYmjCOXC>AKVI^ 

SIKmJOTCISlGAAASQAIAAAABAWVXAJiNR^^ 



CPn_0331 376452 376701

CT082 hypothetical protein 

IORIIKAVSGOX^PSSDPCKWNPALOGEOAraSPtJCESirSrnCOASSAA^ 
SGSTQnrATeSOZNXAKVRKAOIKSSrSPKSl^^ 
KPJiSTiSCEGTSliFTEKDVAIJaCGNRZSFDW 
FSOTOIrfl^SESSSVASFCSFOKAIEJWSEEXVNAW^^ 

LVRRAMATGNEQMX DI^OtiSOEXVSTAKrSPRAVlIjKVICVSSSDSPCANPTCI PNSNTLC 
PJtEXEAEX0E5RlQLSEDQMMXJkRAHAGX£t75AAPQEVU 
PTORSCDKSKHKSPCIEKSTfWTWSPlJlECTVXSAEVKS 
PEAWXESTAFTCNPE^SQNFI^IAVESVFPICESCTGCA^ 

APLPRATDDYK EKLEAHKG POG PPDP LI YCJYRNVAVEPP rVLRSPQ PFSGS S RLSVQCK P 
OASWVDQGaCNSOGFSGOQmSSGQKASM 

CPn_0332 378676 378536

CHLTR T2 Protein

YUlSRIRVIFUVRQRCTUilUAVirPPISFFTOCVSP 

CPn_0333 379117 378800

ltuB

VDFFVFVFFWCKPKKSRTDRALAQE X QKKSTEVLXXPAR XKAKNRRKFL IAXBOKTLXHR 
AOEYDQL7R SLLIJ3QKK0TDKVLIFNYENGFVTTDKDHFSKY5IRL 

CPn_0334 379308 379823 

CT079 similarity

TMSVHITPRKCFILCILSHTnj^Lf PKAHLILFSPYIVLCrYCFSnXCLVLALK 
£ OLALfiSRGVFLLLYPLTAL XTKXAH LI FSKESKAALVX VNM X FYGVFLLLT I PHCALTC 
HEV1^XDVLMIPLKC^L0NLIFTSVXYXLPCAXN5CXKKMXSF7RRLVCY 

^Pn_0335 ?79«08 380674 

folD-Methylene Tetrahydrofolate Dehydrogenase

EICMIXP'JI PAAEKIt^LKEClSOSPTSPCLA\T/LtCNDPASEVYVCM!MCICATEICI t 

.jKAHKLPCDSTLSCVLKL [ERLN0OP5 IHCXLVOU'LPKHLDSEVtLOAXSPDXOVDCLH 

FVWCKUJjGNFOCUXlTPAC I IELLNYYEI PLRCFHAAXVCRSNtVGKPLAAUe^OKH 

IVrNCTVTVUt^SENLTEILKTAOt I tAALCAPLF I KCTMVAWAV I VOWTTRVPAON 

AKnYTLl/JDVnFMNVVTKCiVA rTPVPCUVriPKrVAM LMSMTXRTYONPS 

'TnjCt*. inns** 

/'ML 

K:(JKN)':aai:^K WRW.qiU mAI(tX)YMAIU.PKFFLVU*;U;unX"JOKTTTIECEOKr t 
KYM t Vt/jTtt. lAKKtwV-l I DRCFHK IOr; I YH*fflPYSEL!t f I NRAPADVP ITUIVEL 

^:Fii///i/ri.wuj»^KniTVGK,K™ 

I mCT L t FKN1 1 IWj I DU \ WnCTTYAVOCLI^ I WTF^PNNYVEWt X1F t KTJGHHPSCRPWR 

i fr.ttw rr i i.i j i iji «a i at > vnh iokwvbt;k tYTii iLOTtmiK w.elssyp ioiwsww 

I *:M 'AYAIjA I ATVt MTVVtX I FAKi^AEEW 1 1 LTY imTAT.ti 



cpn.0337 ; t >*W> 

AAAMCRRHH 

CPn.0338 "2272 38)375 
— no i-^'f •)■•»■: ••' i'' 

KW«r. : . : .w4r::.-?=.:f:K: . -r:\NTrr: \-.~sv:. r-.vr-T.vrrt.-r.rTJ.' vrr 
PDiioujirsu»A£0txrT4WirrsrAvsRE£SRWLTC^^ 

XQJUEVTLDKSFSCCYX IPIKAVttl X1Q«SDnCAAXrU30DXXAvXCCWriXtTKLLSC 

EFPOrWXSTCSMVKUJUiRmiTUJtQVAZ^^ 

OTQWSMAVWSWXIXIAFOTFrFtXIU^^ 

KFMXLKQD 

CPIU0339 JW405 384034 

CT339 hypothetical protein 

vnTJWFMKZCSUaJttTlaWDL£XSl^^ 

wrrF^SHprirrorncDHiJCAisxYTOKOGiaticY?^ 

KOUiaSSAPAfiRIUJ'LNUi^OCCMHYT^ 
WSMTAPTYPSNCrSVVRNrOIYPlOirCLTT 

CPn_0340 383842 384156

(frame-shift with 0339)

PtYTLI,r^SRSSAaCStJOX()ANLMR^ . 
USIftUXEO£JUJCnCSSLXI0MSDISrrAVA£CniK0LSISLPRX^ 

CPn_0341 384160 384495

(frame-shift with 0340)

CffrSVCPHRTDriiTWOMPVSOrSSECOKHSLLAI UUJkEZXYLXOSKHVSPLVCLflOX 
KACLIKEXVOOUiJPAPTLCXTLITSTWOCEl^ 

CPn_0342 384619 385062 

predicted OMP (leader (19) peptide)

VALAHPOCPriAjaCOClJSWIiRTOCLH 

CPn_0343 384999 385595

(frame-shift with 0342?)

LPRJtSQJORJO^IIi^PWCSTlJWRYT^^ 1' ¥ CPTItttt HyGKL PS 

SLiJvuLSoa«R«SKixpnttPYENixawcTirrxi^ 

PIOCWSLNOKNEINPEXLEK 

CPn_0344 387432 385556

yaeL-Metalloprotease

ssimrri ZTTzuutfJCJKLVLZKELMvwwuvcHAVEsr sicPOPAtnoouaotrr 
RZCCXPPQomxRaciRTKaoaEiocnciD5\nrDXPQcm 

AVLAFS II,YW«aiSXHYSDC5ICVVCW VHPVI QAFHT ,r,PCDEILTOCKP« V UJUM LTT 
CT-T r ffr^ Jgf.gTKMGYMVP5KErAIDVltfOPTICFCN^SG 

iisixjtPNDiirvwmcntfSMAOxsoxu^ 

¥!Ja4n.XC7TCyVGLXCKU5SLmPYVINSYCYXCCZXTM OPESPLPOPQEKUOLCDR 
IlAXtCTPVSCSVDILRI.'WDNXRVSI ZVQQKSPCZX£E\^RiMDmXASYKSCDUAZ 
LNHUIESHPVEVACPYPIiiMVOPRPWIDVYSStSl^^ 
ADDKPSLCXSXJOSLXVRYNPSPVVWI^TKESLITlXALm 

uma*svcrsEvijvica^s>gg>N^axpiPvu^^ 

LVPPTTLU IFFIFLTFODLTRFTC 

CPn_0345 388587 387436 

CT345 hypothetical protein 

EJCVAQJCKLAVLCSTQS ICROTLETVRRY PSEFK X ISMASYCNKIitLFfOOLtCrAPtAA 

AVYNEEVYimACORrPMWFrLOOECLTOlflMrmrr^ 

ALJOJWKJULVCACIIVSKTAJCCCIKVLPIDSEKNALYOCLECR^ 

puiflcsLXELSc\nroDvi^prwN«SKvrvMST 

VXKFQSUHGHVmJCSVXSXJOTPDXLFPXOYAX>TAl>TC^ 

PVDt£RTPSIWJU»VaZKOCSSCSrrNAAHr/LVRRFI^EZISVCriIL^ 

VYACHSLEDILCVDCEARALAOEI 

CPn_0346 389690 388704

070-troD/ytgD-Integral Membrane Protein
KKCSLMAltfPSPYYWSFTOTTSVFrSRl^SOSUTCSLYX DOIQI IvrtAXfiCaQAFAO 
TTLVLRKMAW ANAVSHT\TJ^VCVOXTM0LTTLS LCTLT LAAHATAKLTCFL IYTI ■ 
^FKVSEESSTALVTSl^SLSLVIXVrMTKHAHICTELVLChMDSL 
AKAVrrXrArRSLVCSSFTSVPASSLCI PIRLVDYLIIFtJl^ACLVCAntAVOaMALAT 
LIIPSLIAXVIAKSIRStJUWLVFSICn^FXAPASSRAI^ 
WTWKFXSYFRCYFSICNFEKISEKSSOY 

CPn_0347 3*1078 389678 

069-troC/ytgC-Integral Membrane Protein
TFCTNPCAI^KTXWXVLXMLSCVFSOTXFLSSFLA\m.ICKITAIJCT 
ESLSHAS*/TCL^VGAI>4AOVFSWAGIFWIVLFCCAASVFCYCI XVFLCKVCKLKKUSA 
LCFVliWFFAICVXLASYMCESSPTLYNR I NA YL YCOAATLC FLEATLAA I VTCASLFAL 
WWWYRQ I V\nTFBKDFAVTCCLKTVLY£ALSL I F I SLV I VSCVRSVG IVLISAKTVAPSL 
<>R0L5DRl^XLtLSAJFJStSCALCSYXSVAFTCRAX IGOOAV PVTLPTCPLW I CAO 
LLACUrU^3PK£X>A/rRnT?P.KMFSF<nca3EHtXKVFWH 

OEYFCPKPFPfWRVQ XLDVRGT/XKEOOYYR LTKKCR SEALRLVRAHRLWESYLVHSLOF 
3KCSVHELAEEIEHVLTEEL01ITLTE [LNOPOTOPl 1PQ t IPNK KKEV 

CPn_034() 3«»U15 1^10^7 

068-troB/ytgB-ABC transporter ATPase
fT»LUWrOETFW3VHNLC\V^EMAAVI .YH I "FrtLCKCSLTA lI/iPMOrJK.7TLUCA5LC 
L IF PCSTXr/FFNOKTWROP t AYMPORA.'r/ljWnf PMTVI DLAIJf X.TUYKliMUCRISS 
nWREAFU I LGRVULE^VATRO lOOL:^ X Wjl'AKLAUALMOKADLYLHDELF:* [OHAb* 

Frrsvrr/LOtuiDoacrr^vHHO^ivROLFnitwLL^ 

VTCE I ELLEQTLKLSRfiXCFiiC'.' 

'H.V-t riA/yraA-:^jtut.f t-rfjr.un lurcliryt Kmity 
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W I LWlA:tREHDAKMCY I FKVMRW IPTFVACC ITJ NSCFQNANSRPC CLO«RHDTOC 
VERVVnNHU\TAVLIICC5LDPHAYEHVICGDICDIt IACSAVl FCNGl£LDfTL5LRXKLO<N 
PNSVKLGER L C ARGAFVPLEEDG tCOPH t WKDL3XWCEAVI C ITCVLIEKFPfHSAEFKA 
HSEELVCEKS ILO3WAIC0CLJTIP WLflYLV5GHM\reYTORYt>TPEeVASC*W^ 
ISPEGL3PEAQE:nnmiHWVDYXNEHWSVVTPEa^^ 
K Pt,YSONVDDN'fF!7rrKHNVCL ITECLOCVAIXCOR 



^»n_03M 



»H6'J J«1684 



- • •s.-v in rv:.. ; :n ;*:>*a '^-v - -:«• rir-LCnv: ,r* :tm yrwrn-rr jckkk.:aa< 

tULLXMLKTF ILE3ETPRT INPKPkAPROSKKRftDFIMrrKTOIERVlXLAftOVCOKOU. 
AJtrepKKPLTSUCRELIRS:iWWSVn>aUYVCAVKAVSSPNlXVTSP 

CPn_0351 393861 395432

adt-ADP/ATP Translocase 

K Z PCVTORVNMT1CTESKPFCKLRSPIWP IKTHCtKICVlJMflJCrTCITPWYIVUaTncntL 
rVGAICSCAEAIPTXJCFV^WK^IFHLXYAJC^ 

V I Y PLROVLKPTETADRLO AI LPPCLLGLVAI IJO»nTAA/TVlAniCSVKLSliff>Cr 

ANEITKIHEAItRPYALFCIGANISLiASCI^IVWASlCLRASVSBCVO^^ 

IVSGlVlitASYVOTNKNVLTOPRrrNPEIWKGIO^ 

uviAYoxcxNLiEVTwiesoLKLQYPNMNiJYSEFKaira^ 

FCWLTCALVTPVMVLLTG I VFTALVI FIWQASCLVAMFOTPLiaAVVVCAIONXWKST 

KYAIJDSTXEMAYIPLXXJEOKVRGJUVAlDWAARra^ 

LAVIU.FIUIWLVSATKI^XLriJS0SALKB0CVA0EDSAPAS5 

CPn_0352 395478 396830

No robust homolog present in Genebank/EMBL as of 11/7/98
WVCirFDiSHFTOSYATFTJOKVI ITVRHSGCTMKCSPLTLVWIFIJCNDCECHRSCSLJCI 
RT IAW.II^VLALVSAI^FVFlJUkP ISYMarrXJOAAIVILI ITLWAXXAKSKVLPI 
PWEL^IIYNRYPKPn'VrvrrHSLTVNELICirilOflCSCTO 
SIDt5l^7EFtEIIX0^PLVWLSHritJKTESVAC5iailKTO^ 

XFHSYTRPLLTWSESCYKaYSKASio^owDSPSwati^ 

QFl^l^TSKGriVGOAOMZOLZNPCNMIQILCOFnCA^ 

StfYEPTVNFKTVKZXJCYLLEXVKES PMHPA£AI«VQK ICWITHHONIXIQSWOFVRtfHJSQ 
WTSSLPOVAFKAOTYXLEKXI ESSLPIRSSL 

CPn_0353 396693 397135

No robust homolog present in Genebank/EMBL as of 11/7/98
UirRNIIOCSUFIKRIRYSOXKZOXCARPPTKKSITSSL^LLLIAIFNENFSSI IQNN 
FNKNFWKNISINRIFV1CFTX 

CPn_0354 397062 396507 

No robust homolog present in Genebank/EMBL as of 11/7/98
YTTISIKILIUKTFIilGFXUnJlYNTOIDEPRjera 

TTTrffiTtliT" Tapt rvt^ii^T^^ri^/^^ T xr.Tmrrr/?TmTPMTgfiltE 

ONTpQEtVNRIJUkHYyiCrVSnFVSEAtJNIJIPLISr 

OKItJTFTClAMJQJEVRTASUOUXSAASSWLJTSLPKILOKV^^ 

VELHRVKKIGG ST.FTnf JTOY IKPIXLPTYWLI PLDFft PTMSS It/iLHTtVLABVLTRDVF 

OHlXYAAZMSIAaMISDLNTHKOQUAXYHAAYOSY^ XFSH 

RYSUKQKSLZlCrVPAQtMEMUXLTUS^^ 

LnXSUXJFRTXRROSTTnAKFIJlUTHNSTFRSl^PITVHPU^ 

IG 

CPn_0355 399955 398591

No robust homolog present in Genebank/EMBL as of 11/7/98 

IRDrrUillYTAFKRSISmJWShTriWHAIJ^ 

XGAXAALGCLAPPVSYIVGSVl^IAFVTI^LVILALIFGE^ 

EAYGI^ISATVRSOQVTIAETRQFSTALJXNISPOT 

IXWWPIFIDU^OTCPLYV^KFISACOPQVCROU^ 

Z FCKETHHX LQQLTKEXJVIXLKMCALOEKWDro 

KETISKEUXLSLHCYSTOWLITWfUAWDWL^ 

LOESS IDFOVNLGLYVIODUCEAVOAFSASDE^^ 

HAXALEKGTMRARVYDVNFVTQAR IHAKTS X FFTCD 

CPn_0356 400465 400109

No robust homolog present in Genebank/EMBL as of 11/7/98 

KQUOLFOYMNESOrcwi^FCSOCTOFOLS 

£LI£HJU£JO*naCVAICDLCNVFEXHFQRF 

CPn_0357 401341 400469

No robust homolog present in Genebank/EMBL as of 11/7/98
YSSHNCASMVNIOPVYRNTOVNYSOATQFSVCOPALSLI I VSWAAVLAIVALVCSOSLL 
5XOjGTALVLVSLXLFA5AKFMIY1«R0EPKELLIPKKIMELX0£HYPSXVVDFXR00EV 
J X YEXHHL X SXLNKTNVFDXAPVYldQEXLLQrGI EKFKC7VHPSKLPNFEEI LLQHCPLHW 
UjRLVYPMVSft/TPGTYCYYVCGPLCLYENAPSLFERRSLXJjLKKXSF 
^m«3Sti , /0IR0NL^^RYYAr>KEEVOLUIJUkOYXOFDSlXHUFSHKI^ 

CPn_0358 401757 401578 

No robust homolog present in Genebank/EMBL as of 11/7/98
EE7LSVSMKLIPTGI)SXERETOSKRDIQCXFrXYXCSSKVL^ 

CPn_0359 401994 403817

lepA-GTPase

ITLOYXUCEYKXENIRNFSIXAHXDHGKSTXAORLLESTSTVEEREMREOLI^HDL^IE 
RG IT I KAKPVrKTYLYECEVYOLNLX DTPGHVDFS YEVSRS LSACECAIXX VOAAQGVQA 
OSUVNVYLALERDLEI IPVLNK IOLPAAOPVRIAOO lEDYIGLXTTTNIIACSAICrcOCIP 
AIUCAIIDCVPPPKAPAETEiXALVFDSHYDPYWIMVYVRIISGEJJCXGDRITFMAAJCG 
nSFF/WIGAFLPKATFIECSWPGOVCFFIANLKKVKD^ 

KE INPWFAG I YP 1 0S3DFDTLKDAU:RL0LNDSALTXE0ESSHSLCFGFRCGFLCLLHL 
KI I FEP. II REFOLD I IATAPWI\^VVU(NCK>rt^ZCl^SGYPOPAirEHVEEPWWVNI 
[TPUGYUN tKNLCLOKRC tCVKT EMI49QHRLVLAYELPLNE I VSOFNOKLX SVTKGYGS 
KOYRLf JbYRKC^ I IKLCVL INEEP tOAFSCLVHROKAESRCRSICEKLVOVI POOLFK X P 
I OAA t NK KV t ARET I RAfciKNVTAKCYGGD (TRKRF.LWEXQKKCKKRM K EFG KVS X PNTA 
f IVTJUTAJU 

•lii_'U'.'J 4UW22 
•Timi tr/tHH.tt»>r uj.il (>rori*in 

V A! iftt I f ';i * 1 1 ILAVKtKNLVl MH 1 V? VWRTPE KTR&FUCEYPMHRELVGFESLE 
1 IMLMtLVU SKPVOW 1 1 IALLPFI XKXVI I lf$ 9t 1 Y FKOSERKCK ELQGK 

• :»t*i/rA;t:iinEH;AW»:rczMrnnNPFJwra.w 

1 ivvFAViirr : I ey» :nioi. u:kayo 11 jioflklsatavat i lkewntlelesyl i r t asevl 



ALKOPEG I PVTST : ^WX JKWTAIOALKS r.TL.;L 1 1 "^^^^l:. 1 ?^ 
ARNYPCTPLXFEHPHOP3VFIOOVFMALYA2TK I ^^y*^f^?rJ*i^^2?i25M^P 
LMWRCCCIIQSA^IHWAA^ 

xpciwuArrranmAfiSAtsxJWJctwrciM^^ 

K 

CPn.03el 40*650 405382 

TLVNNAOWUJEl-^-^f ^JulJWlKKUXHLVKLT; K.-KVXJCEUiJYTC'-'Y^ZLCSYC 

FYHLntNYCT I LCCOSSOCWCWITSC XOF tRRXGLCOAYGlTYPLLTNAOCIOCIGICTlSC 

Tt/MtJ«rnT,TSPFEL-/eYLtJlIJ>DDTIPKXAKTL7l^ 

00XLSAXKG0LCIXEALS\TRSI«IOa^SX^E]mrHV 

LFtVlCLCKSKGEt»U.XEO]CGVYINNVPXAfaMSVCC£OOZ 

YLN 

CPn_0362 407843 407055

fliA/rpsD-Sigma-28/WhiG Family

LDKKKTVKTCOTONI IEVWNFY>rtTC^IEYRDSLIErrwPLVXSWHRXISG)<PSKVlC!X 
OLYASGVIT^VIUVEKYNPEXSIUXrrCYAVFLXKAAXXCDUUCD^^ 
AWSSlJtQSI^KEPTOLELCEYLIJ ISQQELSSWFVSARPAX. IVSLNEEWPS OS D C GAGW I 
EEXXPDERA£TGYXMW}ErSiriJ^XOEXXEXElUCNmAL 
ESRVSOXHSXALLXLAAALSAFR 

CPn_0363 409700 407943 

flhA- Flagellar Secretion Protein 

t>VrVSGXiaSVRa«XFVPLSILVLXfXPI.POXU£FClJCXSFAI^^ 

SAfUPPTTTYtr ;,p;^jjrACTPWTVg«aTrJL«;gL XVSLGSrrSLCSLMkATT ACUXF 

FVNTtMVSKSSCXXArVItSIirriXALPAKQHAtJSDLV^^ 

FSAHEGVFlUrvntGXMXXSCXUVNVVSVTCLYYTS^ 

TSCAAAT^SKICJOISU^rtUTYYKOUtOHnWSU.irSL^ 

LWLAYIUCECPASEDSCXmArSYVECACPXIQCSOFYGV^^ 

ijatiRpwijnTO>Arrtz)EKrp£AVij»ruw 

IVPIQnSUSLVVtJRXXVRIirVSKja^KrX^VAVYONSGOSt^ 

C»SLTOK0TXXVmDF1fVEEX>X^SYSKSOTVMOEW 

SCETRFTMOO^PHFPOliVl^HDEI^KIIPISFt^rVSDEVL^ 

CPn_0364 409954 410236

fer4-Ferredoxin IV

ICBiOiAJXVCTSrCTOEreXEXtC EIAEPC ESICX PTACTEGVLUTL V llVUaUMLS 
EfTEPTYOrLCEPEDSNERLACOCR IICGCCVXVrF 

CPn_0365 410496 411544

No robust homolog present in Genebank/EMBL as of 11/7/98

PKOTGVNSL 1HATIS P I SLTVDK PLVDTJOaCSCSNFDK I OSR I LLITAI FAVLVTIOTLL 

IClIIMXPVXYn.TCXSrXAVVX3KrTLY1CIUTTIIJCP^ 

XSXAXMRSKWDCPIOaLC^PAPSALLmm 

SASDGJCrrUJEETSaNAPISSYVDTrPSPKSU2JEAI0ET^ 

KJPDFRGRVTLPQXFTTPEAXYCYYYALYVTYXQTAXNTKrQX IOXPLYSLUKLYSHS. 

PPOSWtXSLA>4ITAVXYHACJ<Pr^PLT^ACVI^LAOLTO 

CPn_0366 411976 412440

No robust homolog present in Genebank/EMBL as of 11/7/98

CLIVLSIJAIRPAMrrtXTOTPAAIAVlAVSGTILLVAVI ILFCFlAAVPFAAJaCTTrT 
VKTVD0VASHHSH0CTPTLGTIFSGIVYAES0A0L 

CPn_0367 413078 413836

No robust homolog present in Genebank/EMBL as of 11/7/98

SF PLNRYFMTKtTS XPDWENOSKLSVDERLISESPVLTKKEVIAXI IKLTALILAIAIA 

VCTAWACVLGMPLMAIATGAAXXAAVnn^CLLLiUUlEPSKP^ 

V^PSVPLOYT3KIXRNIVri.VNTLSElNISWn^PN0PonnA«^ 

piUJCMCRVMrVNAANS^SCCAGTNWkLSMW 

RSAPWINRUWTNX 

CPn_0368 413766 414107

No robust homolog present in Genebank/EMBL as of 11/7/98
TLAjarfLwvNAAOHPCSirrcRi>rc^ 

YlJCTD£AlJ*NOATVVg\^ISSSIYSPCGXLEL^ 

CPn_0369 414345 415562

CT058 hypothetical protein_2 

NXMTDSNPLPSYTCA5 LYRTPAKMSYP I R LPLNRTDR X EK I LK XVTLTLALACALCFS I A 
ACXLAMPXFSAVVVITLAIAAVSLYSLLKKPtaYEXLPOXEPCSEOSSLSPSPOPPCOOO 
LPLOXOPLPDPE£LPEVSLADLTTPPEELTAITVrPGYEALJ«EONWDlXPSLAAVI>PSFT 
TbTPQO P C F IWKLKP5 KLI F X STSCD IAVPR X KTQGRVM I VMAANEN ISREOGCTKKALS 
IATSLCCHNASRL7RAHSRSCS0U}PGECRSAKWEMS0HTSNDHVPGKAHF1>0L^ 
AXONDPKQAFEVSKKAFHNLFQEAEI XGVDVZOLPL ICCNLFAPSRLLNLOCrRACWI E 
AXKLALXTSLQDFCWEOONOEEOK X X ILTDKDQPP t X PPRFDLTTP 

CPn_0370 415755 416912 

CT058 hypothetical proteirO 

KRIFFKLFVFYLK5FMSTTEPNLTNVNLTMLI3SE^PTOLASHKIJ(GIJLVAFILIIGI 
AVSSOTAAX I LCI FU.F I LTALAVLAF.il LLYFLLREPKSPISVTHOPTPI IKOTOLPPV 
PPLALTPVPTEAVXEEPPUrSPRTHOTLLO^iWDR X PDLOANTOMPF XAADNOTCYAWML 
KNSNLTLXSTLCrXEKPRYKT/CGIVMtVNAATPtMAMNVKrTTSLAtJUXATSVRCWEM^ 
SPDPLRSK0PLCLJECR5AfcVENUrTTTNAf;KAGLP0FlJG^ 
CROAYLNCLNEAKRRKTTVV'OLPLljraiFpr/rPKOEmSLRLOWZDnVKW^ 
CSEAENONOPWI I LTTLARH T L IT P 

CPTU0171 417141 4l7Mi: 

No robust homolog present in Genebank/EMBL as of 11/7/98
KTMPVMAPLPTSIRPSTOUlLMFJ^t'IKAJrAKIkiDrrTrr IKI.I.VK I LVA I LVIEVLG 
[ 1 AAFF Z PTTPP t v*L 1 1 U a !L t LTTA/ VI i J .7 f K LAI .VT 0<T B 7TTAF4.V I KR KUISK3 1 

t:Pti.0l72 41'».M 4 1 MUM 

No robust homolog present in Genebank/EMBL as of 11/7/98

nyrachrn ti»JHMy*iPWt\Tr.'::*jL';p'm/rr i / ikfi.kh uv v. a uti : 1 k 1 AFAArrf aluxn 

TFV3C I VA t AMI FVATirt/UAYFTV VlHJt I J ZV, .1 U JK IMI .t.'JMYK rniPMCVTIMiW 
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»:pn_0J7 1 4U1W 42021* 

gcpE

MS E t FC t rHTt rTPAINSSRRKTHTVR XCNLY XCSOKS X RTO&M1 1 1 LI .UIDSTVGOIY 
ALAEHNCO I VRVTVQG X KCAOACEX I KERLX ALGLN I PLVAD IKFFPQAAMLVADFADXV 
R I HPCNY I DXRNHFKCTK rrrEASYAOSLUUXSXTAPLVCICCKRtCKANRlCVimSSLS 
ER I M0K7CT7 T EGHVA3A I EY I AVCOCLKY*0 WTSMKS SKP K IHVTA7RQ UUCDLOARG 
WLYPLH UT/TEAOt^Vtr. t IXSAVCIGTUAfSLCOTXRCSLTGCPTTEIPVCIlSLUUrr 

- f?.~ !*. •: : f."K.T — *\Af*K rAr-r:>r rr , .--r,*FT.KiY r »? ; j:.t rrr r.siin ! 

- *;r • -vi.-n ri r v ,vr i s: rwi • rrr".': :*R4r i.vpinaiovt «:i r ■? wee ncrr* 

/HvMP*VHFHAiDPFIHTSluVraCQUHQ(* 
CCI£EAWIXtJ^PLQUVLXXAFtrri£W*CVRL^ 
WTTWXJCLKUIHOCIVWPOTiAO^^ 
LIRLLOEHCVWXDPCET1C1V 

CPn_0374 420209 420961

CT056 hypothetical protein

VDSHXUFHTHPL/fVWrrE£fDCI^XRHCVFSX0KX3A£STVTAA 
WQRJCTCVRCVTPTSFTYOPAflarrOSPL^^ 

WRCU^IYAVTVCTMKiaJWTKPOOL/VAICPS XG POYAIYPCYATLF PRSFLPFHNPK 

WFttt^ lARKOLT^ISKDRIFISBLCTYTE^ 

NVTAVLLLPRD 

CPn_0375 421112 421615

No robust homolog present in Genebank/EMBL as of 11/7/98
RlSKKLC»ST7JHXVHE*nCPK^^ * 
LMLAAC XTFVTFLALGFPLIQAYS t AC X ITLVCLAICLVIXZLSUJ^KESCEAOALSRKA 
LLPLTI IVIE00PITPKPEI PYSYLTKZ^ALLTSLFLTLRRSSSOHXTH 

CPn_0376 421680 422294

No robust homolog present in Genebank/EMBL as of 11/7/98

FKVVTAIttFNLTEIRi>HGARWSLniSPEW 

TKKNSRKKACO^in^PTFCVSSTLVTJWTWJCr^^ 

ETlJOttJVtJtiENCFTSVH IHAZEATTPLDHTCKPHTKRCWVYLPGrf J flAtWTAAVQAN 
VSADTQFTIXLTODECNPFKD1CXRG 

CPn_0377 423441 422347

sucB-Dihydrolipoamide Succinyltransferase
I KrTTm PNIA£SISEVT VASII« VTB3AX>IQE?PCl^ 

VSEGDWPVCCWGKI EPAGFrTFTT finSQSKZTI EA£ I XCFPQSCVROSPPDOCrriPLR 

OOtfflOGSOCLSACTROTTRERHTSlWaiSRWXSALK 

KOEm^rCVXtXTMSFrmvtEALKAYPttV^ 

wwTPnrnifr.g>rtgTPO i rVI ?rA r JAPEnt J A T * CT .ww- w w iMirMt t wpttu 

PPOWZLCMHKIEXRPVVXXNEIVXAIMCyVAUyDH^ 

5LL0L 

CPn_037B 426195 42344$ 

sucA-Oxoglutarate Dehydrogenase
rvriiFNYFMMEFVGOVYSsiafflwiESKY^^ 

ASTKISffl^IAMLOEOKSOrLCTlYRYYGYLOSO ISTlAPTrDSRFIOEKIAKIIJLDBO 

vpSAciij« Agysvi^ iEAijacapecsLTL^ 

LRSYKOLCXATFTEETLO I KTTOOXI^SLEOCETLVPlCiHLVHYCSALC I SNYVLGHAH 
RCRXi^TFmjCKPYRYVFKEFETOPAARCLZSVCOVKY^ 

KASHtZSVDPIVEGWAALOHQCTAGKEQSSLAI LVHCDAA^SOOCVVYTTLOLSRVPCY 

STBJTLHIVWWIGrPAVPRESRSTPYCTDIXIO^IPVn^ 

VRERTSCPVI IPLCCYKXYQiNESPDPSVTAPIXYPQ I KRJOCS IRELTRCVIXESQ PAPI 

SEETLA5XEXE20E$LNR£7QVUani}PEPPP^ 

LrHHS5Ri£CfPDNFW FHPKIlCrLI£ KR)g^^ 

SGQOS IRGTPSORKLVW&UrV lU/rYSPLYHI^A£QGSVIMYHSPI^EYAXUyEYGYAQ 
QAUCTLVUrfEAQFGDFAfGAOIIFIXrflSSCIQI^ 

lERYLQ LAANWNTQVVLPSTPVOYTR XLREHAJCRX2LSU > LVI FTPXXJXRYP0CVS5XEE 
FTEPOO FRA I LEDAOPNYDAS I LVLCSCK rYYBYAE*lLPODRJlKDFSCLRI ESLYPLALE 
OLVSLJCKYSHIJWFVWI^EXSKNH^YDYMr^ XCRPR55STASCSAK 
LSROELVTCKETLFSLR 

CPn_0379 426268 426765

CT053 hypothetical protein 

KNKXKl^SRIODGNPWQCSERUacIXSEL^ 

KIYEEXERLOliKEJCEIEEYVTPRRSPAXTVYPTCPSMSDIErVTPTrrciDIOPCETV 
EXELTOECREDCAVlVDYSHEDDEr^rSDRNRWRKOCIIDPOANEW 

CPn_0380 426671 427876 

hemN-Coproporphyrinogen III Oxidase

KSTI PTXTHKTLSAIA tACOAWSLI PMLMNGKAPLALY IK I PFCTKXCRYCSFYTIPYK 
SESVSLYaiAVIQEGLRKlAPI0ETHFIETWFO3Cn?SL^ 

TLfiAllPENLTVSYLROWrrPXIffilSVGVOITOOSI LOtXCRTHSSSAArrALOECCNHC 
FSNI^ia.IYCLPTOSLEIR50UCALTLPITHISLYNLTIDPHTSFYXHRKILVPTIA 
3EEIIAEMSLIAENUXS<XTORYELASYAJCPOW 

CERSIWYSHISHYLRAVWKMLPTOgTgETLPinfgPTrrAiJit a t JM ^*PL*rrP7Ttr 
•^LTOOVKLOt^SVHOOCLALWtOCRLFHDTIAEEIMCYSF 

CPn_0381 429836 428037

CT326 similarity

^Pfn<FRALMTAPTESRSSPPTli£TTEP[^PNPIPADrOIPRITISPPSLOVSTVASSA 
EO I3VT XAGCPRS55SASVASCNTY^VCLCGCOESPEPPDSEVRT1.YVNC5WQTKOCAVO 
EU.Y ISEVRCEAVRIXYNDGSCMSPWPISPCItTLJTLOW 

REFLVirfCDA^r^IOOALTOSRHSPRIVVVCrSPTWIOCDFTilWNYRVSGDFTSSLDC 
PfTTPAEKTTXLPYSSCLEGVFLPS r RCPSPTWAVRFCE0CLVANRCEDVEORGCLS0DAE 
P^OLWir;ERDL^WrD£rrDPS5MSRLVEW^0GSPSSDMEINPYP0RCPOVAl^ALYAIS 
I'Vr/.'LAOEWI LA3VHECL0 WICYSL t LMHTTFAVR YFFXLFTNYPO$R EP.FRTAR IVAO 

.i n.K t r.vLVFixtwiJiKLWHPOErLRAir iSA^rraas i vfvectrwmtri ilrhrvq 

WfiJ* V tCCrXPVCTrVnASYRORAGF I ICFLOTVHCCLYLPVniKVLNOIA IQVPR I LV 
HKirn-AWDUIMICnAEE^SJ^aMJkVnOTUIF lLCAPVtFVHLWFFVK5VUUtSRRRRR 

TtiiJH*.! 4)0742 4IOO)h 

v.)i>:/vi.ii,.:* i AM U.(-<iiinnt Mt.Ttivtr.innt<»r.iB(» 
IVTLYLt^r/rrRAVCTLrrWICELViniLtt^ 

l.-;KIIAPlJ*AW«nrLEPIVKHCE^LISDAi:LTCIADPCAiXVnRAiUI/J 

• ■:: fTlAli(U XP: V:'f*rFliJYLPWUHC£nVIW I KKAATUKKVIjTSVi' t ETSYRNVYTFE 

: :U Jttl.ft:7AKU VA; :DL: WPL'ELVLTllOVOSWRTTEDLCSVXgs* tTKVPT 1 PLFK t PN 



TPn_038) 4. ; 4)0741 

CT047 hypothetical protein

VODTTTLTLPHQC^TjnWSQAYAtlCVgA I AjJ^v^ETW 

QCIXPATUCWTrTTALFQCHETKTr TH Jt tltrPtAT)CEWjW YMtMlpPWt*t 

E3CTR£LSXALPSAL5LSLrCWPADROKR X IRLLLORACRVH rXSCSLAJLFUIALAST 
SLP0IUETCIKX4jC7^KlcniJmSDXK£LVV1CXCXA^ 

LwEOCESPLC X ITFlJnOCLYCLJlSIEE^KEJOCHPJfr^^KEXLHCAiJCL^ArrL^ 
KNHV0DP I VAVETLV t RMVWL 

■ • _• .^i : 

V ITCL I RC I XH I (SAOKKOSOCKTAJ RA VF K P AXICV AAXR TVX ICATVWCT AVXKPAVWCTA 
AXXrVAIO<TTAXRTVRjrTVAXXPAV1ClCVAAiaiV^ 

TWAJCCSPKKAAACALACHKIOOOffSSCKRVCSSTATIUtHGS KSRVRTAHCWRHOLmtl 
SR 

CPn_0385 434042 432522

pepA-Leucyl Aminopeptidase A

KTCnEIXYS5PKAJCEI0UVlLCLCXNm 

AA IPlCVVl.TRflLVNWJAI)ETrPKKI^EVAIJiLGKEFPS I CTKVTCXSAIAXCXICtXlJk 

\^w^cvi^FrvvxYoauicsmnvLicxi^TFCsccLDi^ 

VLCILSALAVT FT PIWVTCI lPATENAIDGA5Y1C><KVYAO<SCLSVIICSTnAXClLIL 
Ar^T^ftf jfvCTfiyrwrrprATt.^CA*ftrt^SLgFP.ri/TfTgMWTiMT^rTtf r rmtrtftu 

RUlVXICYDKTtMXACKl«LS5NRACAITAAL7^^ 
ECCRYPKYASCFCVRS ILYYLEKSLSK 

CPn_0386 434543 434046

ssb-SS DNA Binding Protein

KSXCYXMMFGHFA<7ft£A0P££JWTSlCB 
DKWJ^lJOCCSCVTVACDISViaYKSKrXMPOSSLVISV^ 
WqOVCYTSVSVCrtCrALnArA TTrOCKYACYOQEQOYVCn^Pf 

CPn_0387 435229 434699

CT043 hypothetical protein 

NNIWSO^^l^^Q^SBtUa^AXSlJ^DVhn>ONtrTCl UVKXFSLMLTtZDfSD 

RLYVTAPUIX^airORXZALYaaiXCUKLaCQ^ 

ATTOLlJCA/AOLFIinVVICVfRTVCADICACR^ 

CPn_0388 435323 437320

glgX-Glycogen Hydrolase (debranching)

TGAZWZCXCSISD0SSYAFinmCB>ia0«3t^ 

YAJTYlJCECPFTWrXTOPU*U>ICmiIIY110(VFSTO 

KiaxnHAVXLLPXFsTOErAHPPSKSICrPirLCHYm 

EnTLVKTUiOEnrsVXLDVVTNHTCIOG^ 

LNnmAPTTWLDXLRyWVXEMHVX3SntfinA^ 

ASmiAEPWQAmYOUCYFPTUPRWSDMGP^ 

00XYTKQ5PTNS Ximr5CKDCrri£X7TVTYfWK}04E^ 

nwH r»wnr owiTTTriii/cnrTafTneffTwv^nr^^^ PmftfflTtffflQr 
TAJCPTlMKFLCXlLXAFRiaCYTCrXJTlRCn^NXEX SWVDAK3(PKrimpOinAPKXUPK 
AHVYVAJWGAQDQIJaTJKASPgTPYO rVAX&OQCFV POWVATirvSLOPHTTLIAXS 
HASXVT 

CPn_0389 438254 437319

CT041 hypothetical protein

TVFWnC RJYC KDSQR OMSW rrClJtP FjlKTC KE^ 

SEVXVSrTTrVKODTVVEPXIRVLIJanSTTALI EAXGPYR X YCOJVLLOTAXQOOXCVVH 

ALYICIRJCTFYPCIOClJCIEPVtOTASLrFWIOYOCSLYVWUCn^ 

YlJCSVl£XKYLE£XJXEAI£ACXXL£]n , AI*^^ 

FYCWCSAXDWTAJtLVVt^PQCLJXDAOSLLOSNVD^^ 

E5UNEEL0CEXR 

CPn_0390 439171 438134

ruvB-Holliday Junction Helicase

RKSORECSYHTHOVAVLHQDICKFOVS LRPKCLE£FYO0KHLKE1UJ3IJ1jCAAIORCCVTC 
HCLFFCPKajCKTSlJUilVAYTWKCLVtASCPOLI KPSDLLCLLTSLOECOVTFI 0E3 H 
RKjKVAEEYLYSAKEOFKVDZT XDSCPOARSVIWGIAPrrLVCAT^ 
FS ARLSYYSD0OLXEI LVRS5HLLC XEAD55ALLE IAXRSRGTPR1JOWLLRKV/ROFA0 X 
R£»CI>CDVAEX\I>fO-LIDDW3LNEIDIKLLTTI IDYYOKPVGXXTX*SVAVGE9XKT 
t j-rvvrt>rt.Tt jmrrinrrPttfiaMVTnT avtwt if puArvT r crrry^ 

CPn_0391 439701 439510

No robust homolog present in Genebank/EMBL as of 11/7/98
KXLYKOEXP I PKATILSRNlXVMU»ffKCKRfjrLFLCRTSCRSALYSY5»RRILVlXJ4AF 
HRCP 

CPn_0392 439814 440383

dcd-dCTP Deaminase

HSIKXDWIREHAyiAEMIHPPVNGOVNVNEETCEXLISrc 

VYKSWXJPKCFTEOI r IS ITODVCIVPPNSFALARSVCYFR IPRNVLTMCICKSTYARCC 
I IVWH'PFEPEWEOATIEtSNTTTLPAXtYANECIAOVtJ'FESSTTCEVSYAIMCOICYO 
K00CITVPCV 

CPn_0393 440229 440723

<rr038 hypLtthetic.il protein 
KPLTUWCORKPTLM!WU>RSY3ULVWA^ 

VC FRLLEEVM I K5WWV IFStLI OOFVf DR A I OELirr EELRLO 5KVSSLCO0X U2AOBXOR 
0L0LHL0IIWD2AA I EAAL IORLCLI PITTflCKLCVJIPKQQ.TENXD 

VPnJtti'tA 440727 441'm.H 

tlyi.'^S:; C^okiin prot«m ih.^, i V :; in iUvm>Op t i 

kethi PTHijtFFt iCFTi/:.Tr:FtnL:.*o rAi.p;:t J'^rw:;irYK«:w:;KicwwATf*LLHPH 

MLLITL IFCOtCLNIArONCFA ILF/;r*AA:*Wnv;i.l*( JVITI .1 liJEIUICAVALrFHro 

i AiirvAPt* r i*.*vtk ifkpixhm; iv^; i f iy wow t i.-;k^j i o i un^rxKEviaX'itoFnv 

VNOEESRLLVCYLSLSDCSVKERHOt't'VUl r J'YIjf OTH jaVLYLLf; WOffcWKVPrrWN 
U>»^;itTAl«:LLU!0KPtJO^:DDLU-l4J'Kl-VYM|W r ^KAU-UMAAEOPTIi S4 1 1 

IfHW I AT UUtfl. I EOtfTT t PTT WTl^Wttf J] J ,H»JVUlAAI tm I RMVY I HKLYb 
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'.'Pn.oo'. 441V55 441 IT> 

CT257 hypothetical protein

GNOCTNSA LTW I CVN [ IC r/LQCrY3KM£MAC/3FTfllVRLC / YYLTTCDHKKARY XNTLI RR 
PYRLreTVHtCVNIAWVCSCSSRWnfRAl^ 

PEXLA1JNGAP tL YYSHY Z PYPL ZOLICSLTE^TYtXM IRKEKUtSTLSROEFCWALCTH 
HOOtJFOTrATWFSI^TCAXXJVCOPt^OW 
ARWWtGXAHPKDFVNKAiaEPLZINLHSPWFITA^ 
GEPZGIL5WrFKILFNTTWZAHLKOTZSVIERTFPG»^ 

r:.v;; ::: *r m ~/ • '"i; - ztv 'i. r; : mvs : nit j_: 

CPn_0396 444ib* 44Ji41 

yhfO-NifS-related protein

YSMZYtOWAHrPPERGUXFLC^LXBCTYA^ 

SFCOTVLYTSGATESLNLAZASLPICDSHWSGSW 

vmEOZERAVTPin^ZZLGWVNSETaUCADZAA^ 

VUSarrKAAFSaOCTHAI^ZGAIXVSPGVra 

YirKYLDLHOERISOtlLTHRNCrOUaiCARI PCVHZHCAOOPRAIWVSAIArPPLSSCV 

WIAWZBGVACCTfGSACSSGATAPFW^ 

GXXaCWIRUCNS 

CPn_0397 445124 444381

PP2C phosphatase family 
fflPVOTDYFGISDIGRVRAlWetm*aVNL^ 
XDTOSI0J*CTC2Z)CYX£TUatZI^^ 

FHVCOSR I YR I REC^IJIRLTTOSLDGIJCNRYGLPKOSDKVYSYRH Z LTNVLCSRPYVM 

PDZIO<IJCEKra*CLCSiaLTNHVPDXDZroZL^ 

TWLVRZQ 

CPn_0398 445518 445700

No robust homolog present in Genebank/EMBL as of 11/7/98
lEH^ZENSSILfAEWKJaVFZrSVZSAFVVnJCCm 

p 

CPn_0399 445759 446523

CT253 hypothetical protein

CTHLGSSGSYHPKLYTSGSJCTltGVIAHIJ^^ 
EXVmB«ASPGTVSWAPZAI»IJ>rrZZEOF^^ 
A5VWRTOZRHmXJU.ZY0mECSQPI*Tri^ 
EWARVEGYVCANYS 

CPn_0400 446527 447306

CT254 hypothetical protein 

SXXMSKF ILLLSUJVAAIASXN7T ZWPAPSGK7PUajlOVLraCAXXVrSSLVALSVSSO 

TAXLLSWTGXSX^AFIXlfLIJUraZTRAIZ^ 

TOWIIOISia^TLVLPTOEIHTOEVTOENWSt^XT^^ 

GILQTnJMWTRIA^VirSSIIF^IHIDiSuaWVrmrtJW 

SPZALHCtrVLTSLLFlJCZX 

CPn_0401 447884 447495

CT255 hypothetical protein 
KRXWATSIO^GTVRAtl\nra»CFWSLQQSL^ 
AQC^TLVLZUrnXEIUXTnJ^SnVANtAKEKtJUUlAPYZFAEZ^ 



CPn_0402 449012 447888

mutY-Adenine Glycosylase

NFKRPCMTKXAFSEXAKOTPVEAIJCXCTE&nCRSLF^ 

OTTOTM ff* mr ^ T F^r A **»Fn^m l "r^T/7YY t rBftpm rirr?^PTf u>f FT r " r r >f T pwn 

AXSUWZRXWGPYTVKAIWAFTWAAAAVt^ 
«3AU.PHKSP£VIAEALZEI*»CXClCKVP0CHrc 
ZTUO&VAIVLYXX^VVEXRRPXEXKACLYEF^ 

* PL£FLGNLKEORHAFTNKKVHtCPZ Z FKATSLPOFCELHLLSDZOHUkFSSCHXKX KDAL 
LZY1C0VS5RESZGV 

CPn_0403 449009 449710

yceC-predicted pseudouridine synthetase family

NFHQLSNDXRAALQYFMDJFSWIATQ^ 

IERrTSYXVOPCDRVSl^XPSTXCOPSZLWE«JYSZZYEKPra^ 

RIAKGTSGCI£JS3XSKQAATEIJ<X1^XQRKZHK0YZAFV^ 

CCAVirGAAGPSOCEPIJGAYKWDCWIU^ttSTrD^^ 

CPn_0404 450962 449871

No robust homolog present in Genebank/EMBL as of 11/7/98
CLEAi£0KYGKAVLLXAL£EI£I0rnGU£GHlU^ 

gSKOLMADAASCVDCLLOOPFWSTAZASC IAXSSUJETCPCCESKVMVX^SWCEOCAOVC 

SPFNLERICKSPPSLlO/FSLKKNCCEMCZ0l^A5CMNU^ZFFVATNOCSTPIWITKC 

NIXALVALVLSHYOCYFVPATCDPORQJILCNPEVNA ILARCMCKRVOLCRXRGCESSSS 

RYLEUw^CFENSLTKTSLt^DAWrAJERWCU-OMSTSLKHTACLNLORP 

AHPOPOTOPVWSQPSLLGAJtEJlSFVSSRCRFPWW 

CVMF 

CPn.0405 451814 45096* 

CT105 hypothetical protein 

N IQTSHSRVLLXXFSKEFT XRTYRSUaTOYUX^TNPt^FPSPWPOWIAPSSTT 
PQAVSSAVrXFWOGAASSTATTTTASCASALUU^ 

.-'.AGTSGA55SSASMQQQLLQL I LDA l'l US UL-S SVSS EOLOOLLSLVSOKTTSOGCSCCTO 
AGOAASVLLNU^TCSAAANPLGTAASLAQZ IYAAVTSFGAKKTSEFCYWCGETCOCN 
CCCfTCCCPDCOCCCOCFGRrPrCVWKNCCCICCCr^CPAIPL 

»'*m_040n 4419*0 4S2Bri* 

fabI-Enoyl-Acyl Carrier Protein Reductase

» «f->4U I DLT HCVAFVAi 1 tCOOQCYOM* I AKLLACACAT I IVCTWVr I YK I F30SWELGK 
H<C;RKl.'»jrTrU4Et AK tYPMDASFftJPFJVPECIAOUCRYICn tTtlFTIHEVAECVKKDF 

• :» i niLVii«;i^:i^ii»:;txifT^nKCYtjsAu^r:YCFV. t ;LU';tiF»y iMrntrnsTtsLT 

VI AMRAVrtV/f a » MlIiyOCAALESnTKTLAWEAOkPWr; I RVNT f : iAt* PLA.'>RA[*KA IGF I 

hhmvoyyukwap i rRAMHWvruvAArLAnpLAiM rTncrryvntiGAwvMn k;pemfpk 

u: 

''I10MH7 4M7 r i7 

HAD superfamily hydrolase/phosphatase



F0Al^lJCCaNCAbW^7^.«(<U.Y3KSUP5^U^ : LCDCMir«ATALfrV£SLAPYOTY 
YRFSPTP rA0DWIYWOr<W*P**AK£BE£UTfR^ 

CPn.0408 454090 4545«l 

CT10r hypothec iral protein ...„_ 

-rwAKr-^-- ~* - ; : " vht-.v • ~*rvrr:^ 

£FPPinviwHu^ufiiRw.*^ ; ^'"^*'i-'i<A;^R * »*K»vrcA*L 

CPIU0409 454645 45512? 

CT260 hypothetical protein
NTTVTUCWfcTKfTJCSSOEEPF^ 
LPYOLHESHKASTARIiKIXNRXHDircFCWaTC^ 
0TZJ0.VCDSFSHAXGLZSS(3MraXtCIJU^AU}EQQEKRNC 

CPn_0410 455087 455933

dnaQ-DNA Pol III Epsilon Chain

DVWJXSWKNVKSSOTKnVLIMUltl 1U*VIERCRI t EZAAYN5VTCESFLTYVNK X 

PX POCASXZHCZTtmVUAPKFPCAYEC7RKFCS£DSZLVA>(N»X7I7Pt^^ 

SL£PL7^^ZOSUCUAQICYltP0t,PIOQ^C7YlAGV^ 

CDt^POGVUlUjQOSYHPtCVFIWFCKYXCCPLVOZPKSYrE^^ 

AIALLH0PT 

CPn_0411 455794 456609

CT262 hypothetical protein 
RHQSRYSSZTSTIMXLTAAFSPCPtfDZFTJrtlSFIJCDPOFH 

LCKLYYPKAXLZ FMPYQKZLSA ZI^ICVtXX^ZHCERrSYt3WTUUDFCfI>miUCTZ 

FPLPLK^IAJCYVPHATVTlALTAALJUCSLICSLraPrrA 

GTYXNKTrnLSKraiOUUJlHXMCANECCOYT 

CPn_0412 456515 457246

CT263 hypothetical protein 

£PZ5TICXPFKYIJCICKBXYZC5GRP«CNAVKITiaCZIXrVADm 

LYSYKCTOYTil^LYTVHVWOT AVO-NAWSYCOAYTO^ PLOOCY 

TinOANLTTtTTPPVl^EDPPYIFDALPOSLPKSSL^SPVLYWCFHKT^^ 

XASOAA£HMZPCSFLKZTSiyifTVreDCPFSRI^lVSOXLT0Tt-VC^ 

LFCP 

CPn_0413 459209 457227

msbA-Transport ATP Binding Protein

VFt0OJiJCAVXJtKI04HLVZl^SLIAZLCU.TTSS0^^ 
GXtMCVSS^t^IZAa^AXSiaiSrTLTVSDATT^ 
VSRPItCLAlFUCVMrKAVTtrFOIirLIKTVVAXRVSIOl^ 
ICNLSSRVKTOASIAIAVNSUCIKYIOAPITFILTLCVCl^IS^^ 

prvreAiuaKMiJuquoKSODsrsgvLYt)^^ 

CEK£AAYCLiPRI > UJfrrASIJTArWVICIYlCFAI PPCEUVFCCZXYZ^mXOfQO 
D«TSZlC(aC>AAOtPirC\U<HPOLHSOJC£R£ZCriC^ 
SFTXJOOSEAXiCZVCPTBSacrrLVKLLPRLYEVS^ 
UNPFIJTZnVWraa.7XXncZME£EAVlXALI^^ 

OOQRIAIAJtAUJCWIUU^TSALDAISENYIICNI TGFr. KGQCTOI 1IAHKLTTHW 

VDMVLYtElCOXZAECT^IXLLQTCPEFUO^^^ 

T 

CPn_0414 460203 459172

accA-AcCoA Carboxylase/Transferase Alpha
LCLRIVCIIWILFIFtCmiLKai^OCOVVEYEX^ 

RLOXLXEKZ YSDLTPWERVO ICRHPSRPRTVWYIEGHCEErVELC CDR TFT U}D PAVVOCr 
VnCZOCQRFSUCOslOCKZn , A5RLKRNrCMI£PB^lUCAL^ 

AYPCLTAEtSOOGWAIAJatLTELSRlATPVI ZWXCCCCSCCALGMAVCZSSVAIfLIHSYY 

SVI5pnX^IUflCDPKXNSEAA£KL»OCDJUC0FCriI7TVIKEPI0G^^ 

VRtFIIOEWIJUJCDIAIEIIXtKRYEKFRSrCLYrrTSESCPEA 

CPn_0415 461522 460221

CT266 hypothetical protein 

SOTCrLPCLTLZFVI X IVWCNAFLXKirVIMCWSRLOHCIEVSONSKTTSOVTOFrYAC 
QOKTLROSVLICZ FRYH PLLX X HO I ARAVYLLHAL£ECEOLCLSFLNVOOYPSCAVCl^SC 
OCFPWKCLPYPAIWAEFCLlXl^IAErYEESOAWSKMSHFOOAIJ'DKOGWrPSUiSOE 
NSRliKEXTTI^OSfWLCWIHPr^SLXDPALCFWMORT^^SATVAA^ 
SSCOVCVIAYCPCSCOISIX^FCCCCrAXEFVCOKSHOTTE ISFLTSTCXPMPRNTCFS 
YXJtOSYVHLPXRCKZTZSDXOYRVHAALAeATSAHTFS Z FCKCKNCOW0GPRXJ15CSLD 
SYXCPCKO t MX LCCNOAXNX VSASPYME Z FALOCKCKFWNADFLXN Z PYKUGVKLZ FEX 
KVTSexCRFFTlOW 

CPn_0416 461871 461557

himD/ihfA-Integration Host Factor Alpha
EAiJiNMATMTXXKLISTISQDHXlHPriHVRTVIONFLDW 
VERKPKWCRNPKHAAVP XH Z PARRAVKFTPCXRMKRLI ETPNKHS 

CPn.0417 463047 46::44 

amiA-N-Acetylmuramoyl Alanine Amidase

REXOOCLTKYUJTKOLRSHlSRLFVRYSLFMSKOI^FFALCVU^PrrAOTPW 

R^EVtFIOTCHCCXOOCTASKEIJfYCCICSLTLSl^T^r/URM^X 

LfiXRV A L5 NROQCOVF IS IHCfmS^NAAAFCTCVYFYN^r/QSPTRNRMSEVLCKNZLAA 

HEXNCILKitRGLKTANFWIRDrrSMPAVLVETCFLSNSPERAALODARYRm 

VHNFLT;CP5F0XPKON I AK IRKPOZOAN 

OPtijHlH 4n44»l 4»i2'»51 

murE-N-Acetylmuramoylalanylglutamyl DAP Ligase

Mut j( u j j tt^Ax t yi :»cvn rt evrnltr dsrcv.tvcd t f r aj i kt/jp va vajf avhalanr 

A ( A t Az;:a.YMI'FI .'WO t tTrNI££LEAEL3AKYYEYPnSKLHT t'r/rrn»r:rTTVTC LI 

kaiuj k; y'jk 1 1 a a * t r n i r U":u .v r r dt fttptpallok'/u\tm , /i« jwpnATmEvrj; i 
' :i A'> aivAvniM rrAvi .tn i ti ir j/nrrrFFr/vAAKAFLFaLvi'Ka »wv trrrosPYA 
: X« - 1 fj :ajcai v itv% : 1 1 -**-v\uypati> roL; : ; : ^*TKYTLr/GOOK I a< f r'*YWYNL 

I AA r 1.TVI <A::i<K* 'I M .hU 4 J i*'JC If Ili.^PI'l* !H LDPVLMOP';* VY I tr/ Al nTHAl J*M .TCL 

i iKu.ctr > atui vw. ; t a ho *:;Ki<FLMA\»WBRYCFA , /'/r;ONi*i?:;KP(+j, t vnr hot 

l^:aCNYK t F. tU<l\W ITYAk : I A.' 11* I VI. I Af JFC3 1 EAYO I FKI R/T/AFIX jFVTA *KVLA 

:r/v 
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CPnjMl'.' 44M1T 444B'>« 

pbp3-transglycolase/transpeptidase

QLFFNTN IWH t P0MCV3VTYPMSYWCRSTC. tVMVTALYALtVLRYYK lOZCEGDHWXAE 
AUX}HEFrVRDPFT?RCTTFANTTVWCaD^ * 

ILOF IECCTY5DIJUCIJ)CKSRYCia»YPUIJVSVH0RLSLHHW 
ITDVORSYPFTyttXCOVUrrLREI lu3ECTCXAFPTQGMEAYFNH XLICDVGERKXXRSP L 

: ,\ . v ; ^r*Tjr .- " " r u : : 3 rrr /::; *r.vrsr» i : : Mr r : :.";.*• : \i;.vlFE.v:Li'~ ; 
k" iH-1'EF.rt! '"tt: "i * ;i' r :: : ; : .pl r :i- : **. : .*:myha [ . ■ . » .V"/A0l-'*rR I : f 
VAWCCKnJU£raKTGIEIJ>S£ASGLVP*^ LAT 
G I QHVQ AY A I LANGGY AVR PTL VKX XVSASG EEY KLPTKOCTRLFS E£I TRZWRAMRFT 
TLFGGSGnUkSFKHHSSAfiKTCrrEXMXHGKYlX^ 

YEEWNRSPKQGCTR 

CPn_0420 467130 466B24 

CT271 hypothetical protein 
KSFPMNKSRFUUfCCLCFCCSXJYririKXC^ 
LlDXIERPDHLKeiAALPEYQYlXYPSCESXSLLSYILP 

CPn_0421 466007 467108

yabC-PBP2B Family methyltransferase
SZLMSEJtAHZFVLVEErLALJAORPPCTTRIKrr^^ 
OAlAZAnOUCTFQOPAOTSHASFEIttJWPTPtU.^^ 
3EKEELn«MDQTQSLSASDVX^LKEEElX»irRm 

SI0OVK£ALLCVFPKYRFMRXX HPLTLIFOAXRVYVNOEDROLItSlXTSAISWtAPOCRL 
VI ZSPCSSEORPVIOWFFKEAEASSLGlO/miCVtOPnOCVI^^ 

CPn_0422 468233 468784 

CT273 hypothetical protein 
GIJWVEITWSTSIYEOKAS^rVSOFJUOri^^ 
LLQ^KSHH*C FSPP* « rrXOPJ'STP ^ 

FVQG 

CPn_0423 468788 469216

CT274 hypothetical protein 

LYtOXCSGOALAVLIX3AIJQ40QOKXJ>TUK)CTKAX*FCLQRZ EEA.TA1ATVLSSCPIPAI 
ANOA£AU^SYSKATXnOAX.VR 

CPn_0424 469528 470961 

dnaA-Replication Initiation Factor 

SRCNEIFSPSU*CTATCIW!SFIKKESa<LTCOTCTWro 

IQVLEETOEJURI^VFNTPVt*miimKJU)l£S 

ASQKESNECISEVFEETOTFIXJOail^YPJn^ 

FIHOXJVGLQKntlJJiAVOTTVRXHHIO^ 

U3IXLVDOIOrLONRQNFE£Er OfrF LTL DILSKQIVITSDXPPSQLKLSERI IARKEWC 
LVAHVCIP0LCT3IVXIU3KKAEQlS3XXPNEHAfTZ^ 
FCXttTETTVIurniCELFRSFTlCOltlSNW 
IAKnJUCTtZ7^SLVAIGAAPQICXHSTVLYACmE^ 

CPn_0425 470965 471364

CT276 hypothetical protein
FraCTMPPJrTGKGPFEDygrLYEEETSSPSCT 

KKZTEE5SXJ(^STPnESLLPZ^PCTTLSU«V 1 F M^LAFIRXXRXOGTFEGXLVSX 
GXXXIGPKGWKADXOLOEAI lEGWCQNITVSGKVILRaCAI XHCOIOANTLCVDECVR 
ZLCTIAIACITDHSERERDL 

CPn_0426 472111 471536 

CT277 similarity

MVLFSIXJ?KirYtXDQAPCAYFCSNCL£XI^^ 

SOLQArSLYX^OTAI^VYAJUCBaOlPAI^FFSKSIAFELASIJSTPSC IAYITSTISR 
KIWEVAKLEXLLRI PLWPWL^KKROrEXUTCBCXCFL^YPLSOKiWOTrvCCSASPL 

VSISLFISQNDQ 

CPn_0427 472153 473715 

nqri-NADH (Ubiquinone) Dehydrogenase

AVCYVFERVEASTfl5ITOI^FINSU«LC00OICYORr^ 

PFIWJSVDVKWOCaWAIJVAWArWNSCWSmSSCN 

FVYKEIKIVPILWECLKIFIPLLTISYVVCCTrCVlJAVVRCHKIAECU,Vrc 

P PTI PYHMAALS £AFC TWSKELFQSTGHN ILMPALSGIUkfT»FFTFPAXKSCOVWVCSNP 

GVZlQSllUaiNSSTCKVLIDCFSOSTCtJQTl* 

VXHSOFSUtfTETHPGWLDNLTLTOLOTT^ 

KFSAGNLFWCNI IGSI^CTSTFACLlCAIFXIVTCIA5WRTMAAFCtCAFLTCWLFKFIS 
'/L rVCCJNSAWAPARFF I PAYROLFLCCLAFCLVFMATOPVSS PTMXLCKW IYCFFI GFMT 
tVZRLINPAYPBGVMI^ItXCNVFAPLXDYFAVRKYRKAGV 

CPn_0428 473719 474681 

nqr3-NADH (Ubiquinone) Oxidoreductase, Gamma

NMSHC3SKHTVR I NQTWY IVSFILCL^LFAGVIiSTIYYVlJPIOEOAATFDRHKOMLLA 

AH I LJFKGRFQIQE3CKEWVPATFT>KKTOU^AT^KVSgVSYPELELYAERFVRPLLTDA 

OCKVFSFEEKNLNP I EFFEXYOES PPCCOSPLPFYVILEOTSRTEJMSCAIWAJQXJTVO 

ALIF^ IKF ClWSPrHC^fLCVKWIgDTVLCTAWYOgOT 

FLWSSCnTNFATTDLCIXVVXCSVirrTXCOS PKALSAZ D3 ZSCATLTCNGVTEAYVQSL 
ACYRQLLZNFSNLTHEKICTCE 

::Pn_04J'; 4746<>h 47511 4 

nqr4-NADH (Ubiquinone) Reductase 4

K ENPPMTSKKir/ltTYFFDriXnNNOILtA I IC ICZA LAVTTTVUTA ITMtl IAV.1 IVmCf, 
W*FV:;i.UU<FTPnsvnMlTOLin;nXVIVIDOFLKAFFro^ 

' :nr;K.';LARi rvTr« i rAFLzr.FAscu jyitwvllv icv 1 p. eutohtlmi ;fr i t pofvy k: et 
ii nr/;voriL: ;i *vi ArrjvFFLU; im i wlvn [r&^kkpkr 

•lii.'jilll 47* 12 J 4Vt f l)'t| 

nqr5-NADH (Ubiquinone) Reductase 5

hxwi /;avi wi «vh ; i lluaaf ion i llanfuxc^i ,Ac:rrR\rrrA*k :u*«; valvltvt 

**■*'• l*rt-*Vl lAKtf* rTKALTW I::rQ^-VNLb*FLEL 1 1 F IWIAAPTQ t LELLLEKWRNLY 

\:;u u hi .1 *w avw;a iu > wlpc itrsypf rrw i f:x» w xyzuui a t v i latzkeklay 
rr* i irni »* ; t :>* iTT»:r . i ahafhslta: t d i ;;ki-.';ak i yuAHiTTEWRtrrmpLKC.: j 



.:KHOPSt.-*KAPTVRR.;L 

IC IMTTLrKYVPRSROW PDTLTFLKRY S JVLi-HS ENS i-SY R I F AXVXA XL LT3 LAVAFA\T 
LFSCECoCLRLCALY rCIALAICVL-TT V-.TC I AJK I ATAJK K PPS I SR I C IV 



CPn_04 52 



47<817 47K5I4 



CPn_0433 477327 476929

gcsH-Glycine Cleavage System H Protein

RTF* ILYCTXYRTTSRJCVKWYSDYHVW: LPVM ERWRLCLTEJOCWiLCAIUIVOl.PSVC 

5LCXtr^VLVZIXSSKSAZEVUPVSCEV:0ZNULVWP0KZNEAP|X3^^ 

0UDPSNLSUCEE 

CPn_0434 479471 477276

CT283 hypothetical protein 

RW^IYCQCLFCWXRDPAWTFSliSrrUFYC^ 

CVSCIG^CWCRFSRXCSAECTSRP^VrrTASCWlYVEKDFMAMKKrr 

HHFZNICtJiTOTFtTTRVCEKLTUCVYHPCEXIFSXEJCAYOPY 

Aj^LLCILK\^IE2«ISKESFLAIUXLFttXmPHY 

YTKISlO0OrrSLCTEEFWSYFQFZXZSCSCmOfyR& 
llfr tf OGItD SIQVPTPXPICEYSrKrKOZXKAJ-EVYI^ 

IJUCEPPXVCWSIimCRVWDIAATVPHWLHWO P O V TICQSY K 

ouucvixeJEvutYSc^rrYYTirvrasFE 

MEIUJSAX^RTItYPGSCASZiiCRRLtflCVNnQOnUjCRKUCS FSWSLORSUCTTSKjORZL 

POEFDRIFS>OOAa7YSSVrMSPNECPCYWLSHLLYimPASVDKiJlJ^^ 

YMERFZEOGWH 

CPn_0435 480908 479475

Phospholipase D superfamily (uncleavable leader peptide)

mrrmr vy. m v . 1 1 1 iFtt^t"" -g^—gw ^FT^ryTTTTiPTPT BftrroiffTj^M JTfT" 1 

PNIUrPYVFT0CPPSTSXIAPNVIO4fZXt«5I IKIOf CIUXTrKFtmCTKJDrVPDCVD 
NPIOJVSCMtRPLAIia3WI10^Rn > AfXXOLR£3STlCXOFA^ 
ACPPLTLEOA EI T V F PUf UIWUJL VLVIgSKIRIVraPHDKOlWPVrO^ 
?fnAHT1YTTPrTYT l I, M * r "T 1trgwwr:WT gT ct»»T T 7p Y ^^7>»T w ** r r 

YOTrYPLJ*XWFCtttJCPY!*V5rYEFArVETO^ 

OYZSXVVXESPeVAAMNKVFNKDICLSI PVSHGDI FSWYTKSVHKTLCHLOLTYWA 

CPn_0436 481633 480902

lplA-Lipoate Protein Ligase-Like Protein

FYVCTMCVRrVPSGKSSAASWiAroRDT T f^T flTTT EL IUaYlWIMCSLTYOHPMim 
FZXWADLGUlAAWPTOeXVVPHXroYArSVX^ATKP^^ 
LEKmiOCailAPCDENSSSP^SGNFQtiUCraKyi^ 
LFLg^SmORTIXPEVIXEIIgJIQIHAfFPLCLXAADEVI^W 



CPn_0437 481810 484350

clpC-ClpC protease 

AJtOBVE}O.ICVCPaOVYGDPALTC1tVIQCSFCSANEEA5LLE»^^ 
DKNm^AXJ»YGYtCTD<VRCSiai)PVXCRSSEyERLI 

OKZVVHPPSVDETIEIIJtCSJOnCYEEKKNVFITEEAI^ 
LOP«aUIVmv W lH LAJP 'I CJ^ MKLEAgIEOTTO^ATO 

SMKOEWENHXEEK0VPVDEEAVAOVVSLQTCI PSARLTEAESFTT f jtt nTTLPJUWIOON 

DA\rrSICRAlRASRTCZ3CDPNRPTC^FLFU2PTC^IC£LLAOO 

SETMEXFMTiaiNSSPPCYVGHeSCKPtLTEOVnuaiPYC^ 

ECCJU.TOSFGRXVDrRHAJZIKT^NLGADLIRJCSCEIGFCUCS}0^ 

KLXPEFINB£^E5VZFRPUX£SUEZIK1XIKKLDSR1J04Y0K^ 

HSPOCARPLRRVIE0YLEDPLAFT J.I Jf FSTRQEAIWLJlATtVE»WVAFtRgXgZDCML 

PSPHLtS 

CPn_0439 485455 484334

ycbF-PP-loop superfamily ATPase
NLTLPIO>PX>VREII«XTVIVAMSCC\mS5WAYlJ1CKFTNYKV^ 
SSTTtZ^CSVERN^t^LOIPYYTVSFAJCEYltERVFARFUCEYStCY^ 

CEMOar/RAZAAOAAI^AEKKOSTCZCFZCKAPFKEFlXXFLPtllCTOfV 
OQHGCAJTOTZG0RRGLOLCCSEXPCYVVCOT 

WrTPPK-^HCSAKVRYRSPDEACTIOYSSCOEVXVRFSOPVKAVrTPOOTIATYOG^^ 
GSCVZC/PHIPSEC 

CPn_04i> 485523 486077

No robust homolog present in Genebank/EMBL as of 11/7/98
I ZSSMIP//LFVSSTLNGWPSSLPEESAOI^rrNKEIVALCEICNVFLTHSI PKHZAAIT 
I LVI VAXACTA t rCLCCYSOS Z LLZAVC IVLT I LTLLCLQALVCFIKFIROLPOOUfTTV 
OF IREK IRPE3SL0LVTKAORinT0DTIJCLYEELCOL^KEFKLOSTLYQKRFEIJHIQVE 
' KTNQN 

CPn_044') aijlHOdi 481.740 

No robust homolog present in Genebank/EMBL as of 11/7/98
LATlRai5MAT::VAPSrVPF.::.':PLSltATEVLHLP»JAY ITOPHPIPAAPWETFRSKLSTKH 
Tl J ! FALT'JLLTU OT I . lAGYAf ;*/TT!NW I uyiUiUH IVI.TI. I LALLLAt PLKNX0TT7nCL 

iBRUtbu I "5 ic:a ifvoryi:lkf.tp i y- a/uu •elttonoektp i lne ieakket: ionlcl 
y tTBt^r;FLAOKorKRKi"*W':F;4P:: i kiiu'iknpvi r.Fra: 

'Nii.lMV* 4»t».i«7'i 4M/H1H 

'TIMI7 f.yptrtMtr i.Ml |>tor»(in 

I'HH r lc:34Fkllr 1 1 aakai aivi.Trr i v i vw jAi .t; 1 1 jkjcat j'Hr-pi'H i*K:;Acwri JCVNO 
AFUfKU-ff/riuYnuYDr n , FtrrLi-tTHi:2.*i.i>*;rrrr/ wyu/tywKa.'U't^nropwou; 
watfolttynyvluiu :AYTLr;urriM>«: 1 1 1 &\t.vt»m i r ek wni yoi wusratYOAt 

KKI ."A r fV/ ( KCTIUtOFJCAWM/A W.*:r KA71 Arl.TI ; I YI-/HF:; I rvYH:T: W «UXAY 
rltpfr mJtnwLtzx*: t r vrtv ;h r i gawk.



OIFYArSPNLCVTSTILSMt 



j;R5YNADUVJCT?F 



CPn_0442



4H7TM i*HW 



CPn.044 fl ~; it- *N *7* M t- ■ 

IVYPyWVVPWtW^ wvtj i ~ Mi n i** AW? IPS PSOFrwEM! FAPKUwntKrY 



SSnip^^ 

IWPI I VLA lAAG LIAGAFVAS SCTMLVT ANPMFVHC L FTVC k^TTlSLMCLTLDYFRJ^ 
UJUtEJOPrOETAEP ILVTPSADQAKK I* v EK*^ t ^^^*' a *^2S^S225£S?« I ee» 

A<xsFrYssiOTCHRRsrcsi£Rnmcre^ 

CSGWSAn'PXHPASSRSPNFSTCTVUffCPWKOOTSira 
QQQQNOOEEQWQSICKIGGKSNOSL^ 

CPn_0444 490266 494507 

pmp_6-Polymorphic Outer Membrane Protein

KAFPORHMKYSUWXTSSAI.\ff3WPUiAWmJLSSSDW 
^LTSDVSITWSAITPAflKSC^^ 

• TAU> FSGF SSLL I DSAP ATCTSCCKCAICVTVTCCCTrATrTONASVTMKOT^^O*^ 
SAYS IOLAKTTrAAIXDSNTSTK?OGALCSTA> f n V UQ A5C T V T FSgff ATDieGOTngK 

OGAICCYIATATtJICIl^ISOWEKSrrS^^ 

S^XYTErHJFSUCCWTW^ 

SlSsAWEXOTAJU^OSCSVSDKTO^ 

rCSLTTTWAGTSO^YTETEDITLTCS^^ 

LLTSGNWVTCPSNSSAWEO^XL^ 

AmTKCAiJitanTLTm^ArrACGAiYrEra 

KGVTSITKNXALVFSGNSAT^^^^ 
INOTAJCRSGGCIYAPlCCWSGSESMri^ 
KGGmiADSGEX£I^AIDGDI?TS3^TECT^TTO^ 
YDPnmWASGGTIEttVINPVVXAIVFPPQPIQ^ 

caci^soi^iPAMrrTiu^NiJWJCWVAajai^^ 

KANLKLPFLOLSSTSCTVKtXOFNPX PSSMAATOYCYOGSfcTrWICVGAaJKVrXl.VA£W 
AIOTFKPEXJUTWmiSLWttYVNXHSIQOEIATAK^ 

KEHAGFRLI SRCYXVCGSMTTPCEYTrAVArSQLTCKiSICCnf VVSDX ICSQVYACSSX^AQSS 

YVIPUiSSIJlRHVLSKVLPEIJGCTPLVUiCOVSYGRNHMMfl^^ 

FAVEVGGSLPVDUfYRYLTSYStnfVKI^ 

LTFKHESAKPPSALLLTLfcTYAVOAVRDH PHC LTSLTNGTSWSTPATNLS RQATTAEASCH 
UOIi<CL0CFASGSCELR55SRSYKAICCntYSr 

CPn_0445 494739 497579

pmp_7-Polymorphic Outer Membrane Protein

p|Tp^uerrrt.gMitggVja>LFFggTPt J FgSLSlVAAE\ffLDSSNMSYtXSSMjl'l t IWfMI 
0AAAJ3TTYSLLSDVS FOHAGALG X PlASGCFLEACCtJLTPOGNQHAIJtFAFXNAGSSAGT 
VASTSAAIONIXrrffiFSRI^IISCPSUiSPTCQCAI^^ 
NCXWWTKNFLtJCTSOFASFSIlWQAJTGKOG 

QGALYSTGMCS rTDNFOVI rDGNSAWEAAQAQGSAI CCTTTDyTVTLTQgCNLSTTHNTA 
LnQGAISGtlCySISAffiF^FOSHISGSSA^ 

NQVTNSST5TRNAINI IDTAKVTS XRAATGOS IYFYDP ITHPCTAASTOIUfl2IXAX)ANS 
EXCYOCAIVFSGEICLSPTEICAIAANVTSTIRO PAVLAWKLVUaXTWTFKDLTOSPGS 
RXUQX3(2TTI>SAXEANLSUGIJWNLSSW 

FYENKNUCSASTY PLLELTTAGANCT ITLGALSTLTLQ EP ETHYGYQGNWQLSWANATSS 
KICSINWTRTCYIPSPERKSNLPLNSU'Cm'IDIRSIhPLIgrXSSCEPFEREIA^GIA 

NFFYROSKPTRHGFRHISOfflrAtfllTATO 

ASLYFHHTECLFOIANFLWCKATRAPWVLSEISOX X PLSFDAKFSYLHTDNHMKTYYTON 
SIIKCSWRNEAFCADLGASLPFVISVPYI^^ 
KSELINVElPIGVTFERDSKSEXCTYDLTIlfYILnAYRJlNPKCO^ 
LAHOGFSVKAANKFQVNPHMEX FCOFAfEVRSSSRNYNTNlCSKFCF 

CPn_0446 497602 500415

pmp_8-Polymorphic Outer Membrane Protein

LIEPKHLSMXIPUiKLJ.ISSTLVTPILIJIATYGADASI^PTDSFtXlACCSTFTPKSTAD 
ANGTNYVLSGNVY XNDAGKGTALTCCCFTETTCDLTFTCKSYSFSFWrVllAC SNACAAAS 
TTADJCALTFTCFSNLSFIAAPGTTVASGKSTI^SAGAI/^ 

5AITTKTLSISGNTSS ^TFTSNSAKKLCGAIYSSAAASISC^^TWLVFM^l^OCCETCCClAL 

CFEASSSITWSSlJFSC^ATDA-WICCCAIYCEXTCETPTLTrSC^LTFAI^ 

OCAICAHSUH^AAGPTtPSNNRCOTAACXr^ 

STSAFTSTRNAI YLCSSAK XTNLRAAOGOS IYFYDP TASHTrgASDVLTINOPDSKSPLD 
YSGTIVFSCEKLSADEAKAADNFTS I UCOPLAI^SCTLAl^GNVELCWCFTCTBDSTLL 
MQPCTKUCACnXAISLTTLWDLSALECNKSVSICTACAKICrrrLTSPLVFODSSCNFYE 
SHT INQAFTOPLWFTAATAASO I Y XOALLTS PVOTPEPHYCYOCKWEAYWA PTSTAJCSG 
TKIW/rTT^PWEPJlA5VVPOSLWASFT0I RTLOO IMT3QANSIY0QRCLWASCTANFF 
H KDKSCTN0AFRHK3YCY I VCCS AEDF S EN I F SVAFCQLFGKDKDLF I VENTSKNY LAS L 
YLOHRAFLfiCLPMPSFGS ITDKUCOI PLI LNA0L3Y3YTKP1DMDTRYTSYPEA0CSWTNN 
:KyU£LOXlALYLi>KEAPFFCCYFPFlJCFQAWSRQOW 1 
PVQIRLEKISEDEKNNFEISWVICOWRKNPRSRTSLMVOTAOT 
AaWLTL^PHVELSCEAAYELRCSAHIYNVDCCLRYaF 

nPnJfW 5005-lt 503351 

pmp_9-Polymorphic Outer Membrane Protein

P/KPP I AkTMKSSUMFU .135 LALPUILNF3 AFAAVVE I." ILCPTNSFSOPCTYTP PAOT 

TTiAcrrr nuircrrr.-: t thai u ptaltasc fk ettgmls fqchcyofllon i dacanCTft 

MTAMIKLL^FUCFI7YL:*Lt OTTHATTCTGA IICCTGACS IO^JYSCYFCONFSNDNCCALO 
f -;j;iJIPKLTKAKNKATOK&.lALY.TIT3t;iT t MI rTl/CA." FJtEKTAANHGGAI YTEAS 

::f rair a i :^ f r tttt ivtat: tAT»*TA t YC2:>Tf. apk r-VLTLSCw »elnf ton* a it.-woa i 
rruiL/u:.* % ;m JTtNNf.A t ctaaputca t a t al^ljusauxid itfeojtwkcas 
:::s/rm*n:: wv mthak t vouttziorwr t vpy™* itts :taau;dai jjufjpolacnpa 
/*/ rr:vFT» ;FJ«tj:KAF-\A>yvEWLK;Tr cgoPLTi J«y ^uiuwrarri.vAKSFSorirflsn.L 
MiiAr;rTiJ-:r*\n; it inki-vi hvt^lkctkkati jcat/a w/n .■■« L:ii;LVDPnnNVYEO 

V;^IIV/F^L'ri.TA11H'AJIIIIltDLAAJX»LKKNPUn^;VVi^ 

ti .Twrrr iynpni -kp Rt :n .vahti m ::;FVDvn: ; t vjWKrrmz vCTWi rwccr; t i"NF f 
i ikixtk t f iKt ;kh ii i : :vvv wrm Jtaw L i t aakto Lf *;kuk wt f i mknrasayaas l 
ik juhlati^::ii:i j*yi .i ^:Kj:Eorvu*nAy t:n*i Y::KrfmKTYNTOATKf;c^*orNDfiC 
ai,ki a;:: a j 1 rrAuaiw :i j'iiay ftpc kvka: :y (iiow.tk wtittlvk: :FDfv;ta. invsvp 
I'll tfkh K::i'MKi*A:;Yi*Arv i v vai vykkhi >i* TT/.t.L. f rirm vk-to ttni**:W)al; ioha 



O;FAaCBC0HRHT£2£CWKK I AFVKEiAKt i Vk* 

CPn_0449 507231 505330

pmp_10-PMP 10 (Frame-shift with 0451)

Kron^ISFSNNXVOCTTACTra^ 

rw effm gmrAJT/JLCMLTSTLICQ PVTLTAGNLVLKRCVTU; I M>f ^tft AGSSV1MPAC 

faffOnrfifVtH-SAi^ATTTTVPAVPTVAT PTKYCY0CT>OnVVDt7rASTPKnCTATLA 
S^PNPO^VWSt^FWXOA^ 

jw^b gy pinfCfg/AIQCUU^gTCSSKLISFAFCOIJCSDKDFti^ 
HrrttSCriCCXXnCLPC»WSHKPLVLECOtAY 
KMLQASSHSYPEYUCrtyirAPYIlCl^TriRCDSFSEKCTEI^ 
KrEKTSDCNCFSY^Tl^YVTOLlIUroPICCTTALVISCUSWITYA^ 
YAFSPMFEWCOr/TEVRCSSRXYWDLOCICFOF 

CPn_0450 508121 507180

pmp_10-Polymorphic Outer Membrane Protein

SCFMXSOrSV^VISSTlJCFTSCSTWAATAENICT 
TXTCTITLO^LTOSAALTICgXTSiyiTESIJ 

^mresisrEQacssATcaatrxAiCATcm 

CriTQrrSLVFSOfSVTATACMQCMI^COAiTVTXSW 

usaaoavspFLTX 

CPn_0451 508156 511058

pmp_10-PMP 10 (Frame-shift with 0451)
rremiKiLDScmnttrriJcmoAN« 

et jLurrrr-C PnpSPlCMIDSflfTTTICTSATTYSLTC 

LTFTCNGH5 tJITCTlWiC^HACAMSl!TWIOfLTFSCFSllSTVSS?U 1 1 V I lGOQTt5S 

AGGVKLENXIUaAnaCI^STAaSAIIQSASF^ 

RXAlWICTVIun^A£TSaGAIDI>EOTSXXAaaCll.YFE^ 

Ex^iSKNm^FASWAxrsocMKAnajasTO 

n,cTf S^ r i t - MI ' l ' , vlum - 1 1 ii^iut>iu» wiMiGSMcauT8XMwwrif ii ufiwi 

C^SSOVlJCXMeSAfAXMPirCGTXX^CCTLTAD 

KBVllXSTSFSOEACSX^AffiSCXTLSTTACS IT ITNLOINVDSLCStJOOPVSLTARSASlJ 
KVIVSCIUIUDIirAaTXSWSKroi^LLXXTA^^ 

y^rYJ09a4Wtmi7rATimc£ATATWrmr\^ PEJWSALVCWrtJWVTTOXMLOOtAr 

EIGATCMiMKQOFWSSMI74FlHICICDE»R^ 

LFAROCTCFIAK^inYCXrrUTKHSHTWP 

>A3TOrSHS»0WrrWrTSU'ESHlSWSKECIAOC ICUL PFVLS>aWLFrrTTPOWCVX 

MVTVS0MSFFESSSXX»SrSXCRXXl^IPVGAXFVrXDIGOSYTYItt^ 

POSTAWMSPDSWiaRgag^ArLLRCSiayrVY^ 

VCSTXLRF 

CPn_0452 511304 512860

pmp_12-Polymorphic Outer Membrane Protein (truncated)

FNErariLRNTLTCSAiJ'LAU'AAA^ 

f ^ntM T jannoagEnAGVTIKRSEaiLFFMa^ 

LSNFSYIJ^»PIXPOaOWYSLSSVHIE»SEEmCCNYSSW«^^ 

SRr5VKLSCNRYLVTRX1^0tnQaAISTKNLTLTT1ffi 

APOSSISISVKSCCLirKCOTAS^OOTIHNSIHI^CAOnOJIJUVStS^ 

SESHKITDLVIKA i- U<RLTyEJCTISF5CLCIJDHEVCAENLT5TILOPVTlAggHJI^ 

CVTLOUiSFKOEASST^TMSICTTlXCSCOARVONl^ILI 

SLOOJCVAFEAYWSVYDF PQFK EATT I Pt( FT \fi PSFDS UXCrTTLXRTOVrTOOAVR 
CFVSUSWEEYPPSLO^RRITPTKinVFLTWPErrSTP 

CPn_0453 513156 516152

pmp_13-Polymorphic Outer Membrane Protein

|3CVLLYLFFYSLSL:=RI IWKLYVOMXTSIRKFLISTrLAPCFASTArTVEVIMPSENF 
DGSSCKIFPYTTI^PRCTI^IFSCOLYIAKLONAISRTSSSCFSNRAflAWIUSICOCVF 
SFLN IRSSAICAAISSVITCNPELCPLSFSCFSOMI FONCESLTSarSASNVIPHAaArY 
ATTPKLFTN»raiLFOYNRSAGFCyUkIRCTSITIEin7^ 
II«.INNSAJVIFST>»TCIYOCAIYLTOOS>fLTSCWLSCVl^^ 
FSNNSOLTFO>>WA5PO^IJAPTPPPTTPAVTPIXCYGGAIFCTPPATPPPTOVSLTIS 
CENSVTFLENIASECCCAL^KKlSIDSNKSTIFLC^AOCCGAIAIFESCEI^LiAWOC 
DILTNKNLS ITSGT7TRHS I HFCXOAKFATLSAT0CYTLYFYDP rTSDDLSAASAAA T VV 
WP«A3AI^YSCT:\TSCETLTATEJUCrPAflATffrLN0KLELXOT 
FTODEKSWlMDAJTTtATTtCAWTftCAI TXiNKLV INL DSLDCT KAAWMVQSTMSALT 
1 3CTLC LVKNSOCCC3NHCMFNK DLQOVP I LEXXATSWTV n I UFSLCTNCYOOSPYCYO 
CTWEFT IDTTTimTCNWKKTCYLPHPERLAPLI PNSLWANVIDIJUVSOASAADOEX3VP 
CKQl^ ITGITWFFHANHTCDARSYRHMOCCYL INTYTR ITPDAALSLCFOOLFTKSKOY L 
VCHr.HSIWFAm^ITXSLTOSSRFFSCCTSRVTTSRSNElCVin'S^TK 
NCWLOELfiC)«.Pir-SSRILKLKOI I PFVlCAEVAYATHCWIOOnTEORIFCHCHaNVA 
VPvnVHFCKNSHNRFDFYTl IVAYAPtVYRHNPtXrOrrLPINCATWTStCNMLTRSTLLV 
QA3SHT3VNDVLE t FCHCGCDIRRTSROYT LD IC3KLP.F 

CPn_0454 Mom S1911**

pmp_14-Polymorphic Outer Membrane Protein

CMr-L^K^^FtrL^VLCSA^AFACTRLfjair/P P ITWEEI ULTr^FVC :XFLCA3F 
rSrS I Mr«'^ML^^«-:(^LTFTSCOAPTNr:rr/ALLSAAETLTFKNFJ:l INFTOIOSTnL 
« JCL f YOKO t VFOS I KTL I FTTNftVA Y 3 P AS'/TT^ AT PA I MVl H*Ajv\LQ I Ttt I .TVD 1 1 

: t fffcw anfvVA i ^ftawkf i nntatks FsmiFTSsmrw uveitis 

<T 1 1 FTAN-'If.'VM:" KJ'/Tri^YrrMLC' Xy»A I CI PT<7TTELKN>J0nKiTFI)Y Nl JTPNOAC 
A I YAETTN rVaH,VAUXO:WrAARNni *>A [ ^'AKVLN IOT3RGP f EF.1RNRA KK( >"»AIF 
: W Jif'AKOT.'n-l-T : *. JUfWP 1 A^V»WlJfrFF^ I RNA ITVEACX* I V.:L.^ X 
DC I TI LTUPTT^l -rfKKD t T ( NAW iA; 3f :: .WFT.'ir ILS.TTELLLPAHTTT t Ui rPVK t l&X 
(J( (TUIAWW/li-.FAT-y SUVt.Tt/ rrUILATPTnAf-AAVOFT I* TKLAPI>l*F;TFLrfl t) 

yvrjcrm* rrrt at^t* xuuau atd tf.Yrjr /^lo^k/a ( v i AVFiav^TVTini :ki-ct,e 
t ATpm »Yf w:mH\Twr:Rn .1. ( pamj i ;Ff /:p^P2AtrrLYAVWc «nLVR:rn* 1 1 j*e 
HY':ErT^;t«i;:KuarjAKrjJiu^A'iJ.tuH^W(TAKAU>WHrrPK»3irt:r/;K 
yr; r,'v^\L.>«rrrOHm/:L;;r^LYCKTNAfjp czmxtLsmorp ivtoksia
woLunr [SAEKLot>csKrrcr;DL0iCFSRcitcYiA^LPrcc3S0wrrprKKAPsn.Ti 

K[.AYKr0tYRVNH[NtVTVVS*CEST3I5CANlJlRHGI^V0I^ 
XKNCPTNIIRV.-mUCnT 

'WiJMS5 520 )« 519458 

No robust homolog present in Genebank/EMBL as of 11/7/98
: ■ - f •■; .. '.' "■" '-: i *s?*r. '." " " * f : *p.rt i "Lrrr.v."; L :cCT-*r£.~*.T«'-J'.".' 
■ -v . . ■ . .. : vr.-: ; :kj .i tta irn r m« „*i -v :r» : r*v: -pekeaaday 

A^TCWAFDDEHLPWV^XAYAEEIREKQEtfT^^ 

ALDAVIKQSVWR FRNPOI^AYEIl£AI^A5VTDALVSYV5MUHtPYTSS0GZVZtCSSIV 
RTSOEKTLIV»CAAfI3KlASOIEFLCPSDVl^ISCKDPLISDDn)EEt^PKVSSAAI3SKD 
KT 

CPn_0456 521558 520327

No robust homolog present in Genebank/EMBL as of 11/7/98
IPCTFtSKKKFUfTKCUO^SWRKHTVO^^ 

MC^mi5HCFE»CVSHPCFPS0ZAJ>IUCVOCIASRSHI5RIEMQUC5Lm 
VHOQVOEJIPYADMASSEVUajIKCWVSEUa^ 

'AJCLSWXPQCABCEDTIWTOJWXYCia^ 

SRYRSfti^LPXNTEKZMCTELYXEXSRTHHQtJfTLGMGLCAO^ 

HCHSYLA^LTHEnJCIlXrSATVTlAWISIQCEIJlEVSL^AJCTSVICCCArifr 

CPn_0457 523866 522120

No robust homolog present in Genebank/EMBL as of 11/7/98

TO^VMASCLSAWFSIVraFYIUrt>rSU»rCM 

LVSRY^FVTKPTFVSWVSUJCITXVAGRIWIA^ 

?vHpra;iOPvcNa,n.ypEwonATtGiArsKiRNWrtWY 

N«KimaATCDCW»nrHGE»)0CnW 

SStJPOtGOXEKAlJnfSELgKEOLYSRLVVVCE^ 

£GHYCHSYI>Ol£NFCL0mXJUkFWItEMSTM 

M5RSQR]fVVVVVCDSWVCTIWKEEPSF0HPZHEI£CP^ 

KESSOSW^ZFCCDSVSOCOIItCUaASSSKUXXECT 

ERESNLMNRJO1CUACRCVRKOK0EAAXOQDESCIYVCN0LTAQQNFACS 

CPn_0458 526344 524236

No robust homolog present in Genebank/EMBL as of 11/7/98

YPLCCTUari^XFTWMPIPYTSSWISTVTO^ 

IPZVSHXVMGHENLVSSCVACZ XTRSSrTSIWIVlCraCMjSWWISRVAZIlOT^^ 

I RT ISIVCSKIJCTPODPSOFVSIANEroRUCPEAX.VCLYLTCLN^ 

LHNSSLI^QCKDSlCEODACSPENPEIJrcGTY^ 

HVTOXTOFmPNFTSYSHTROSTOPT^ 

UtPEDAOIiCPCmrrAPI^OaHYCHSYLADIENE^ 

FNIARLI»LEUJSLn^VAaMn3WZVTIJU^ 

NIFSYKSWMZVKEFOFTOTRSEGXSFTLZl^^ 

DICASKSCZOFSEKOSPOAIKROWEAJIVEDEACEEAREW 

FSI^VWOAZWRfTOKGtXTWERWUGEEFra 

SFmDRHVgVfcPAEWABSGNDPTRPVW^^ 

CPn_0459 527062 526619 

No robust homolog present in Genebank/EMBL as of 11/7/98

SOEKLNLFYVESOALCRE IKVSLEElf IQSMVGXL^SQRTTQCSFlTSVDrf PLEQALQERC 
SSDCCEDATATSTATGAT AS PTEKHEDE 

CPn_0460 527840 526992

No robust homolog present in Genebank/EMBL as of 11/7/98

VIOHLLNFAlXrrPSISVOYOEaEIU^PCDHSPEIGJCMCRWna^FSTrc 

YKLNLGIONSl^GWLLDPYRVCAFLSSPYSCPS^^ 

TFR5VS INFt>SSFGORMSE7LSRVIJ{DEXEXHVAVVCKDAXIXEECI^PEALSIX^SL 

R£SCYSYLNIL5V5PECVSKVOEROZLIUim£GRSPrVKZTDLPl^SE^^ 

LVSSSLDAADACASGCKVLWENPNASV^ELENTYKOVTniRR 

CPn_0461 528647 527844

No robust homolog present in Genebank/EMBL as of 11/7/98

ISZVACFSZSSWFTWROHFVNAFDFTHPVCSRZTOTA^ 
WZ PRHTWHOMFTSDVSSAIICVECTRCHNCLAPl^^ 
?r/DrrPTEIV0LLroEELSTVT3EALQGVTlSRLTYAYl*5VE^ 

L t NFVRLANt^/QNHYPtrnrVKLYIAXNZAIAnrfDCCZ SEEEXG0LRALCU3PKX ESISLT5 
ACLPSVPEVATVOrMZTCYCKDOEVODP 

CPn_0462 531124 529037

No robust homolog present in Genebank/EMBL as of 11/7/98

LIFYUl^YtACVRFIIFOCWFDPMACYZSZHXSTVKQHFIRAFDFTRPL^RtTMPALC 

VIKAIPIUXWICVSWLVSTCSAftRFWPArrSOVASIVKIETrRCTWPLiAWVEDYlJlO 

UWPJ^ECOLGXIHCKVSRDYVCOP/rFOENI/^ 

TPLEVPCXTlWFWUiFEDOVNFVJUJSNaiG^ 

EKEX»UlSLCLIl^IKCVSAPAUXOKYI£SEKXPSCt^ 

NLS3EH I PAISVTYRPOOPFYSYYrrPGSOOCTAPOORI PWSEOEH LQTYTTLSKPRCDR 

YAVHt^EDFAar/FLJPLRVSAP[^GEYSCPSYLLDLJfSFCTJgriXSAriDPKNSGQC 

NPRPMSIN7GNSPU»RWSETt^RVU<DETD0IVAVVCNNP0LIKKSFPSKSLSLLE^ 

EXOTS?LNrVSV^EJm^nC£RRILSSDPSCRSFTVILTOLPECSSDIRNWlJ^DRI^ 

7~SAL0AADACASECK IlXYEDPEOE«AOOYASFYP^IOR>GQLOROCIPGCPLCVSAST 

RWI^DIVFNUWVIOOAMWKFKKRD^AVESQALG^ 

'JVA/^WW0FATU)EAVCAACnSAOOAPSB£NMTO0 

:i-uJM*>\ -.".1248II 5H191 

No robust homolog present in Genebank/EMBL as of 11/7/98

■ - - ^YEKTWI-I/TOJCRTPltVN 1 :7VJT t I PIDET3 tAFVDSMMKO* IVnOOAKELYTFLSR 

■:"«r/Qti:iJrfK::u:EELOFLPDEKMI^rLi;ED^ 

Ml :'JViEf ( UV:: I W 3K;F:*.rL00KnFL'WLRDETr: WWWFKfWLSLPATOVCKLVEE 

1 2r.:rrrr,vt i mj f::ci v iosr.roi .LKRKETiCTSORYPTV ioalyu nyrow;;LOLA3ER im 

V.'IKKFW.VnAYAAH .K I JJZ t DMTNWRPnTFSRliADFADAVOV^Ak.*FNSREFKLITOANOC 
I U-> ;EM'Lt H .;KTI<VFX*l-'l ^VFCOHVTVTR.UF I rttLGAA tKOAWniKllp-JLIOKECEALO 
I iHV.'t.! 1 ::! V:tVl.ir/VTN::HI3(T:WJr'FWKE 1 1 ACCnPLXBALPTCSt DEDVPCT^EDPS 
CPn_0464 "> st; i*-

No robust homolog present in Genebank/EMBL as of 11/7/98

3LETRCRrTCrCtGU.^r&M3UflL0l^3ErTTAaiL#«;t FAPtW. ft '. ISA MftmpOL 
HR rAZVYICVU3SE35K t LBPU J SYM5CIT^3CWLRI»rHCIOW^V^Wi^e^ 
t RCCFT5EDAVPE5EPFXL: :/V>nT3RSCPt.PTKKR5;;j«ElC^XLPESrYPCStniM 
RPRHL5 

CPn_0465 5^227- V2«7i

, , ( „„. * . . . . . - ■,-.!.»■•• ■ ■ ••' ! t 

:;r * ■ ■ ; ' v • : w-- ■ .•.»■-. \. u ; • . ■ • , :k : .mr au~ * : 

a^u : ; j^i t* : i -kki j^vu. i vhvwi-. ■. ui-uichVEDiiiCRORi-^tr 

ROECKVHCCLPSAPFT 

CPn_0466 533718 536537

pmp_15-Polymorphic Outer Membrane Protein

TSmPTCPCKlXPFTFVUWCCUU>LETYr^ 

0T ILOYXYYRSNQCALTCXKLLX SEN ICNVFFEXNVCPNSOCAIYAAOMC? I SKWQKYAF 
TTWLVSCWPTATACSHflf>At ■TAXNCSITWIJOCTFVDNX^rjDXSSAl.TTCtt^IKfJ* 
KCPXXIXONIULNSISLOSCXYSCKSlWXESMSSAZOXTSNSSCSCCSXr^^ 
KXLinSENSAFANWGSNFNPGQGGLTITFCTIiJ^^ 

C inWPVTFtJWATPOGAUJ^ACSOlCSFZLSABNCOI IFNNWrASKHALNTPYHN 

AIHSTPNWLOICARPCYRVUTDPIEHEl^SFPILrNTETWnXT^^ 

DEWFFSYLRJffSELiW^VEDGAClJUrnCTO 

NNEDPYBSI^UHSLOCVFaYZVIVAAOKXKSTOIX^^ 

TTZltnn51XCAKrnnai.YANHSPLCYTlPHPE)WCEFZ7^^ 

DEEXCHAASLOC ICLLVHOKDIOCFKCFRSWfTCYSATTEATSSOS PNTSLCFAOFFS KA 

KEHESQNSTSSHHYFSGMC I EKTLFXEWI RLSVS LA YKTTSEHTWTKYOr; r f irwyffSF 

KNKTUCAX^CmPOPHCESX^ZYPrZTAXAXRCaaJkAF^ 

LPVGZ]UkS>nO>MKRVPLVWLTEZSYRSTLY1^DPEXi<SKL^ 

IKVKNTTCWTttVTt^IItfSAOI^ 

CPn_0467 536528 539434

pmp_16-Polymorphic Outer Membrane Protein

NXXLTZSOOWUCIOTLVSJCT^PItFLFYUaO^ACWFOfrPAW 

ETirrSrPlXDSi^TL7t7SPXTrrVGN)U<N5SOOXVLSNYXSZ 

NTU^NVEtJHArrSXm>IC7RXlAIAC0CACTrrW^ 

AIAOCTrriSONOCTFYTWOiSVKNWayU-STNCT 

RSOOTxsxamtPXYntwGMOUXQTSVTVAxnaisc^ 

GGAXrTT1IL5II)nDX7TZLFWQ(YCZIUX3CAXCTQFLTX 
LQQSTCXXFASXSIXAFONNEVFLITFGKYNAIKCTPNSNL^ 
HPTnn^IFW^«NKOCTIU^SAYlPEAStnfE>INFISSS10rrSL^^ 
IYKFT0XQCZLICLCHAA5ZATTAN5ETPSTSVC»O^ZN^ 

Pl^S^rrEMPTXTtSCPLTIXNXENRDPYDS IDLSEPLONIKLLSLSOVTAJIHIKT 

tKTHPESLNATritYCYOaiWSPYWVrriTTONASICT 

YQGDIATTPLWQSF}n?ffStiH5YNRTOTSDIE 

lOSTCYSLOASSgrSLHOXXSLCrAO y 1 1 Rl AU CSSNW^AHWWSSLYVgLPWfOW 

ATST^YCYCDHHLHSUiPSHOEOAECTCYSHTIJUUklCCSFPW 

AZl^lOTAFEEXCZMPIUarVSOKPFYNUTLPlAXOCKWOSKFHVPTetf^^ 

0NPCXGNmXA5GCS^XlA>04rVIWAI£YKVHNOT 

LOACSTUCF 

CPn_0468 539608 540432

pmp_17-Polymorphic Outer Membrane Protein

XYlOIAflOlKZrrZlKLTrHZievltlFfmpZCUZLST-Al^ 

SAmSPSFIUllVTPEPLVSSmSNlJiJCFCHCZTOOZTZTCNSZlWC^^ 

XlWUSO^ISENnGNLSFEJUISSHSSaaaYSVRECVXSimO^ 

FCGAXHAXXSYinMLCECCFlfNVTSia^RaCMrn 

FOOCirCRAViaEWiYONIOIKrmSGOCVVYrLP 

CPn_0469 540399 541460

pmp_17-Polymorphic Outer Membrane Protein (Frame-shift with 0469)

C^RTRCCIFSAI^IISSK)CEIIEIS^«SASSINTASCKLYPCOOCI>CTSLVIEtOIPICC 

UFKWCTAAI^G£^KTTlSFIFO»GFTAFlNNSATSCCALIKLSCICSTP<^^ 

COX LFNNNT ZTS SSPOPGYRNALYAAPC INUCLCAJIOGYK X LFYDPXOKOOmDPXVPN 

YEPKHLGTVLFSCXNVDSNATWLMFLSXFSNSSRX^RCVIAZECR^ 

LRLCNAAU RTKCPG S SI NTNAXAXNLPS ILOSEASAPKFWI YPTLTCSTYS ESTSSTXT 

LSG PLTFLWDENENPY DSLOLS EPRXOI P PPLP PRC0CKXNRYFE5 KCRSHEU 

CPn_0470 541357 542532

pmp_17-Polymorphic Outer Membrane Protein (Frame-shift with 0470)

isuJi^ispu.YUJr/rAxxzoTSNLivEyue^ 

EOTNTNKRQ LYVDWTPVCYR PNPERHCE F IANTLMQSAYNALLG I R I LP PONUCEHOLEA 
SLOCLCIX INOHNREORKGFRNHTTCYAATTSAXTAARHSFS LCFAOMFSKTRCROSPST 
TSSWYFACLRFDSLI^RIJFISTCLSU^SYCWHMLCHYTEILXCSSKAFF 
LDCTFLPAA ITRTLELOPFX £A I ALRCSQASFQETCDH I RKFH PKH PLTOLSS PZCFRS E 
WKTSHH t PM LVffTEX SYVPTLYRKNTEMFTTLLZ SNCTVnTOATPVSYNSVAAItZ RNTSO 
LFSI^LSUWSAOVSSSn'COYLKAESMCTF 

CPn_0471 542561 545401

pmp_18-Polymorphic Outer Membrane Protein
TVONOTSL^KSSFFV^ALIUSCTTrLLNATPLS^ 

H RATLfCVROtTTNOr t VLDHCN5 1 ESWFENFSOOOCALSCKS LA XTNTKNO Z LFLNSFA I 

KRASAMYVNCNFOLSENHCsx x ro&i£Tto4Asttr*mcTaiAvicsmni$MxrrM 

F INNKAKSSOCA IOAA I X N t KOfTPCPCLF FNNAACOTTAQCALFANACR X ENNSQP IYFLN 
N0SCLSCA t RVMCEC I LTKNTCT/ 1 FN7JNFAMEAD I SANHS3CCA I YC t SCS XKI3N PCI A 
AFOWNTAAROQC1A UTOriLTI0OOTPVYFTNNOCIWC3CA IMLROOrjkCTLFAOOQDI XFY 
NNRHFK DTF.TN1 fVSVNCTR MV'.TLTVT. AT^Q l.^ATFYDP I LORYTtONS tOKFNPNPlMLC 
TILFSrTY IPOTrTSROOF IPHFRNII [CLYNfTTLALEOfiAEWKVYK FDOFT3CTLRLCSRA 
VF! rTTrJI'EO: '•: ~ • : SJV I N t KNLA rNLr-n rUWRVAPKLWIRPTC.'; ^'APY^ECtMP C ZNL 

vwr;fYwrrr tTT:'rT:;r,nm*NTi jiRoi.v^wTWwrvNrEwicnof ai j:apmoufhnlf 

ATUTYVIW V I AITA.WJVTM .r/IWjf f INNI lAKCFIIMRATTTYn fi TTTl INTA! 3fl ttlFTtVN 

F;;or-F:JNl.v k; :i u:rtt**VA:;trTTrMU^ ( mm «i«ER P:rrjASi-\Y;;y;»ji in [ r kxwx 

I VTFiKini-:nTL* UAL.X:;u:f ^UWR:»I T J IFfl'F rOA I AVRSNUTAf-'OIiTnOTAHK FCVI I 

Kt , i//NL*rvii*MC;VVWF«';Kn{!.r'rvvffiiKij\Yvr'vi # Y'> ( ^NpE tNv:;iirj*:Ti.";:MjfcjnT 
i ai*na i AKKt a«J0 1 k i fiki .: Vfi i,v«/ s:v:^-*rpmiYu lAcrrFKr 

i.IlijHV.* *.4 'I4 '.4'*'i»M 
No robust homolog present in Genebank/EMBL as of 11/7/98

FvywA:a;>'y^:yxxippicDMrowRapspKf^igsHEiSLPPQEHcm 

JJSFLPEDUCIQiSMAASSPCFFfflWRSGVDPJUJCSF^ 

.jKT XTADER ROVQSSSAAATEARVMX)ASVSCZNP50CVPCnSCPCPORLF5LP5VKK0 
JCLCRLVC7VRDR rVLPSCAPPTOSEPULYELNLRLSSLROELSO XQSNDOLTPEEXAE 
ATVTIQQLIQITEF0CCYMEAT03SVStA£^ 
noSLONXIXETAXJOtXAAlJHTRUFSISD^ 
T^EMtf5TRLWtRCi\U^U^ILSZl£StU<RLRIAAHA 

r' : •■ •t't-t vriffr.TW! — :^.■:v^ci^:/^5^A^f) , r:ri^zr>T-":•"■ 
"i ■ r ! . " "» ; .*f ■: :. r;: • *■* ;vmet*.i i r fv* v* f v\i*i: v * -pw:*p \p .• 

rOUybVPirJA£PSKDKNIYKrPfUATPAXY3Lr3RPaSX55iUP5JLfIM^JPMIUtC 
VPU»pVPSPAJtSEEC5IYXDliSGASGAGESDmJM 

0RWIDRIL0E31SG(^ASPVEPlyDEIWrHCRPPATLPRPDm.TNVSUtVSPG^ 
RAAUJCSVSAVMVEA£SrvpPTEPOTESr«XPLOCLVATO 

CPn_0473 549602 548070

No robust homolog present in Genebank/EMBL as of 11/7/98
SSIMWGCVGCSRSPSPXPPKIUWSEDGieVSPICa*^^ 
0LGCTDXtPLP5VKePGO50TSCRSava4RIWX£MCQTI^ 
GQRLPGIX31TORI0KR£ENPEAD*J(30}«RSYSDOT^ 

SXSSmSGVRGATTIVCXALSDAJCEieVSA^ 
WNXAAI(HIJUCAUM43CVAPK^ 

VC50SVEYA5 IVPODCSQAPAETA£APCTGCVTC5AAQCWyArJU)rWSirOAVASFTR 

AlASRLSSAJWESAVDDLA£ESNTOWFVEaEGVSm>S^ 

05L£KLeSQArrDPVZQ0Cl£UUl5FAPEC0 

CPn_0474 551600 549807 

CT365 hypothetical protein 
LKXtX5XSFMSTSPISNDFRYI£LSMAn3CTSU^^ 

IFTHSVTLFAGLVVIIA/AVSVVVVALT^^ IMKSLVYM 
VRimiSPlUaESSRXICSAIJtVCTGF^ 

HTLASFSHY IYTKFF1lSCXVAXG£KLTEA£riKEAKKLHYXTLSIAT IGVGLAVLGXLLA 

IAGTVLLOSAPATIAX IIJU>PLISICLTn^^IUiSSICKWRAnXTOaCXDIJVDTSL 

KDIRLDa.PP5EVECSer50SVXCVPDSESIA£TIUSA£CXX7XWATriW 

LLX^IAAFXVTCFCCLTVHrjVTiVASWSAVA^ 

RWKBKKKKKRVRQFLIESGVXASOR£F>QMWR7\nf1^ 

SALVGGIlACWCTGIMLIJkLVPA7APIVPGILAi^ 

LYEWWNRRELLYCPESKMRS XATDLVVCAXJU^SKZIKLmfCPVXtFIimWDIDCAA 

CPn_0475 553850 551685

glgB-Glucan Branching Enzyme

?SHVIIKLIHPWDU^VSGROXDPHICLLG^ 

AVAYRSCIJTI^VPKGIGXCDYKVYHQfCUJU^^ 

RHCMPKEVTXISGVI^VU^HAORVSVVGDFim-^^ 

ECIRYTOflErvrOSOArrvrropYC^rDPPPOGTARVAD^ 

PVTIYEVKLCSWOWEGRPLSYSIMAH^^ 

CYYAPTOTCTOEFOYFVDYUiKKN IC 1 1 LT>fVPGHFWEAFALAiIFDG£FLYE3(TGHS 
OAIJiWfliWITrFtJYSWCEVTWFlXCSAlJV^ 

PN XYCXmoiLeSimJCHLNSVIHICSrSCVLTFAEESTAi ' K,VTTU)VUAJUU5FPYJOt*L 

C>»<HI71WyFMKDPMyilKYHOKDLTFSLWYArOESFI LP LSHDCWHGKGSLVKXLPCDT 

MIllFAOKRVLLSYOXCLPClCKIXFWXZFaTYC^ 

LNALYT^PYUWOKSOECFlfWVDrHDIEMAaAYYRFACSNRSSA 

SYVUtCtCVKHCELLUOTOESFQGSCaGCa^ 

FT 

CPn_0476 554877 553858

CT865 hypothetical protein
CRGRRAIwaX3<rDIMQHFTCPYTMVPOQiaJ>rPC5LXY 

I^XIPLiaiFA\TOt3LHROCI>VTSERYlCYYlJJ>SC ITTOS I KGKLPSAAQAG PLLSLCV 
HXHAI^tt/RCRW)tlC£II^LWFR/AA 

IAPAI^IAXAGFSECFU*RSYDEZFQGILPODGDP£GGVPFEU£Y^ 

CQLVEILPAU'PEFTCCRLIKVAUTa^LSIVWTKXTIROVELHArYK 

C5AMJl£N5ERR£^CSKIU^LCSTUIXAGrrYU^ 

CPn_0477 556 1« 554844 

yqeV-Bs Hypothetical protein

RYmVAEV KG n KtVCLGCRVNOYEVOAYRDOLT ILGYOEVLOSEI PADLCI INTCAVTA 
SAESSGRHAVROLCRONPTAHIVVTGCLGESDKEFFASL^ EK IFS 

YDTTFPEFK IHSFECKSRAFIKVODGCNSFCSYCI I PYLRCRSVSRPAEKILAEI ACWO 
0CYREWXACXNVG0YCSXjERSLA5tIE0VWXPCXERXRX55XDP00ITEDUlRAXTSS 
RHTCPSSHL\WCSNS XUCRKNRKYSRCDFXJXVX^ 

QDFEOTLR 1 I EOVCFX KVHSF PFSARRRTXAYTPBNOI ENQV XY ERKXYLAEVAKRVGOR 
CKMKRX/jETTEVLVEKVTCQVATCHSPYFEKVSFPVVCTVA xntlvsvrldrveebgl xg 
EXV 

CPn_0478 557640 556210

hflX-GTP Binding Protein

WHOGPL0T I OTPCEOC^SFarmjCARTOLPRKEODPSOALAVASYONKTDSOWEEHU) 
ELISIAOSCGISVLETRSWIUTPSASTY rrMCKLEEIEEILKETPSTCTLX tDEEITPS 
QQftNLEKALCLVVUMTEL XLE IFS SRALTAEAN lOVOLAQARYLLPRLICRLWCHLSROX 
SaCCSGCFVKCECO(OIELDRRKVRERIHW^AOUCAVIKORAERRICVKSRKCIPTFAl.I 
rJYTHSCKSTLLNLLTAADTYVEOKLF ATXOPKTRKCVLPGCRHVIXTOTVCF I RKLPHTL 
VAAPKSTX£AAFHEX7VLiLHVVQA5KPtAIiEWQ^ 

'>:SrPHKLRU-SPLPVLI3AKTCECI0NLLSLMTEI IOEKSLHVTLNFPYTEYGKPrELC 
QA3WAS5RY0EDFLVVTEAYLPKEL0KKFR PF XSYVFPESCCDOECRCPVLESSFGD 

CPn_0479 558434 557616

phnP-Metal Dependent Hydrolase

A t( *VRD IQSESIGKLVFLCTCNPEC t PVPFCSCRVroiOTCIHRLRSSVL IGVQf IKTLVI 
I>A< IPnFRTOMLVAGVSELOfWFLTMPHYDH IQRIDDUtAWY I VTORSLPLVLnATTYRFL 
NK;«ir/LFATPWESSIJ>AVLEFTlLNEIXro 

AYLTDLCT/DAK [ F.TYLDNVCTLX LSACF:*ETr I PFV aiKiTCI (LTVEEAKAFANHACX KN 
I.I miI:aiCI.EAEROWIPLVrFAYOt»4EVUm. 

•:iii_U4h*i '.V»170 1^H»;5l> 

-.T'.hi iiyrmrh«>f- icit uronein 

: :ijkh i :.wj i ; a :HU;Erv::KKeoa.*Mu^:i .pcy n wai ieevknryfwvlcaew.ipyw 
t v i wnw yvprrr ; i lovi j«;kohkfu iu^;p tT:;uMLEpvr*vju\jLEnAKYEirn 
•.vHMiui'.-::rv:»wFr.z;i^ i fagl tvnvttvEAPL t at jljawv t rtr t iowrailcltail 

MAYUll* H'VRmillUMEY rTOCHCROtQWirW-T/ rrKYrATl'AI^PlTKLPNCSRR 
CPR.04U I j—W^? V AW »•> ^ .j-y --53-

No robust homolog present in Genebank/EMBL as of 11/7/98

2CLR I EC I LMAT3VT^TmTT<"CEANSSKER FTERT SRMYY AALVLCAL^d. I F IA« IV I 

LCNEYWRS EL X5 LTLRCCLHE^l r.TJSKORriC I DC.T^ I UCXXTirrriJUJCIOXVK 
IWIILHLVRqWNLUr.-LSPr/TAHAFET TJ FLIEECYYSFSILCIRYCDALOATSPLH 

, VI.*' - ."- -.•-. - — * —-.--* *«—^* - • ■ *• «?f ( T«fJt»-* y«-t- — • •■»•«■ 

r**r i Jt^! ,: ■ .■;.,*■**" ■ r .*. ■ • ■ v,:,'v^\ . rw«.*:-*wr.-::. *a»: rgc : 

LYEYPL5YLZCUAV'U^^P^£X5LEXX}AXrmrL0C^SKLSOFA5RL^^ 

OVO^EOAAVKLV>CLAACCV3FOC1JULKYLTAVTO 

CSLCO 

CPn_0482 561764 560961

artJ-Arginine Periplasmic Binding Protein

NIAYlUlOITHXXQXGRFTRAFXFXMFl^LTSCESKXCRJJRXWX^ 

EW5FDXrXAXAX5EXX£KgL£VR£FA^AUX^tJCKKRXMXt^ 

PYYCDEVOELI4WSKRSlCrr/lJT.TOYSSVAVaTCTFOEKYI^ 

WEVRYGXS PVAVLEPSVCR'/VUCDFPKLVATRLELP PECWV-LGCGLGVAXDRP ETICT I 

0OAITD LKS EGVI Q SLTKKVCLS EVA YE 

CPn_0483 561830 564964 

No robust homolog present in Genebank/EMBL as of 11/7/98

XXUIDXRAXF1»MFPXPPPHCPPNM0)MFYMLrrirn^ 

LLLXHYYIOHMRVVRXECnJTPPTI^KGPEPKrXEIAXO 

TPPPPPI<APSPASPKVPKOPADIOCPTPPPEAPPPPVRVATPMPLRPSSOCrW 

MVUOUPLPLPAH*TiPIU3FNPHrVASYP^ 

RLAQIJIJO ALYIJO>T^VWV^IXacrYIUYAVGWLSALYEtSSRW XVT EQtATRLL 
*XPFASSSPANANirAI>4A£110LCSTYCSF:DLYIX7^ILSOKKTAT^ 
RQOXAAS5NEXCrARAXJ'X5CK9DOLLFSVUTLAA»ffiPYSELFQI^ 
LFLLLEHLPALFLTOAELOKKS PESXliUCQrER£IR£AFAJCLSRRXAOSGMIR1ltnUI 
ViaWIJ*EAIRCOYSRriATIENRRSGDUVSPAI^FFAF 

OX I lASAPPWIOEXWXSNWLSYLNXOILDSSW 
PCLETUORIAmJCNVISTSFETPPLJWOPtJU^^V^^ 

YAFUCVENEIOXLOiaUtSXX^SLVLICPPARYPSTPSIfl^ 
E)0MC>CrWTTPW01Xr£AIliEXEZE^TrR>O^ 

ATIJCYSRARIX3UCNIIJCMIVTA5n^CRSCFTOSLICY^ 

CANDVMKTTOTLOPFAVCXJMSIMXTrVS 

NKYGCXXPRNPRTESGN5KPOSSNP 

CPn_0484 564931 565824 

aroG-Deoxyheptonate Aldolase
RSIUaX^UCSLVI^EVLILTriYPLPirriJCOHPOTVHTVP 
. TLCSYlHTV5SALTVXXMGACVF1tC5XRXPlrrSPF5F^^ 

TEVXiMU-VEITAEKVDXLRXGAJDAOOTPLLOEVSXSHRPX XLXRSPAATLCIMLCAAE 

YILASSPSCXX^LCXICIRTFZ«STRYTXJ)I^rVAIXXElSSl^ 

LPt^SAGLSVCMXXUiaEVHAi<PEXAlXDAKXITX^^ 

CPn_0485 565993 566229

CT382.1 hypothetical protein

0PIGRTPTRV7T>TRFMIK0ACXFYLL0CIXCALYWtXJrrcRXLI^^ 
5SLX0LLYQLK0LPAFTNE 

CPn_0486 567799 566405

hypothetical proline permease 

A0MtSlXICTIFiajGCGVLYF)4NFS^^ 
SUXFPlJt4TFXATt3XGGGVX^JGAA£CAFCY1»GGILYPL^ 
SLTTVVSXrEWYGSKXLMXAFXt£ACSLFTILV^ 
VLASYTSTGGFTtGVVRTWXCAOTtXIAN^VCGVSVWI^^ 

WI FKPMLFMLVECCMVQRCVAASS PIWI^AAVCACLVLLLFNFI PLFLCSLSAKACLKA 
GCPLXOTXAYPCNP5IJUVMAAAXGVAII^AO5LM»UVS0LXAEEYPTl^^ 
GLAVAAPLVAICFTNIVr^Il^SI^VCCl-SVPWFTUAP 
LC^fCWOrVSLCTMFC^LI>WVCSLVAFSrV^ 

CPn_0487 569833 568112

CT384 hypothetical protein 

RRTOG X 5 LTY55FRWA5FRCYSL X FFCFCGSLFCS ESLRYOLLXODFAKVSEEGXGLLCS 
KEYSIX0AXLVUMU^N5SFI>DWPRSFKKC0XSYPE1WU{0RE^ 
SVTVRAVSVI^IGIJUIDFRLVPLI-WSCNDDSAIVRSIJU^AVW^ 
NOXtSXKVRXTAYOWALLO XEELL^FLRERADIKLVDSVERREAWXACLCLSSOFUrCV 
AJOOtDOALFTCEVLANC^PrTTEIPTELI^VENPEVOESU^^L^ 
SKWHVMCTS PFAXV*FOAAAUJiU<CD^ 

(^/PI^EHLE5LSSRXAAA^^IlXL\^RECIEAAC0VIARYL2^^EHCUAIEYFLI k XA0 
WNLRCCTFPLYSI>4IKREIJRKLIRLl^VARY3OAKAVTATFL^C0OAOC3WSFFSCMFWE 

EC0VICT3EIlL\rr0ACFAAJCLECALAS LCOKX£%A5LORVSOL Y^ 
S ENLDAVPFLLDCCHHEAPS LRSAAAGA LFS X FK 

CPn_0489 570147 569767

hitA-HIT Family Hydrolase

RKLPTCFAVNVTRSRDMKn'FKOIIDCLIDCEr/FENENFIA IKORFPOAPVHLLI IPKK 
P I PRFQD X PCDEM I LHAEAJK I VQELAAC FC X AX3C YRW INNGAEGGOAVFKLH X KLLGG 
RPLCAIA 

CPn_04*i'# ITU*!? 570111*. 

CT187 hypothetical protein

R t VFAI EHYF"L\rr>ti;rjWI^R I */*TMO I PR: » I ' *TI (OCSFH AOEVTAI *AI .LIT FDLVDDflC t 
I Ri IRDP WI .TKi : E\*VVDVi**VY." IENKR FW It V/iV. Y Dt ZW. Ml Li IYLKFRTYMDCEE 

YHKiKrrrLviK .vdkchw ;RFK.:KE*-;FC.*:F::r>i t k iYNi-ftEEF.FrN:;tjAO«a:ALirpTrDF 

U.*hU'KKFUYOHVi *(4 ; t VR KAWmOM :i .Y^C I'LAWOCHPFFU Y;P3(IIPAAWFPf«) 

<jw ilk; rPi-Ni j>KRHi> , /f'vrKrFJikAi ;i j* *KEi.':Kvr/> r roAVFrnitf .1 ji j wnxRp'v? 

OKAUhLTr^MH";! I 

i.'hi.iM'in »i.*7». './nil 

t.TlMV tfffOrtfi I. Ml IM.It'ttf. 

iMYNixiiAi 11 iiMA::ri » m.vm ur r.u:n 1 1 YtHKVL 1 1 Ji 1 i-ayki / :ki iu • tuVNJjt;; 
:x*\iurWAVUu it yj warmm* ijivi >x* W' i ATAMLELf a:w :k i ^aai s whi. 
vh: a « ."YinttHFYt m wet* z: 1 u ,i« r :r ki .w ik in jfi 1 1 f ion lwfi 4- 1 1 ,n rr 1 it h\ tt \ v ; 
FLPLK>':n.TwrHU(rRicruPL"r><vTnpppvPE ^ticriPuirRr/rARvvopLL
tTctoevf!^ 

ET^DWFlZoUFACI:*^^ 

rirLKT^^LVPV FL IC PVDYVWK ! TAL YN5NHAVGT I RGSEWWCLFCLSSAJCAG I A 

•*-Mr: i w.t ft* :r/*" 

-*Pn.04yi 574Vi5 V/JJJ<> 

TT399 hypotnetical protein Lul oeror D 

oSwrWDCYXTCW^ 

RD\misAci!tfKE>cweNR^ 

ISmTPOXRLYOtaBYrVSRCtffFPOIPinfCWG^ 
WEEOKFCLDOSYXljatEMWOEXGRKXW 

CPn_0492 574643 574804

No robust homolog present in Genebank/EMBL as of 11/7/98
LTSLIfP ICIERJBQQTVKHUIVISACriiESPLKtHWSSPTGPPPFYIWDEJCL 

CPn_0493 575142 574855

No robust homolog present in Genebank/EMBL as of 11/7/98

SKTEX5HSKTSKCFVCRFV*>fIRT^^ 

WCOEDATTSriET^lCIUCKPCNKDPICCKHWWKOS 

No robust homolog present in Genebank/EMBL as of 11/7/98
SLEWCVXCISEAHFK 

CPn_0495 575507 576793

aspC-Aspartate Aminotransferase

RJU4OCNC»attI0KAGAnJUiPSE»^ 

EKPEISVIOUXCDTrgFIXSSrroAIXEFCW 

YEWUSPKXFXSKyuePDIFMJSFTC*^^ 

RKCTCriPnJWSU)Il£LCYPNN1TCrVLTTO 

VSDPSl^KSimPEAXYCAIEXNSFSKSLCTTCMR^ 

:*ATTn*»SIJJ»EAGYYGLDLFSW^ 

WVELPECISDEEAnJFTLHOYHIAVTPCHCrGSCOOGrVRrSAtTOTO 
SLKETKVU 

CPn_0496 576751 577812

PXJJ«SKSCIETlJaroiXJEIV\fcKA£I)SI 

VKSK1 ETOKPVX YAAVPORESLTl^ronMNIYCNOT 

WSEPFPSDXOTIVKKLHASOIEvIEISITSSra 

ECTAfTjOEILKEXIPIITODT^SEGACre^ 

RXI IAOPXSPTTTTNEDIIKnfilXXiOerEWFLSnCS^^ 

CPn_0497 578107 577820

IFORVWjSlLEVK^^ 
LPKRJJ^XACCTSWUOCFLLPNRVOOIIFSLHIDV 

CPn_0498 579062 578085 

No robust homolog present in Genebank/EMBL as of

YCRIJUUtfFWRMCARWVVAlJAKr^^ 

pEAEWraJLIESSSAWVLTPaRFSGELVSICCOTm 

DCAVVFCCPLPALRORLDFLVREJCWr^rKKrWlXCER^ 

PTSJMrJtSGNRVTPSSEmAKTVtWQHL^ 

DTlXtFIlSYOEAFICTVLFVSSOPFIGIJ^R\W 

RICLKTLACHLKETNCCUIISEGCFC 

CPn_0499 580404 579205

No robust homolog present in Genebank/EMBL as of 11/7/98
LSWU/irrFCNCSTNSSVNOSSGTTOPEEVTSPESTE^ IV 
7QLSLPEGVCTS SEETASNPRVOEI VAEVSSSRAVADQ I SSLVERVCELLDOLKCAOSLF 
TSFOSEXJaCLPAI^STRRLTTRGACTlNAD 
5KLTDVKKKL0Gt£REDX£LAFDNNDRv^^ 
OSMLVOIKKVNLPreEXJmjOGTTES^ 

EMIVRAECESIHPESP1CDCRN0PSRCDX0DSDSEEETEL 

CPn_0500 580647 582362

proS-Prolyl tRNA Synthetase
OPHSWCTSOLrnCTSICNANKSAAVLShlEIXE^ 
IREEWXGGOELLU'LLHNAELMOHTOWEAFTSB 

C3 FVAQWLSSKROLPLKLYQI ATKFROEI RPRFCLXR7PXLLKESSYTFSOSPG0MIEOY 
KUttAYSKIFDRLGUWIVTADOTX(»GK3EEF 

V.-5 1 PPCIUYDREFt»PVEEVATPCITTIEALANFFSI PLHICIUCTLVVKLSYSMEEKFIAI 
JMRGDROVNLVKVASKt>UUI>IALASDEEIEJ^^ 

NPyCACNAKDKHYVWNlpOROLLPPOYCDFLLAEECDTCPCNPCHPYRIYOCIEVAHIFN 
rTP.'n , rrTr/flFOreKrX r TCOCV<MCTYGTCVCRTLAACVEOLADORC tVWPKALAPFS I 

t t AniyarrvnoELAET i yhewocjcyefllooroerlcfklkccpl ic ipyklilgksy 
o:::?i[FEie:;r»ekytvspeafptwconiila 

lurA-imi Tnnr-ci ifjr. inn.ll Rtsptcnom 

I LLTPnrr :F1» I ML: I VLVrXO*AR:'4CVSKftOltK 1 I J) t LFATTELYU-TO&K/TSKTLK 
ttWittJUrrAT I RWrAiaJ^AL'AlFLKKNIfT^tR ttTDLALWIYVnUOKECr-EAEt^AP I 
K[<T IXjl j :;E:;i<N 1 1 KPt /JM ATELU^E t LOUPTFFf^" PfcFFNOJ IVTN tO rT&VttCQRAVT 

: u .tejt v i rrnrrurtj 'kaojtu: i kr i exfi x?ny t pklptncel^kkeehu'^clynev 
wifYi.Tnvi w:*ia»i.7vn k jku j(yeafkdpevlaux::lfe«rrvmckuji icmiikc 

M^rAF^WKUIL/l UTRVII* X.-JV IT t PYYMNR.1PU'^Ln I U!f* I N1J *YK FALM .UCLFAN 
K 1 HKT1 ,TV;: F V It KKt .'"PPM M .TSNCKLrWF.P I LRTEYS.1 1 Kl .LlttKFTI . 



SSI^otS5fiS3sS^ 

TPK8MKE 



CPn_050 » 



M4225 S-JA213 



PAKRUAVTWFEKTLJ-jTKFvF tCKKV JEV AJEU'TVt (T > /Tao^>oJDAVF EVOCXQTTPfcE 

1a^ycidkvcoxkia\^ijccc^ 

SlTSTOEGXDt^10»eUL0RlJCn^^ 
LTMFDOJU^IERmFCXICAliOAXlJAJCDIDC^ 

KolGvlSptSvW 

TCmxmAAISOPAVTIVVLQCERPMAJCDKKEXCRFOLTD 

ScIrHVSAIOWASCarJOKIRXEASSCM 

^FUEmKirfXEOXPETLVKnEStXCNVRfUL^ 

CC^SWASAAASSAANAKQQPNINTEDIJOCHSFSTKPPSJ*^ tCN 



DDK 



S8BS14 



CPn_0504 58*418 
vaO-ribonuclaase family ( 

ATorotraixvocpiGLTOMUjapi^^ 

spdhtceypfpifvparolrcat. nnrmvrvsvu^PRTOKucgnsEviARcrrrLVCT 

ITSLVS FTSAUkYTSHSCSOSLI PVnXPCTTTYXICORI LI^PWVDKPOEGAS PAtflM 

LErrCHITNAJCADrOAI0Ar^NUaiFFPEVIEEA5Lre 

XDSSTAIuirODAXSXTYt»N«YXLSVHXAD/» 

tnoiSHPUKiLNDaTi^iarrsDXiieEWXRr^ 

SaOTHLIWNWWYHISlCTOt^rP^PPN^ 

SSSSlottsaoipijqvijisofvrsmct 

UMIRSjWsiWTOlXIIVWCSTra^ 

HAY X ITAIWICLSFVyTEFCHECF IAAAEtPJCEYSLJOOlAI*PES X POKMKPCASXKVTXD 
CPn_0505 588471 589106

3-methyladenine DNA glycosylase

^SSjHtSrwcrWAKY^ 

DQGOX^XOIUQWWPHLLTNBPGIC^QALCXSI^ 
ATAJUCIDYWETWPWRFU5PEDSCICVLS 

CPn_0506 589055 589840

§WSPWRF^FILN^ 

vS3^TOPFVCLA«EFFDCFSrmHIUlXIC^ 

■ SSrZe Wt^iLmrPimvYicAA^ 

<H»PTvrCTClXnfflPLVTVrX |>TT^^SC^T^FIUtSOIVT^£la^iBArlCYlISOBV 
L5TPHACPSSPLPDELH0O0GS 

CPn_0507 589898 590122

Stwfp2s«x*5n^^ 

KALSTSDSIFIPXIC 

CPn_0508 590133 590100

CT421.2 hypothetical protein 
SRIKSRHRSYOCSVK^nTWNVUOlFC^VWUUa^ 

CPn_0509 590299 590808

(predicted Metalloenzyme)

WWVaYt>IFIRVTOacIXnfVSNEOTCIPXHLVSVDtLVLT^ 
[RUUAEIJ10KVrAI»SLTXTXTU>XDAPCDPAYPKVt£EAFXSP 

iYmsRYLVKSiuoiiirfDcrrssiiiouc>orwENO 

CPn_0510 590804 591973 

tlyC-CBS Domains (Hemolysin homolog)

OLNHUULLAIFCIUJIATGLTSPSCHCS^ 

TLLC I LY^ALCTKLYTIXPPKTAHKDLLJVPLYSX-SALI AYCF LP PWI SnCVPKETTAHL 

RFWSVF0LCLFPU3UJYHIWPW0GVRSSTSFOSOI3EALSAFO^ 

ALPEtTTLOEALVLVSESrfSR\aVYKIWU»IITGIU.WC^^ 

KPFFYAPEIKKASStJU3EFTOXHRHLAIXVNrYGrTEGXATMEDXXEEXXGEl 

OIXPYKKICSSWIVKRWISDAEEYRILKXDHEtlSYDTLOQWFHICVQA 

ENFDIEI ITCTERNVQCLKITPRKRKni X S 

CPn_0511 592141 592488

rsbV-Sigma Regulatory Factor

MSDIOKEEHC3TT XFHLHCKLOC I5S P EVQEN X 2QSLAAGSKN X I LDC AKLDYHSSACI R 
VLLO^HOVCOHSCXIVLTTVPKTIEOTLYVTCFI^YFKIFVTVDEAIOTlJntOCO 

CPn_0512 592538 594412

CT425 hypothetical protein

SLPLTMRRSVCYVNPS IARAO0ISTWKFLYSLATPLPA<*nCCKrOLACSCICPTWEAPAT 
DLSOTnWXYAiMPBCEIIEATAIPVKWIPVWEFTLPYELCfvCCrLT 
VODACNCAQtJWRKPnXYtDPTCE'TNYOEPOVFSMD [ PGNVLKKIEIFTPSWVKKK 
RFO rTVRFEOEFQ*LTffFSrEETn t ZU?( D1LP.CILNUQLFI PETOFVI LPNLYFHIPCI 
YRXOLKNLSTCEZ F tr.Ap I KCF ADS AM ILHWTILLHCES ERVOS EENXETCMRYFRXJORAL 
NFYAHnCFOIOENUTDIWKL I NOTVnDFNEEDf F rTUSCFOYSCEPHtJDCVPlH XLHTKI 
TK3It::KHK(rrKH 1 Pt AKLyK;rrVNIIDM C; I riiFTASKEWFOFENFYPEFERWCIYNAW 
i :S::riTAALI WPFP I Of ;KP;: EDI •» 7TV I CSCLK KHLRFCP/AOGLDDRS I YKDYFOSPW) 
Y::it« .TA [ 1 1 .TIKYmr-'U .VKAI. VARI II :Y ATTi ; I VLJ Ft aTSAPHnSELSTQSKPGLKV 

Nnn i :r ttkix jttvf.i i wc *4«ijrrFFra':itii^*rrDDMvri^mJCDWiCWF 

VFYY I M VTCAI MAMAW; :i* I WV W M 



'•■MM ' t 



I'llljl'.t I 
Kt- ;l t*xi»h«r«nliii'i.iiw* 

VI* li a-M JW WICNO I AAKKKI (V rKf/.TVU* IIXR I VI IO-*VI lOmTX.l.t'OPPICTSPtYS 
I f>* I J>AOF4'i: ISErJAI JIM J J.'IWRr/yl'T' JWriKADOVPKOPVCDTVYYI^Tri.YLYPTNF 

. rirp*-r;rMKpi a t^afl ,y;:i*i i wf i i ktp itf/h i vaif:Fp.*xrNi4JYYSOLF 
TK tKETOr V 1 1 1 1 KALTA [ EYAYLSOLOMUil RD*. JCOACLDSt POC3CA£ILVPICnW
rLAPKBL^DFLN I HFMAHOW I H5N I TMLC YH )CECP EDLVTKMVTC/RDLODtrv-»r 
F ILLKFACENJ/VLCKRLPIC7XOU I PLKSLKAVAR i nJWTSW KALWMYLStEAALDL 
[jCGAOTU1.7™MSElC/P0H^ 

CPn 05 M ""9° 596520 

CT427 hypothetical protein

■TJfnPWHTTRENAKntlOLOPC tSLfiCVSY tMSTPULCLtKRNOIRC/LAPPADLUlLLI 

; ;:*•*• : ;rv:;.- *:_.**. "r.'*A*\PTFP:.*' , : "*.*a»...- p.- 

■-"•piii^i — r::.. . ^r^vaxioav,:.-.;;: ":.«*':r.TVCi 

^^GWYDLTKLPFWAl^CrTCWXEHPLPTflJ^^ 
LLOEYYALCOYRLSeEr/CSFEKFREr/CTLYOOARL 

CPn_0515 596450 597181

ubiE-Ubiquinone Methyltransferase

QOfTTKALKNSCNIKEPSTNKPraCKirDSIASITO 

CYSLLDLCACTCKVAKRY I AAHPQASVTLVDFSSAM^I AKOHU>C<^ 

LENHSYnJUOttYGLRNLSOPHKAljOEISRWitPSOICI^ 

RAVWMGKSVSICOPOAYSYLSICSiaOLPia*OUB^ 

LEKQ 

CPn_0516 598904 597255

No robust homolog present in Genebank/EMBL as of 11/7/98

USISFRVSWFVKIILJWLGRAIAKAYYVO^ 

GAKGKEFAKRKFSIIKLDDILKLCIWRRPFAWWia^ 

VSOISPAK^STLVWAPILCSTOLWUTTMOIIJCIW 

ANPTOEIPAASETVTSSPVAPGNTTDTOPAASGTTUITSGVSEAAA 

PS FSLRYALWQNVPYPEP PKEPEVHTTDEEXSIJ LiATRARWffi^YNC^T-WTYELSK 

nEIQKHVPOLPEMHRTWRWSSRLY Ki 'K F K T KIlE lSLPlFX/nC£I£l6^ tAJlGX^ATOSQ 

AWKVFNSLVAWU^FNVCRSCTAKPLPTSl^^ 

LFKGLRVLEPGIECWYDHPOOACEl RSVLECLVOAGRISCYWEWPrCRTVlJCVGERRT 

ttVXtiESLVASGEIHQFFESSDEEeAFIIONWSKTO^ 

LPRGRFTILV 

CPn_0517 599637 598795

No robust homolog present in Genebank/EMBL as of 11/7/98

FIMSSLLSCGRI EPTRVTCSLKTYl^I7rS0IWI-STRLVRA5VIFLCALLI I LVCVALSSL 
IPS IMWATSTTVMCLI L7VMSLLCDVAI ISYLTYSTVTSYW3KXRAFEJHKPAIISVYYE 
GVRKWDLGRS$U7rGEIPIVRT«PFCNHGXJ^^ 
CLIRDFKPHVSSLCTVIEKCCSSLRTKII^I^ 
KMCUNX3I IPSVMVRSnfPSRPCECYRECIJiOWOCKCSAli 

CPn_0518 600806 599832

CT429 hypothetical protein

p^IYpvpgNPlXI^IUU-KDATSKSCOERDTrU^ 
EEKWRYCUP^TTYEVKKDlETFrMEKIYOIiyrKEJCr^ 

AELoarorfVERSRiRJiE«uttoiKrHPVFEzrnj)rn^^ 

AROtASNXAIOYYSNEAiWPttPIWG^ 
r^pSSITTSEjrJCTEEEFtJWU^^ 
OFFCTDCEXVVTKTKCSKPCRXKSS 

CPn_0519 601707 600904

dapF-Diaminopimelate Epimerase

OPTKLRILVYWMAFYS PSTISKYTIYSGACNRFI^ETIJ?rVEDWlFl^)ETRVDCFLYL 

KPSSCMJAOUirNSOCSRfTHGCrGLRCA^AKUSONCaCS^ 

RVLVIHnJUJWRASVKRIXSRPBPLPK^^ 

HOTFSPDGVNVNFVOILGHCOLRVRTYERGVECETAAj^^ 

tKTmEUfTVSONRCRVYLQGSVTRSL 

CPn_0520 602233 601646

ClpP-CLP Protease 

ERHYFMAIX;EVHiaJU>IXEKEU£ARftVFFSEPVm 
VIMSPO^TOWFAVWDOIia^TSPVTTmCWASHCSVlJLCA^ 
HOPS IOCPITGOATOLOIHAREUJCTKARI IDVYVEATNOPRDI IEKAIDRn*t4TANEA 
XDFCLLDCtLFSniDL 

CPn_0521 603803 602241

glyA-Serine Hydroxymethyltransferase

KSLXJCVffEKFKKFAIVEirriCWAWSUJ(KFIJJaSCKICCOSl>CT 

PS ICERI IOEUtS0RSHLKMIASENVSSLSV0IAMC»LLTOKYCECSPFKRrYSCCE^ 

AIDIECVETAKEIJAADCACVgPHSCADANlXAVHAILT 

EEYTLLKAEM5SCVCLG PS LNSGCWLTHC^MUlWKSKIiffiCFPYOVNPWECFOVAEI S 

RLAKEYXPKVLIAGYSSYSJWLNFAVUCOIAEDCCSV^^ 

:PYAOnrTTWIfTLRCPRCaLVLATREYESTLNIt«PL»CCPLPHVI 

S'/DFKXYAHCVVNNARRIAERFLSHCLRXXTCCTDNHMHVZO^ 

CIAVNRJISLPSDAXCKWtrrsCXRLGTPALTTLC^ 

G33KianCELPEAXACEARDRVRNLLLRFPLYPEZCLEAL.V 

CPn_0522 603825 604655

CT433 hypothetical protein

REPLSP EKTSLAFKVKNVNQRM XKKNQCKKKNYFQY I PLKVQKLRQP5FY PKRLKTLYLC 
UOXTARKYOAHYLP I L7LFPYAX3TP0NXRALOFL PQATHV I LTSPSSTHLFLSRHTSL 
LSKATUCTKTYLCICEJTKERLWFlJGQVKYWATGEIA 

JLARFVI REFLYNR FTFFS YPHYTVX PRKLKKN ILSKYKK 1 1 FTSPSTVRAFAXI FPRF P 
£mVC0CRMTLQEFCRFSS0X0VSLLETLCKSRTSP 

CPn_0523 604720 605052

No robust homolog present in Genebank/EMBL as of 11/7/98
ttV^SSATMFDCTApy LFP PATRPRYNFKLALFVT X A IALVW IALIATT IAICLCIHPLC 
..F I FLTAIf LYF ISRV ICSHYARNVY t AUIWrOlCJ'.LQDMRCHSF I FCDR 

No robust homolog present in Genebank/EMBL as of 11/7/98
r/V/FKHVM;i-.r: 3lTK;\ W :VL;7iVrRDKEI APKKOFT t AK I TTUVt LA,"LALCALVAO 

: .:lt (vi/ wivh ai j jrrAU*:nr/TFLV , /iiOMT:*j'*/^nNwyKyi j^wfkpuiiouwkkn 

7'/ .vrJirnVFYNTR I LN1 -KKKVA 1 fJTOA; VPFOFTKLT^LnV ( KKNOtTlJ ( IFNPVnpTNL 

lurrATtifjiT i r ,Y;rrt it i vc: tvwnri rKOKEnn pakofjvpf:* rri-VR wklphealdotfnl 

lll^^lAKrr:: I I.ITKI * :itVu*» :i-K;:KeLPNugfYYItwALlAYWK.IJ(/V.\rE:3»AA ivalplf 

r:rr/rvfi*KK 1 1 .i *kf* rrpywi w/ivafl'ki'Auda i vrrrAi * yi i uoorFwr i e 
VNAQEKSTAKRRRRRAAV

........... 

• ■•1^1 J* ' ' • >~ .* ■ 1 1 : " :;«'-*i'> * 

t^Ml. tt^^JA'YUXK^J^KEHKMr JfM j JTSVCCCl JJiuKWSrFrQAF JPKIAM 

olaexx uwscwvff3gvcx3gcvarklva7w 
pSbttcomxfotwklijksrc^ 

ut rtnrvCTjaJW^Ytgnr/erVDWFRLKSirT^CDtJR^ 

AJmCrr EBSDI AIALQLKESSS PVAVX^VXIIItZVRHVtCLLIWHTlAUICLL 

CPn_0527 609910 608726

RYMFtFiJpxieaTSSGKrniwjMLcaw 

r^TIOTr^rT*"^ rrr rrrgrxgPf^rscPPTSCgmgAass^CVWFSPAVLSL 

AQRCSICXJ^XXAmXCGRVTR0XeAYXSr500VSXPCXFQCt^^ 

XSSL^KSSOEVPHASLVVCWI^rroUOILlSC^RQR^ 

FPLU«3SLDCTTIVKKICSVNVGVAVNXjaC^^ 

UIiai)PSEVOOCSVTVTNFOnCAUGKPXIRYPEVAlLCXCT 

KVWaTrDHRVWIYCSXTLTSIJWRIJCSVnC 

CPn_0528 611165 609921

gltT-Glutamate Symport

SSaO^ICIICSVCLYl^AIAmClCFAWIFSPOOCOFAO^ 
MYTl^IIAOVTPSNPVWFAECNILQIIIFAI 

UWVJMlMSFAJ^WWStAWISCaOtCI^^ 
Cia4SFSKFMS»C^SCAVSTASSSATWTimCVS»^^ 

CTAIFOCWAAVTXAOAYNCPl CTiSSTJJJ.VVTA7TSAVCSA{CT/PCC P(ITU2 SVlASVCL 
PIQG lAILACIQRiJlDIVCTPMWILCOAVVXTWA'gffT^ 

CPn_0529 612298 611165

-ATPatc 

rs^x wiu ^w aF PSWLiTWttff 

R FSK5TFY1tARSTVXS^KamrV0C»ClCT1T^^ 

)O.T%fVPS1CVHSASYTOEPLLhAC)gJECSVWWro T T ITnTfl YR 

KLKRVmAVVIOODFLaSRAFTntCItlJtDF XVNQCCXEACTVVmSKA 

roirremASvvvrwcERiPKEAUuiJ^^ 

p^U^TjOCELNyTCOQHAXROCff 
OTX^LLMKiniKXKRQI 

CPn_0530 613323 612460

■Dou*rRKA HcchylAS9 

SW^KFLKW^OSlJU^EFCS>GXICI^ 
lU^jrXOYXCOHVrCSTHLSCXSSFLYtlJSWSTKXL^^ 

OKRVMnCS>FLX0RXNAOPFYLX XEQVCRPCNVGAX UIXAXX3AOVQCV1 LCMPXVObYNP 
NVVRS^L&AVFSU'Il^ISRXiaCZIWECWTVFVTS PRAETKlTSXNYLOPTALVrQS 
ElOX&TnSUFSEOFSeZAZ^HUXSDSLKl^TS^ 

CPn_0531 614198 613245

SAM dependent Methyltransferase

GPVTLIl««SIAVMPKSllPElKSOAOL0YVRBOTl^^ 

RTFrTXbCnrFFEMtSFWPAtJCOAXEXHICtR^ 

AAVRHAORJ^VEiaMAPPERRXFWVXEDVTSFlJCXEIRRNKir^^ 

KXOKIX.FPtLSLCSXIIJUOASYFlXTSHTPCKTPEFUU 

CCXCUGALPSGSFVOWXA 

CPn_0532 614716 614075

ribC/risA-Riboflavin Synthase

ESFCCKDSWKWOOCFSGI IQE1CEVCFFEA0OCLSLCI KSTPLFVTPLVTCDSVAVDO 
WLTLTSO*ESKIFFWPETIJCTTU5EKRCSOOVtJI^AAl.KKSDSICm 
AXXFLIKEimYn^lCEl^OYLFTCFIAIDCISLTLVSVDSOTFSVCLIPETW 
CKKRECERVNIEIDMSTKIOVCTVKRILASSGKD 

CPn_0533 614918 615385 

CT406 hypothetical protein 

EVAFKOCPFCNHCEUCVI DSRNAPEANA I KRRRXCLKCSORFTTTTTVELTLQVTJCRIJCR 
YENFOESXLXHCUUASSHTRX<»00\AUXA5fMCSIUCKONREXSTKEXCELV^^ 
KACNXAYIRFACVYRRFKOn.'CELMEVLLSATPOHEX 

CPn_0534 615389 615784

dksA-DnaK Suppressor

LN FT RSKWPLS DOEI EQFKKRUJEMKAKLSKTLECNAO EVKK PNEATCYSOHOAO0CT0 
TFDRTISL£VTTKEYEIJJlOINRAlXKINES5YGICDVSGEEIPLARLIAIPYAT>fTVKA 
QEQFEXCLLSCN 

CPn_0535 615763 616296 

lspA-Lipoprotein Signal Peptidase

KRTPtWKLSSMATRFRSTLLVXTLFVLIOhWKLVVLLOYItDLOXWIIPTLYT^ 

FS t APVFNEOAAFCLFSWYKYFLFLLR I FV I LCLLAYLF FKXKS X05TTCT ALVLL CACA 

tCNVC&t IFYGHtWFICFNYKOWAFPTFNVACr/LXSLCTliLWKrYFPTKr^^ 

TPruO'»:^ tiloJOO '.I76*)l 

'i-igA-f/ Al.i«t:iy rninwa** 

YR ."I Cf."F I M YF FKTVHNR U-sLLjVFOOFFW.TT /AT I L t r/LCiVCF^WKitRF^FTKFSO 
tTTKLFF- YY^rVERETKCOVMPUrVFFASAlirailG IOr/'/C IVTAAf.' 1 0;r»^LFWVWI 
A* i I FC" I VKY;ifW t *; I K FRXLDADnV/*>T;PK/FL 1 KAFKTPWnV I VA 1 1 J/.* I YIIVF. t 
WF5V (TDTLAIK W[.PKVYFHLOLLFLVFYA t PtCCLQP. ICK ICC f/LPFPMU-YCALCL 
Y T I .VK RK1 (TLf1 1 IX: rTVF:i:^F WJOSAt/XiFACCrVATT t HOC I SRAAY:i ail'ltOFOtl I 
IO::Fj:.*;AKDP;n\.-Av^; rVCIA lOl^l^TUXLKVlA^SWSLCU^iA^VEIITLAJYF 
I HVKF F! J TKKFVh TYTT L t : W Fl .W rT!AK FLYCNTCA P. I YTLY'JU* I Ll*l .f r HI - W 
ALL. lM::v: ^AU4^'I- Nl.UA'f I U'KET/ 1 FPAfAACLTETCLTTE 
•.T*i4.; iivnnrrwei': >i »f.t.«u.

L l FLLFMDWY l*Li *lLt FTTV! «I r'SHCTI FVMT tCFUlOLWK I UCN THRVTT TLMFEAJC X 
LAf-LMU MKLL. A*Uf KRKNW;5LJE0 1 DELLDEKKORUWKKNLOOCIKhCAALVLIWCV 

:rrsi1 nypr>r,n»»c t';u: procftm 

TKHt/r^Hvr^FHKrr/TKTrrLRDr/MFRNWKPKCT^ 

. ... ... . ,.. t ... r .. : ... v: /r-!.-r r ;. >- t :r--*.-r^vAf:j^K:i:.*:!r:.;cFf: 

CPn_0539 618678 621545

pmp_19-polymorphic membrane protein

TTYSUUCDFrVCOrACNSIHKPGAAFI^IJCOIJ'riNSTPL^ 

E3WVTrKCLHSLViaWESWGCVLTT5CDLSrrNWrSVL00NNI5YC TO 

ALTFWlCTlLrLICNXAVNOOeSHPGyGCAVSSXSPG^^ 

ArYNDQGAITFP^rOTTSyrSNXASFXSCAVYSRYC^YSOWCPnJTKNAAAKVCGXCT 
ADYVH IRXKGSIWEENSATAOCAIAVNAVCTXNAOGPVRF INNSALSUOGMYMOAT 
^.SIUUJiANOCSIEFCCNICVRSOFHSHINSTSNFTtJNAITIOW 

LS I EGGAI LAVRS rYOECCLLALG PCSKLTTOGXHS EJOJICIVZTNLGFNU3JLDSSDPAE 
IRATQCASIEISG'/PTlVVGHTESFVQWEYASKPYTTSIIl^AWCLVTAPSRPEroiOt^ 
1 I AESEYHCYGYOCSWEFSWSPtfflTKEKXTX I ASWTPTGEFStflFKRRGSFI PTTWSTF 

sgu«asnxvnwiyuwsev:plohixvtgcp^ 

ippsnjtilsaaltqijssssqowadkshaqilictvsu«sk>al ^b 

vmkhvfpykg7tsrgsmrnygw5gsvghsyaw 

YDPRYFSSSDmiLSLPIGIJUJllRFIGSllSStfta^ 
YTWDIOGVPLSKEAUrtTUJSTIICrKOT 

CPn_0540 621631 626662

pmp_20-polymorphic membrane protein

FIHLIYLSLIEFVMXS DRrSSMKWLPATAVrAAVLPALTAFCDPASVK STSHTSSCDPT 
SDAALTC t " VSb 1 L I IX. I TY TIVCDrTrSTrTWIPVPVVTPDAM DSSSK SSICOEiSSSSOA 

SAWAADPKGGAATi'SNEANGTtTTTTDSGNFG^T^^ 

KNLTTTGKESQKSCGAAYTglALTTQArVEAVTrTCNTS AGQGGA ZYVKEATLFNALDSL 
KFI^SGOAOGCrn^STLTISKITKSICriSNKASVPAPAPEPTSPAPSSLZNSTnD 
TSTLOTRAASAT P AVAPVAAVTPTP X STQETAGNCGA X YAKQG I S I STFKDI.TFKSNSAS 
VWTLTVraSTXGESGGAXFAAISZQZQQCTCrTIJS^ 
>nJOfTKNTOCGECk^YT1UCAL™ 

rSWOCTtaff SAPITKAASNTAPVVSSSTrAAS PAVPAAAAAFVTNAAKOGALYSTEG1.TV 
5GITS ILSFOOJECqNQOGGAYVnnTQCSDSHRLO FTSNKAAI3EGOGLYCGI3DVT1.TML 
TCKTIJOEKSSEKHGQCI£LASGKSLT*frSI^ 
TPTfYTCPAPVQOPVYCEALVTCOTATKSGGGCTl^^ 
TOKAAOICnCSmiTNVNmOTATaiOTIAOCK^ 

EDAULOCVITCSVSOt/rATESCCCIYAKDIQLOAL PGS F T 1TOMK V fl' SU 'l' S ftCYCC 
GX YSSGAVTLTKXSGTTGITGN5VXOTATSQI)AD ZQGGG IYATTSLSINQCWrPILFSNIl 
SA AlJUUhl imj IACGAirSAAVTIEWNSQPI I FLNNS AKSZATT AATAGNXX2SCOGA IA 
ANSVn.TJWPEITFXCNYArraGAICC I DLTNGS P PRJCVS lAIMSSVL^OCNSALNROCA 
IYCTKDKRreATFICMSKHreSAICCSTALTLAMOLtFENNKW 

FY»VNV5TKETKAQ£LiaJ<EKATSTGTIt^C^ 

I^VVSFT0SICTTITMC3CSVLSKKSKEAOCIAIWVI XDFSEIVPTimNATVAPTTLICL 
VSimStDSKDKIDXTCTVTLUPtOILYQNSTO 
GNXXytfCXGYLGTWNII>iraSGSKII140VrFD 
KOGZlXMCJ&lAJtFQ)PAFNNFt^AICSFUUCevSRN^ 

FZLGAATSQVFQ1AESEYHLDNYXHKCSCH5T0A5LYACNI FYFPAIRSRPILFQGVATY 
GYWKETITroSXEEnMANWDSIAWZJBLRFro 

0£KTTC1£YDPRSFSACSYGNLAIPTCFSVDCAIAWREX Z L YNKVSAA YLPV Z LRNNPKA 

TYCVX^TKEXCNW^PTfOtAAIUEVSSQIYUSYm 

RFVF 

CPn_0541 627137 626003

Solute binding protein (yebL-Synechocystis Adhesin Homolog)
NNRSSYCTAFVMHKVT VF I FLTLYS LKSYGNDVX DK PHVLVS I A PYKFLVEO XAEETCFV 
Y A X VTNHYDPKTYELP PQQ X KEU10GDLWFRXG£AFE1CFCERNLTCQCVI)L£QNVSL IOC 
KPCCNOHTTNYffrHT^PKNUCVT^ 

e:ltztskakorhilvshcafctyfcrdynfsohtieksshvepspkdvarvfrdzeoyxi 
33vxlleysciwssahladrfhhhtwa£^ 

CPn_0542 628000 629737

ABC Transporter ATPase 

FMTIRILAEGLAFRYCSKGPNI IHOVSFSVYOCOFXG X ICPNCGCXSTLTMLILSLLTPT 

FCSLCTFPSHSAGKCTOSMIGWVTWFSYDPCFPISVKDVV^^ 

EAVDHALDLVtlLJDHHHHCFA^L^GGQIQRV^^^ 

E US I UQCLNRTCriLM^DU^HTTTrfFNKVFYWflnX 

F5CSPH 

CPn_OS43 62B710 e;?603 

(Metal Transport Protein) 

K2G I FMLSSLIROSFPLL ll^VTTlAAljiA^MXMKTniV^niVSlSOSlSHAl^ 
IJLTLWtOYKlilUFFFIIYGAZVt^ZFLALCZCKZHLKYOEREDSLZAMIMSV^HAZCXX 
F ISRLPTFMCELZNFLFGNZ LWVTPSDLYSLC Z FOLLVLGIWLCKTRFLALCFOERYTA 
LNKCCVOLtOtTLLLVLTA ZT IVML I YVMGT XLMLSKCVLPVA X ACRFSYKKTR I KF Z SVL 

'Th_U r .44 f»H»S'.»fl tO0525 

yhW-'TTf* UintHnu pror.cin 

K.:.WT i KIKFFCUKOKNV ! MFVrVITLELRACKf fcNCJVVAWRKEKYLPKSCPYOGN 
. - i» IFATT.^TltAYRNIPFUCArr/^ 

Ah"H :Kt UffiKT7rr43U4.V:W k WXTKiW'FFXT^rRAPTKATpriKPrjEIROVELEtJtL 
1 Al » U ;i.V> 1FI -NAi JKlJTLKMTt tAl rTEVK\.\"AYPFTTl*APSLCLVU-*KOR LYOK PWIIADI P 
: ; II* W^KCO UJ'IKI.RU I tfl<TLLU»FV rrA^IKRKPJISPEroLCTLIIIEUISHQPDFEX 
Kl til .VAI JJK 1 « Jl J.IIM^KfcJ Lo::KOKUFP!:*mVL ( SCI .ft ,VM JLYRFFTORLAV 

f l._**M'. .. tUM»H i* 111*, It 

TKAHKKfvMAtiKr' v »::n» aut.TiiKRUT/KWMf rw.TV i i.vw^rrwitiPAOMVC 
t« :ki ii/it .i-Ai .vi i ; t wwKKTHirnf i: rwEU. 



CPn OS<" 



Protain 



EXLX 



CPn.054- 

— llV -— yr:-/-—- •ff-.-r.r rr»-rf - * -:pftt 

tAljUUCWOK;jWA:T;i^KWFUTiL^^ 
FCCCDCVOCFCr-CrrtttYCO 

CPn.0548 «3«4 632191 

SJttflEKTlCAOOVFtVUlELLSCSDS ZNDSDP ZYWWTOSNtJrrirnCVGrULOVLWlCS 
PCEVOTV^I^SPmVKVKlTTSEXVSW 

YrjuorraroxpiEiJAESVTPLiJRrrsiASSPDL^ 

CSSPXCSn^Wr^YXrVOPTKHrrLSTOTECKPLVMICAOT 
PCNNtLFFG ERXEXVNTfYRXTWWAEirCXL^LFl^JSRODOKm 
KAYnDCFFTJCCRICVUCI EVIWAIXEXl£»rLASLRKEHRYVvWY 

CPn_0549 633662 633255

rs10-S10 Ribosomal Protein

POOVQHOSMWIXRJUCICFKKRUJISKOCKK^ 

TAjamwunnraPXPLPTWunnfTWJi^^ 

KKLALPACVDXKXKAA 

CPn_0550 635686 633580

fusA-Elongation Factor G

UJYOTWlTJKSWEFDL^ZRNZCDlAHIDACmT^ 

OWMAOEOERGITZTSAATTVFWLGAKZKI XOTPG^FTZEVTOLRVLOCAVAVFtUVS 
GVEPOS ETVWROADKYGVP R I AJVWO^TOCADVF AAVESHKDCLCANAFPVKCP ioszs 



KKVUI)PWrrEI«ZH(^WCGVIENKIKPVLCCT 

NX RCZKIJCTTOEX SlXPRJtlCTtJUCLAIXIKroPYVCRXTrZ RXTSCTtlOOSSAl LIST K 
DIOCERXSPXIJJO WEir r DRflE FTVGPIGACV^ 
VIMAIE¥KSraREIUJU)AX^5LSEEDPTFRVSTNECX^ 
REnCVVANVGirKVSYIXrZTVSGKSITKrvTO 

KZVOGVI PKTfXPAVllCZEECI^TCvlAGYCLVDv7<VS IVFCSYHEVDSSEMAFKZCCS 

KAVI03AC]UCAJCFVILEPIMKVAvTTPEX«IXXVICI}Lm 

PLSIKFCrrTSl^TSOWlTSTKEPAJTAICWOKtOEEIV^ 

CPn_0551 636174 635698

rs7-S7 Ribosomal Protein

MYHSRRKSA£XKDIFODPXYGSVIX£CTXNKVMm 
VI£CrG£AXJ34AJtJ>XLEVR5fUCVOCATYOm£VASB3U^ 
VCLATEL ZIXnT«OGATZKKR£rm«MAEAMCAFAHYKW 

CPn_0552 636698 636219

rs12-S12 Ribosomal Protein

ZOACYVTSSSENXPLPTTOIALLYI SMLVVVRIJCREEYMPTXNOLIIUaUUCSSLAJUaClPA 

UJJGCPOXRSVCLOVrnCTTKXPNSAUUn^ 

GGRVKDLPGVRYH XVRGTtJJCAAVKNRICOSRSR YGAKRPK 

CPn_0553 637753 636812

No robust homolog present in Genebank/EMBL as of 11/7/98

GOWlWLRn.1 IFIU5RAVTPLRASESFSWETSTCLTVLCI PFZOXZ LTTNIDFVAOCC 

t^XCTZSSTT04AXOTZFX.rnCOTPEASZSriCRKEPt^l^SHl^Da: 

GSNeMXOCPAWPKDUUAnJUrPYQPDTLLYSEXEAE^ 

ILYG^XEKFUCETXRJWWrLVTJLCCSOWTTTLCT 

ZPDL)OATQU£HTVPlXFIYrVlDSIHXXErfflCC5SrrYN0DLTEPXU 

EYCFNCAOSSLGET 

CPn_0554 637806 636141

CT440 hypothetical protein

VFSYIXLCX Z LVYVRFMYEGXS RMASPTPOQ LHLOO KVT^SXA Y0Y S RSLAM I AT ALLFF I 
VALIUGLSU.POVF^PTSGAYFXXGSFt^IAU:XLLXNCVa)IJCQm 

CPn_0555 638298 640241

tsp-Tail-Specific Protease
HFVWCKLVRl£VVlXSU.PNv^nSDUJlEB:iia«W 
SYXQSFDPKKSYLSNOEVAVFWSPrnOaUJ^^ 

RNEHVXNPKELVLEAS SYO I SKO PMO^KSt^EVXORORAIIXSYLSLHlAJASSSRYEC 
KEtQlJUUrLitQIPfflEWvYLGINDHGVAHDRflEEAYQFHZRWKAi^ 
DEALAHRZQLEKGMCG XGWUCEOXDGWVREI X POGPAAKSCDLOCCOI I YRVXSKDIC 
HI^FRGVUC^RGC>CSTVVLDrHRGESDHTIAIJlRD<ZLLFX^ 
VTLHSFYECE>Cv^SEC0UlRAZ0CLXEXNlXGLVlJ}ZRENTa3FL^ 
VVVVSRYAIX7TMKCYRTVSPKKFYDCPLA ZLVSK3SASAAEXVAQT LOOYCVAtWCDEO 
TYGKCT IOHCT ITGDASO00C FKVTVGKYYS PSGKSTOtJOGVKS DI L I PSLYAEDRU3ER 
F^EHPU>AXX:CDfmi(OPLTDU7r0TRPWFt}KYYLPNL0KOCT 
SENSNFQAFL^O ZKSSEXTDLSYGSNDIjOLEES ZNZLKOM [LLQ0CRK 

CPn_055^ o40«t 640325 

crpA-15 kDa Cysteine-Rich Protein
ENCMS3r^P\*CCTCrrCAAAPE3Vt^IVEEIAA^C\rrACW^ 

KF rOPXP.ESKXFOCRATO I TLLVUIZ LLWACLACMF I FHSOLCANAFWL 1 1 PAArCLIK 
LLvTSU;FDEACT5EKLHVFOKWACVLEO0LD0r: I LNNTMKl TOrVKTtrwrrJRATTPVL 
NDGRCTPVLi'PLV^K IARV 

CPli_i)S r .7 t.l.Wt Ml I'M 

E I PR;iCL t P.KWTVl J\[.T::MA;V:r A^f 1 ; [ CAAVAE5LITK [VA/iAtTKPArVltfrAKKVR 
I.VhRNKOIVECK.:id :AKl:r%>^t<:EFy;R^VI^FA&OE.^WRLYSVKVNL»)l -nveico:: 
VPBYA1 V J: IPi'P t C U A U1KK Vr/llVV ITWI-I*' :EAEFV^.*:DlirrTrT;:i> W LVWK ICMti 

ryujDKCK rrA^pticixuxTTAATvtrArPRLnrrrrmTJOPA rctKcnipri.w.-Mii.-Pvr 
yk lEWim iarnvtvi wrvtu r/::iiA.TrK>Mvu:rNi/XMPP' wckvftvrk;po(«i«i 

'jlTWAT/TSWi 3tK»:;*«\NVTr/ , /IIF.t*."/» - rVH l^yVDWr'.YVCItPVfY;:!; V^NHtt/I^VUI 

iaw i m/rt it « vrvi io\n » .r. i » ? :t wrtitv t k mt :p/ oti-ofk lwkavvi * ikitnt/vav 
triEiiMcrrrrr: valttthww a aati m vi jthidp icv :nnvYR i rvrrmr t:AW/wv: : 
i.i ixk: y ei*of i a: 1:1 k. rr r: y OTVTHbAM'Ku ;:;ke^vef:wti jo : i ah axu* iea i 
omcA-9 kDa Cysteine-Rich Lipoprotein

KLKiaXAVLt^FCCVVr^J^^IVOra 

VPSCSWPOT3Tgtnp.TP? V K GC TS P DC R C K 0 



• 4!710 «4T»:!7 

. . . • f 

v :.. ;r.;r.:y :::inrru.:Y:LA<.OT: :.vmkahju 



FrtAWVFTffCYWTNL 
CPn_0560 645666 644098

gltX-Glutamyl-tRNA Synthetase

RMSRFtXraSWSKPKRI>C*EW^ 

RIEDrrDRTRSRODlfEEMirSAUWCCIOWDBOPO^PYCPYROSERTTC^^ 

TOCAYKCrATPQELACKRAVASTLSVTUXYIWinfin^ 

SGECVTOJYSICRWFPWADVDDCTVXVW^ 

TPKHUXYEAFCWEPPVTUOtPIiLNPOCTICI^iaUC^ 

MGYSMECDEEVYSLER 1 1 ETFNPRRICXSGAVFDXOXLOWHNKHTlJMra 

OftXNDEFFUCILPUttSRirrUlEFXNLTSFFFSC^^ 

SYVKY W r U J Wm T n XCSKWLAOArWHHigCAI IPLLYVAITCiqtOCaJLrOSICIL 
CKFRAJUkRLVYAEJOlJaGVPXXl^AT^^ 

CPn_0561 646407 643871

euo-CHLPS Euo Protein 

UlACZQHCCCVEUEREEtCJtXDStrnCWVSITOAAJaiO^ 

TTO^IDIKDIXEYXRNRVSWaSLYOCELVIT»CICCCYSIM3VW 

CTXRCEJTOAAWIHVSEIERYIWEYl^KgAAJaCLIC^ 

CPn_0562 648051 646918

CHLPS 43 kDa protein homolog_1

NYKVIKS UIAREXJYMILCNHPKPSIAMrSSEOARTSWaaWAHFyLYKiEI I>CWX 

pxjxxurriFLGijwi^icONFiiija*^ 

VSSVRRVCI£YI)EWIDGI£XJU4'NAJCF*1R1^ 

ESOSmLIFWPCTMKSQGNXTRNNVVKSYQA^^ 

QAEAl£KEIAIXlSX)5VRWFVVKI)RGARSTf^VAXQFXX& 

LHCPELFrYCKDSQG^rC3CLFKKETCrAAprLPPKNLEECSPCKIPVA0T^ 

SOOVXKEVACHZORKrCN 

CPn_0563 650113 648393

recJ-ssDNA Exonuclease

QYXKZXNSFSPXCPaSZXFMXNSaiASAAaUMAHPXEDPA^ 

ISRGF0TIOEIH1CFLY*HI^SLTOP*ILFU»(SX^^ 

GVAlAVEFUU)IDVHVSYTFtjGAIt*OHOT 

DITROOIWIITDHHMFTGiaPHCVATI^PiaiPIWr^^ 

SPJO.VFKS0CSUClCIADLVTLCTrn7TOV^^ 

<a/IXSTVT$TDIVlJCIAPKIliSI£RIII)PAW 

XEAEVFODTOEXUISNPEXLJCOMXVX^STAW^ 

IGRSSAJrTICSFPLLm/IJ«CSSUI£VOMFAA^ 

KcryrunaxiDAYADrnMDYOUASMEX^pnaDC^^ 

faYLSQKERNLECVAFClAIUUDAUUSIWPL 
SEFRPSD 

CPn_0564 654359 650145

secD/secF-Protein Export Proteins SecD/SecF (fusion)

SCWMICOKVKOTAIIIOTAIALtW^^ IKSFTKQAQQV 
PJCTAttPRVSMI-SSlJltJBHIOOHPAIPDIVSVRFra 

RliiWCYSRDiDDHVIC'VASSINrSLVESDPSFVSYSSENEOIMASS ILQRVYSACTTPK 
QKDCSCSYPSIWETAPXEOU^Am^SCrrVF^ 

PJCIFLTUISDUJIOKTCLSKEORI^KRI^^ 

TQGKI IL0CERLLOCIAEHLTALTLKRPAAZSCOLI FENFPVFCROFRESZAFGCYIFSP 
mCXMFSKC5VYXUJCCU%SWAlCYOQOQClCEtOSPI3Ct^ 

OQVLEIRH PLOOFUnA<aZ3FVICaCBOCAn-EVKDI QDRLATVNQ I EKNRQSDLVRWHEO 
7RKAXCSHDWERI3APIPYONLFI2NK!aVNKRr*^ 
KOaCOLTDKEOXLICVSDEXXJUUJaajraEXEU 
rtSFWA/MER FS SYSASRYEVORnJCrnjhnTSOAOCICTSPEn 

HEAITKLKSEGLATSP SGCH P S I' UL EflTF^XAIGCTAEQIUkNPLVIVrRNYALDGASL 
KDIRPEFAAGECYVLNFSVKDTS PKKMAEXLS PTES FHTWTSAYCOECI SCTANOOYSAN 
RG*RKAWItXWHWSPIlJA^ttNHASVSCX^^ 

TissDLCwccyrrwirsAccciJWLivijts\nnfKFG 

PLTI^GXACXVlJlMC^VDANVtATERIROT 

'yLASAiXFFLDrOPIIOCFALTLILCI nSHFTAi.FKrKPTPKLhMN)RQHTOUa«IMCFV 
OIKHDFUlGCKKI>LWSCSVTLLCCVAi^ 

OHRCKVWKUJEAC^SRDFRIQTFCSSEXIKrYFSDKALSrnCNOT 
A\A3Ll£ETCUFSTm2JETQNFWSIC\reSia^KXMP-^ ILLYVSLRFEW 
^AFSAVCALXHDLLATCAVU- tAHPFUCX lOIDLOAICAUffVLCYSLWTLII FDRIR 
TOROANLrrPMHVLVNnALOKTFSRTVKrTATTI^/U^LLF IOGSSVTNFAF XHTICXL 
LT7TL5SLY I AP PLLLFHVRKENRSK 

CPn_0565 655741 654533

TT44v hypothetical protein 

FIKLFCFLXFCFA/NtSAILFOSSFLXKIXRNSXRJILilSHKFPRXSXSDLXPTOHVIVMtK 

r^JVKVVPNAQNLPKKXlGGVLACFCtAlXGCAAFAAG^ 

UTYSlOWSRFEHPLFRETlCVTOPIlefl^^ 

£ t VTK t F0KK20RNTS X PL tOEKDQXALRQG XEKSSLSRKTFAXDPSWS SLLSE XOREE 
C/mX»PKVlSVT.SE0Q*SDRTHPK5AIYVNI3E^ 

i-n r vkki rr t yvltp i lcvpoalpkeeoenucllsoaaflysaeov akrmreekods XR IK 

F I FTOrTSPTr.LY F:JPlfll:1CTPH.'!VT P I5LSCFVCE9E3YTFA 
t.jinilv 

id* ivtyai. i ma volw atnnae:;kfp:W)R1.pnhva 1 1 mdcnrrwykkiireecohtht.': 

IMYV !AKVUtUI<NAVI JMC IKVLTLYTFCTENFTOPKEEtOEIFNl FYTOLOKOUPYLM 
KflKtCLI* ' li:U^Xl,PKt; 1CTK tWI\ft3WTA:;F;:RLELVt AVKYOflKDELVftAFKKUIVO 

r utr.v. I ::::iMil-*:Fra J ;;:YLC>r:yXTr)r«txinrpr^EKRV:wFu J wu iaytely tTDTLW 

11 *TI VMdPKAl HVYiVHtfltt 

'In.'lV./ i.'.ntfH r.S7Hl7 

"•Ij-A Hn*t:|4i.ii i.ln i'yr t«*V lyi r.lUKtOf i:." 



VLNSMCmKTTAinOLFOP'- 'SMTrUT^YSJtI^^A^;; A ; --J"^ 
EYSS*AlUU(W0fPLrrF^X^3FLFUU^FLJ «*^^-^'5!5^r^^l22Sr 
RVRKSTICA«)l^CriT3ar^Rl8iJX>(Vt 

DIFCYFTCICAFGN^IATCI3PWKTVVCr7ACCLCirrLKrrT FTTOtFTRFAifPPWA . 

UPUaJU^rTCFFCDIIESrFXRIAKLKNSK)^^ 

TOSKZriC 

OPn 0St« 457H(!« «S«4M 

i<\v.*r ■.-rr-r-r-\FAUV.'iii.i»*iv~"':w— v/apu'-iwatltltW- 

EEPPFSFTrATSOPLESFF: JCHLLTSELTTttEVANAAStUW LPEVRAFKJOIORRYAOL 
QgCVTECRCMCSXVTPNADLX I FLTSSPEVRAORRlJULPEXnX5PtX3L0AXLVKRfiAAD 
AQRAHDPLVX PDC I V IDS3CUT I ROVLEK I LALLFRMEL 

CPn_0569 658398 659099 

plsC-Glycerol-3-P Acyltransferase
tJCFWmSGTS^mSlCRAMIFRICKFFTWA^ 
««SIX£PIALH>CVNEC:'fKUkRASIXNIPWl>«0«rc 

WU(XLVrYPCy^PtX»LOPCKWIGM<AAJCSRVP X XPVYIRCTFEAFKRHOKXPHVWK 

TXTcvFtnpKrFtsii<yn>EixNxx?rY9xrxvoTm 

CPn_0570 659044 660789

argS-Arginyl tRNA Transferase

TKU^IOOWRGAKETCFKlXSTlJ^XLSVXCStt 

HY0OIDAMK£JUtVlJQUU>RAXA£AIVA£^EPFSLIEXAC^ 

FK0MJUCFQV5QPXXIIIOF5SPNIAXCKHVGKIJISTIICD51AR 

rGOMJTAfta<UTYIOENPCUYSDU^TSLYXKAY^r^ 

POAXJamiCCTSEXAFWXYDXUXVVmGESFYNPFLPEXXEITUX^^ 

KCVFHEArSXFFMVOKSXXXWYATTXnJU^ X X XVTDUSSLHFOLXXA 

TAIAACYI^tCIFSHVtnTaVLOt^CIQCLICraSCE^^ 

LTTlEMQERAFVIGIIi^lCYSBtJSHRTSDYVFSFD 

RRUXSQLSUCPPEXOEPAm^JaTlJJirFEAlJOTXKEXXPHFI.Tm 

miDSHXOOSPYAXSJUJlXAXAEQVXATOOaXClXt^^ 

CPn_0571 663179 660749 

murA-UDP-N-Acetylglucosamine Transferase

TTtKWVSFSDFIlAOTPJOCIAOVFOCCRXJ^ 

RNVPDXCITVSLTVEXriCSI^AHVSWDXETE 

AUiJRCPECVYVPTVCCTAIGERTIJJFWPTC^JC^^ 

LPYPS^TEJO,IlJUaKAJCCirTVTKWAlXAEILX)LV^^ 

TOOjCaVI^ILPOIUEAASFO*AAVVSCCRVrVW4W 

SSXIFTqERPLVOCWLimWPqFLTIl*XPFA\aj.SOAOaSWXHg 

L0HHQA£CQLJ7i0Cl»STKACRYAXGNFTHSAVIKGATPt>IASHLVI POLRACFAYVMAAL 

XAEX3KSXX8m{LLDRGrrM«VCXLRSt^^0XF 

CPn_0572 662349 664616
CT456 hypothetical protein 

IMAAPIN0PST1 1 IML 1V/1UJ1 1 1IU lVCSI^SEHSVTTTCSGJUkAOTSQTVTt.XADHBCE 
XA£0OGSAVSFSA£HSFS7XFPETGSVWAOSAOSA^ 
SISRTSSHASSCETSPJkESSPDLCDI£)SL^C^ERA£CA£CPECPanJ>ECT 
ASIUlFI^PAVOOWtjrlttJCHPVYVIJEARSSF IFVRtCOW5TAES X S^SJtAXTRSCXTK 

nssscrTF^AiffiRaAicvrrcPXWDovaciJQGiNu^ 

CPOmGATVVP14VNVI<U%XXVCLCSXNXX3CX^ 

rTSTCSOi»llU^IQ>rX)PGOGEg)WIPC7mPPPPGPPPKI^SRXXTISNASIi^ 
L^^V»QHIJffAYDS^OCVSIg^DUg^VKKSEX3^^PTVILPA HUUIUPWATOO 
VTEQOGHXRNX X0RNT0STCQSIXMTPTP0PTXAKIVTSLRXANV55SSVLFCPCVATTX 
TPOARTASTSTTSI LIUIAISI ISlUlUiU SV S TQ S T G VCT Pn 1 lRilUl SATTTTSS 
ASTaTPOAPLPS(nRWATXSLVRNAACR^IVL00GCR^SFPXPPSCTCTT7»K^ 
AAS(TVASTtC0WNOAATACSOPSSRRSSPTSPRRK 

CPn_0573 665413 664691

yebC family

VtDMACHSXWWTOttKERADHIOCGXIFSRXI 

□OnPNENXERNXJUUTSAEOXNFEEVTYELYCKOCVCI IVEAKnMXNRTAStMtXAXN 
nUX^VEPCSVLYNFARXCACrVAXSS XDEEVXFSYAX EACAEDLDTEOEENFLVXCAP 
SEIA5VKEXUS0GATCSEX]RLXYl^LRX.VICTEXDGEANIJaJCM^ 
S 

CPn_0574 665978 665394

No robust homolog present in Genebank/EMBL as of 11/7/98

SACRGFRKFIVMVETVLHNFORYUKYLYRWRFFCRJOCT 

KXYCUEXYCElJlAOLFQGAUUjOXCWFCRIUkTPJCCKSWUXFHE^ 

E X PRFFNEYLVYKEtfy/HSWPR£YSl.SCRS I F KCKXFKEYEORFPLYDRAVAWCXAHAYL 

LRCYKXRVOOGYCRA 

CPn.0575 rPoS24 165982 

YhhY-Amino Group Acetyl Transferase

SIFCRVWRSFKrAEXONTCILCLEIRYTLPSDATYMUCWLfOPKXLRCFPIOTEAEXRET 
VMFWCFYRYHS^TA\^7«NVACVATLVU*PTVKVSHHALISt IVCEEFRNKCICTAU. 
ffftt.IKI>ICTRFKL£VLYl^VYECNPALWLY0RFGFVEVCR0NRFYXOEICYl^JCT/IMEK^ 
L 

CPn.0576 «7S43 66^494 

prfB-Peptide Chain Release Factor 2 (natural UGA frame-shift)
HOENUNCRLCALRTEXSLAARSL 

CPn_0576.1

prfB-(natural UGA frame-shift)
H&ENti)KRL&\LRTE ISLAAR::L 

cPn_H577 UJtfrf* i*..; <**. i./.hiss 

ErrnnoKnic^AFMirvN [ xrux av i »y.rr ii-mprte r vrrww i KKurir.'OfWNKRWiL 
W>ani^kvw;;:.:p»' t r**vwrr au;ki i [ vk 

Cl'nJ»S7R C 67J38^t»».K t>t>t tM, 
y.jtf) -ptvjt^iht'Uiyiitnl.iiit? 

ttti:;f i vl t :; i r. latlp i u\f. :wa: ;k i k.i t <wt , pttk i r«ni .pk ici iai iu k ;u( i av i :.x< ji 

Kl IKNVPEXFLNKV: tK " I KNP: .' M A . f V P 7 :i t\ A /rMAR LFXiK ER LTTF LfIT l,EAn I JVFA 1 1 * 
riatUYft.'VC.'IMrrarcf'rV I [•EKr::KMVl-At I AVMr/XFr^I-tTHYOMJl.TIVKntPfW. 
UUJCNTTLTLUWTTHVrPOTLNrvCLCOLFARu. -^EOAntNYDPSLP^U-SMNPOC
ITTUJLXJVr^DTW^^pgmjWPItFAJlOT 

nrguxjucr tKFcsppciarrrcsyo 

CPnJ>57'.> «i69)l0 66999) 

vooP/y<tcM*.«ua«r Nucleotide phoiphorylete 

KEFASAPUJCSATnHVPMI KS3LI li3CXK?CTRF5S^I PKXJY^ 

r.PorAi^r/rDpnYOCTFOcrivsrArpfOTROosvrscLoovrrr»v/r 

CPn_0580 669936 670793

truA-Pseudouridylate Synthase I

ASSOffLPlUttNDCF5FPHnEVAU.MY0(nWSGK^ 

CTPLIASGRTOACVHAYOQVAHFIUFtOTLFAJ^^ 

FHAAYLA XAKE\TlYStJHUJUCFLFWQRJfFCfTP1UtPFSTEI2(QEG^ 

HSROniSTVirrrmDtVDK3ISL5t ICRCNCTLYlWVRNLVCAIXOWinSAYPPEHLLO 

ILEOPnWESPSAAPAYCl^UiHVCYSSPYtlNFCCUICS V ST S OTC 

CPn_0581 671533 670745

Phosphoglycolate Phosphatase

GCLmmsVXSFLROCWIYSMLVSDEFOLCUtSCMVXEOYDtTrm^^ 

FtaACA*FSt£WWDFSTYYSHTTlCTIF510C« 

cHAGPAXJUCVEA/IEIjVLSI/OCTFGVVTIIS 

PXPYGOSYI7YAYRTFAJlEOOC\rcGFEDSVTGIJ^^ 

GKEFTSYPSrDVLTEKCSQOKLL 

CPn_0582 671305 672177

CT465 hypothetical protein

KNPNAUJaCiaHia.V»OfDlQaCVLYLOANKUI0X 
SNIVLKNILRXSTVSXATSCSFSiaiSKrCWPERXTSO^ 
ISPNRrNaTAYSryCROQCKSriTPECVL YDaX^ HSi mitfH YIHP^ 
DKPKVSIIEAFCCFIOIrWIZAaiSLSOUaOTAATTA^ 
KSDPSLSNF IVTEArLTHPgJU.T^DLCSHTSITVtll^yLQKKrAf^SA 

CPn_0583 672349 672717

CT466 hypothetical protein

iVTJ^rri'KTKVTPRrLMNgy'n Jrl J.LKK ffTt-rTA Tl^ TTTrFF^^ 1 ^-**^-**?** 
IFLSCIDRVDLOIKEFRHAKSELPODIOEELEEIROVI IRILDrXSatNYAOKKXEFCX Y 
ERP 

CPn_0584 672659 673798

atoS/ntrB-2-Component Sensor

IR^TMKMJaWLVFMWPOSO^PPAmiXIJU^ 

SETOWrLICNSOAREXLGtDEIMLNRSFTt^ 

CKBSKEmrnJIRJCNEISCYIJIOIRDJlSrW^^ 

NPl^tVCfASXUOCEISSPRHamX^XXSCTR^^ 

DFTSStXPU-SVSrpfOTVRIX^PlJTlSIDPDWWSVNn^V^ 

TgftPT«mrnrTTTPggTig»rr j ¥P i i i m t« »rM 7 r /7 r ^ r » l OT r TT »t MrrmTQf jfFencw 

SFTI IIPEUAALPKERAAS 

CPn_0585 675180 673865

similarity to Cps IncA_2

XSLRUCIIJtFIWrSXCXCSSMMATPAOXSPTFOIVSrVim^ 

XAKUOOOafNKrXVXVrcUXLAXXrXLCBGXI^^ 

IX^YD3QCLPmJ»PVPtPQ0IQIgPIJWETTlE\^XCrrUXV^ 

CEXIUO«LDRJaJWEE£IX.YRSTAHLraEE^ 

OGIKTVRSESGEXBISR^ 

LQEBLIg.TSXAnTOSlJYRgYICBCYl^KXJMQKILQE^ 
t£OKDANLKXAMVWEEmaCQmEIffZ^ 

QOHYfifttij^mtQWRK TLgggMwy ft p r JTMOmfttAYrmAin JXaAAPTETffTITnP 
mTDSASLSOKKXRELVEDQOIJCALArKSNXa^^ 

EVERLEQEOFQC 

CPn_0586 675993 677183

atoC/ntrC-2-Component Regulator

KEXXNP SRGENMAI KN ILVVDDEPLLRDPLSELLTSQGF I PDTAQJLRNALQH I RSRDYD 

LVISCMSMPOCSCLOLIKX IKQSSPHTPVLVVTAYCS XENAVEAMHQGAFWLTKPFSSE 

ALFAFURACEtJCNLVH£MLFlJlSQTTPI3SHPLXAESKAKKDl<lAXA)aCAASSSANI FXH 

GESQOCXemFnKHKSPRWPYXKVNCAAXPrril^CLFQlOCCAriCATnaCACR 

FEXWaGTLtADEITEVPVNLQAjaXRAIOOCEXEH^^ 

IDDKSFt«DLYYRI/WIPLHLPPlJU)RGCDXLPLANYri/l^ 

LLNYPWPGN IRELSNVLERWXLENTSXXTSOKLALA 

CPn.0587 677)78 678124 

*yvyO_B» conserved hypothetical protein 
SYCElJIliTUJCHHVTLCDKWWRKWS^ 

IL£XS0HLPPH£TXRVVLT5HKDKL07XVHWASHCKCXLQTKVMNANPYTAVINAFKK I 
RTHANKHSNKRKORTKHDLCLAAJtEtRIA lOEEQEDRI^EWLPVCCLOAWDSLKTLCYV 
PASAKKKISKKKMS IRMLSODEAIRQLESAAOirLI FLNEQEHKXQC I YKKH0CHYVLX E 
PSLKPGFCI 

CPn_0588 ti780)3 67B626

CT469 hypothetical protein 

TSKSIKSKAFIKNHTATHSLLNLPSSQDSAjCDSrSQSO I FOP I RN RELVST PEEKVRO R 
LLJFIXHXLNYFKKLX I XOCELKTLFPLUOUOTLIPFJtRPOILI ITPFTYTDAQGNTKM 
LCDPKPULLX ECKALAVN0NMJC0LUCYNY3 1 CATC IAMAGKHSOVSALFWKTQTLDFY 

rr^pcfsaumrisutL 

CPn_0589 «7RfiJ4 »i70)lS

CT470 hypothetical protein

I'.Vn WJUJRPlXiKWrrLTTLFTPEnLFTFFAKCiXrrWCDYRbTL'/r t SLCKYT 
I JIKNTT.RLTKLTIItlDI LNAFEA IKCT-^AUXA3nKMIQALUU«>WEKP.<IIKLF;:LFtNF 
I JIM [PE^Mt'EFFAAf FVLKLLOYEn ILDLTt'AC;u:KA:;r-PY.\CYH\\\aiKU:KK»OHK 
IHIRKEEfXi IWA I 1 IIAKQFSELLAIAEFP I AtACKIFYLFIfc-LaCEKK.TERMfl.lF^P 

, *liRXrj>ij:icvviii*Y 

CPn_0590 ».Hon<; A7'i5t»;

CT471 hypothetical protein
'JFVtn&MUTFACrtlSTT I . .•J'UFi *NCLi.1V P* .VL. *EK rVFYTI I F FTF "It jYPOKE
»#IEtOAORKRRVmfLTCEFPItLrrLfnrCl^Fni(LRAXrw n,-vn,XYAL HFJC^C WO« 

ofwgwrsjt i ? ±wm±xnu cL9 wear t wwer qcyvtmftu agJ ■ 

YHHDtVCFSeVTKFWS^U^C 1 1 FT ^ •** ' " ^ 

CPn_0591 *S0)dJ DM021

yjgE family

T, 1. 1 MRCTAYCTA3AYNLHVLFH UJCPPV FT I t^REYV^lJJTCASKJLA 1 rrP _T^ V 

^lLOTPOrF^V^AIYRT\"l^^;^XN\'L :v ~ 

CPn_0592 6811): «6l46l

yidD family 

LYSKMrSHSFKRFLOOIPVRICU.: lYLYO^XSPX^CSCCRFrPSCSWACOAXJCSHCF 
LNOCWLSIKRXCKCSPWPOCXXM^ICTAUJZ^aXPYCEXKSOSSHFS^ 

CPn_0593 682494 681)91

CT474 hypothetical protein

VYUC2FtNtXS0A5tW*XSX^KDERYMYCRPXICLUAl^A 

TtLttSSVMLLPHlBLXDTWZt&rUlCXKtm 

CSTPEH.YlJCTU.VGATOA5SVA5UAIUfVIRCC5ERrFm 

KSYinrmr Af r r T,^wr«DVvi^croixx^cvxRXj<pr^pYsowmiu^PWE 

UU3CTORVCAPRWEI»DErYV^DCDSL^IAXRFC:PHDKI XCnCLMKHRLTPGKV . 
LXLPAX0S 

CPn_0594 682517 684958 

pheT-Phenylalanyl tRNA Synthetase Beta

inCKrTOVIVKSLVKrSXJU^SIAXPXtllX^^ 

LYSFASVXTAKXti{TXPKPIlAjDXLRVXTt.TZX]tXnQVVOCAP 

rP?FTTn*YTT inrgTrT g^yerytM-mAnrTrtT.nrT^T^fnixf r n pf?mr t TJ v *' n " r - 

IfrSLUSLTFNLGXCUriiXAJtEXCHVTOM^^ 

rSYVVTTCISA0PSPIKWI3U3AlJC0KPIKArVDrrWIK^ 

LRVTO^TPESLTXXNCXTVLLPSCnriVVKODHSI^^ 

AYn^AtRASQIffl.PIPSl^YltrrWSIPPONVVPALOAAIHY XIXIFPEATISPXYSS 
CtXCRgJCEVAIJtPICTWDjCKSFSItlLSQICLOSLCrs n'KJLl lOX VKVPgYUHDIW 
mDLVmCRXXSUMXZTQNPV5rYTPXYKLXWACFIJU4ACL0E 
LTUEZXEEXSXOSSWTTVLKSSXXPtSiJCSAATKIJOt^^ 

rrqriAiL^Tm^SR5Vft.TOSL5nfsui>wmxYKHHi^iDALn rtsArrffMPY 

QQOVIJlIHKQSFATIjCXfVHPEI>XXAOIKHPVFT> FT WL TIT. T CPflJOCTTICLYOYAXYP 
55F1US»TLTVPII>XPAiaiJt0iajjaX»lCWLESVrXX5XYC}^ 
YDrrLSWDIt£XvYCRLVAr.rNrT.f.I'I/llU«I IKS 

CPn_0595 684943 685926

CT476 hypothetical protein
fUTYOFMRQXXFCVC^AKSCSAYASPU^PSVMCE^^ 

TKVXJNMTUSSnfSGCUJffXXTXTFPHTTAXJlVVQ XYX30CRX*VSIt)CrTFVM3t 1 PSOEE 

LFNmrrFVLTWpoNHDSt7rrTWYr:rrr^ 

vrSOWIU^EETTVEJWMCYTrFYPNRDPK 

WRYCf\ja.Tl , lVFRMJLJ».IStIAY\nCCVKEClTLRYNro 

AOCI0IMEWYYPCRSVSIUUCFTO1UAC 

CPn_0596 685930 686457 

ade-Methyltransferase

FAV>tAron.IPKU«KSL50AC8ECU.lAXYPPU^rvKFD 

LCTAAiOXAM0EXVI>CS3lYAjaCE«PPrSSHFAICl)[,IPS0YLtXL>CW 

XAKXTtyiHPKTVGAAClGQNPflXFFPCHRVV^ 

CPn_0597 688215 686479

oppC-Oligopeptide Permease

HQCTPSFYORFI^YYJNUJtflJWXFFISVALIGIYA^ 

RYtJTPCYYTXPVDLFFNVLllVTF PFF XLS FKLTRCWLRRWLLGLC I XSOCM X FAMAYSC 

KVCPPALAOgJCKMRAEKVRiyiSKVlCEKVHIXPKin^ 

KYIDOXOEASVKKYOVAFESXRQSmPTlJWLBW^ 

tNjmnmPFixuTRXEHXXiujum^^ 

UXVAXXRSAISFXOXlKRIJhri EX£SEX>LR X LXNPFF5S FKWQJOAOCSREMNKYVPWQL 
SRVTRXXXLAALVFC XRIALWACXCXT XALA IC IM XGLVSGYFQCIVGMX LSRFTCXWE 
TMPVtP lUO.VlSlTQQKSlUirTVLLCCFSI*rGrSRr^ 

PAESAVLWPPAX ILTKLLXAIALXCOCVROALOPRL0OS 

CPn_0598 689712 696219

oppB-Oligopeptide Permease

EGOCSVLKY ILXRLXT. I PLTLFAXVS XNFVXUUAPCDVLEtKSRIULC»ACIC50l(MMSY 
KCPQRYLOFREHYCLTLPIFFNTftPX ITHKKIQTAIjOCIJWAMNTTPSAKNAAICSLVYVC 
OCAKFVKPALLFEACDASRDOXYRH I AADLFX ROCVL0CFVCPNLS PEORAOWCEIAXSN 

FtTTUWOAHXTVIsEVIKRUlCSLNA^ILPMIVrc 

FLILFSX PVFVAVrH t LDNFV I MRTX PFTTX PKPYSCLR SPPCVFW ELSTLCR I FOLVSH 
CFLPFCAVSYCAtAAOSRLSRS X FLEVUQOFXCAAKARCLRWFO I LYKHVQKNAAVS XV 

DXCYVUDPRVOLESRRI 

CPn_0599 691823 449682

appA-Oligopeptide Binding Lipoprotein

KRRE^HMYKRC\TXVILXCIVAC3LILLYW3SDfXEWtK5IKCMVR0I0E0tREISRV 
VKCWyT^AlPAArv^L^rKL'/RDE^FAL^rCDPSYWU^U^YKOOTU'EU^ 
HIHILRTAJIVtlKrENl^PFfCFtr'AA/CFYOLCtPnLAliHIVnKYEEFItTO 
E£irr4JKEFH I YLRrWVFWRp C wPKALfT-HVULOEVFOR HIpVTAIID t R PFYDAVMIFYV 

rR TV(*j( F ANT t EK [ 1 EOEM IDT'/PTN:* (WAONFTXIIWAW// 1 V.Tr XVtYYKAiatDOCXtVF 
: iRNIlfFYDPLAAI. I PKnFWFF EJ7TT?rf *FOPFKT f IKtDI ::Yt.PH40RONKY^FKlb^AYN 

KyvAr?WRF^:V«jl«AYT\u%*/M.T;;f,FP'.i::HAR»:Ami«uwFJiluXt:L^ 

t :*i;i'KA.n:i:p:nyNKOlFa*im*.*:r-KfAr\i'ija;KB mi oti* : inckv tr* v ivr-KflFRu: 

y y vrrr/TAitT rArtVATAt'KK r'; xn::;i j/u j maul^ sjAKf/WMKRAu je m :i i : i pi*eo 

f'RAli*fll.'7CIIAMKKt*<*ANW« !FIU IF.FAf *" I UKiU:VKY(« JCKfUHLYHRFIIKI (HF.FAPYA 
KLfllkllCSLLYKmMCN | FVPT MITHL I t-l^-i^rr^lVTWWIJ'JtKKHtX'L:^; 

Nn t*4«nL lN«*tl.N| ft,~:<-t,t u. '^<iMt»«nlt/KKni. .n; of II "//•« 
HfttllKlKKSFOYSl^OAKRPONMLPNHFDPCLOf'. -J LKODRLAYCEL 1 1 LL3KYQOKT
r.-JSLUtECTCILNIlAICOHLLYKI LROriTMOHUGLCLNCKCEIPHSPC 

CPn_O60t WOM -iUni 

CT4?J hypothetical protein

QFPRXHADDI IN»O>EIT1WPLLR0DSLWWVir/S^^ 

AFCASAAVS 1 1 rTANPLWVTIOtXLALCLiPIPUICLU IG 1 I VLLYG IY LFP00RE 

kgfiocplgfV^^ LLIALErsr "j rr "f S T 

VI PAFCLACFFLTLFVCLvTRLYLXJCTTOFFEDlAS W 
AAMT10^INUX*3EYSLLSEIFKFT.PKHDLXRKFSCFCF^ 

XWOAI PVDLSAHVSLADAYVALSCLYADPRXYPCnJANWI PSGRYSACIOOOTATA* 
RMEETOIUiEYAPGNAWVHAOlAYSYTOtJOMPKEE XOClfEIVUCLWHWIL'lllalUiSIL 
YnjOaWAKGUUYEEIKXRIJTlCKSOXLI 

CPn_060) 694136 695185 

hemZ-Ferrochelatase

WKXKPilVU^LVSLFTJUGXVTUTmYLIJW^ 

lprvlhrhlftf iakxrvpkvlpoyqslqnws p rrrrriTn^rn^Eiuixpyj ptwjyl 

PSTHEKTLLAUtHifrRHVICI PLPPHFTYSVTGS XVRFFKKHVTEI PI SW1 PQFGSDSK 

FWLrrcHiRorwiaj3:LacEccrLrsvio.pvRY:soc»PYSKOCYEsrw 

SENFtjCFOSKFtJIKPCWtJ»PSTACLCQNIDTXJKPW 
LRSRCYWLRIPAIYSSPUIOTLVOIVKEIISTWAEGJKSGiaCTO 

CPn_0604 695981 695196

fliY-Glutamine Binding Protein

CKKRQKSEAOUA/KIKFSWICVNFLICUJkVGLirFKSRV^^ 
TSOTNAFLNDLVSEINYKENLN IN !Vl&ZHWlJT2IlBDKKTQ&fTSVLPTLZXLXXfQ 
FSDPIU.TCPVLVVAOOSPYOSIEDIJtCIUJCVYKrDSSVLV^ 
LEALTSHCYCALLAPVX EVTALI ETXYRGRIJCI ISKPUttDCLRLAIUOtJIMaJLLBSPN 
ACLVKTRRSOCYOAIKORYRLP 

CPn_0605 696737 696150

LRKLCSSROW 

Qe^GFEAIJRQU^SVVPVDISIKAIOLIKITlSAlJXTO^ 

KRSnJLniDPPYrLCNCYVETUJOKWSGNILNPEXmJT^NASDEElA^ 

fOXaCTYLAEYIVEXSP 

CPn_0606 697492 696707 

CT488 hypothetical protein 
SSYSRROUirHCSLQMHIYGtJU)LKlJtf£VPE^ 
E0IVXIiCDISNAttJUEAHKDrAnG2lJCri^ 
YUQCFAIXTPWA\n«NnU>fDSmCVKXBNn»TPST^^ 

ArAALwtrvTEviwmfYppissocTTCPiscnxAixaw^^ 

IRGIKTCLVAADYVNFVPQEVM 

CPn_0607 698910 697573 

glgC-Glucose-1-P Adenyltransferase

NRRIQHIENDFPEASNFESSHPYRDKVGVI ILOOC£CKRI<SPLTMCRCKiWSFX3CRYlCL 
IDIPISHAISAGF^IFVIOQYLTYTLQQHIiTJYTY 

QGTADAI RKULLYTEDTE I EYTLI LSCDOLYKMDFRS XVETTA UCTHVDKVLVAOP I PEXD 

AYWCVLDIDSTOOJDFYEICPOEKEvIJCR^ 

IU>5LrSUJtim31DFC]CHLZOA0Hiatf»VQTU 

AOtJCIJCYtOCMIYSXNHHLPGAIITtJSMISSSLLC^^ 

ENSWDOSI XHSKARYCSPSfPSLCIGKDCEIRXAI XDENCCIOXWIOjQNXJCYIICYDS 

POKttFVRIKI IIVPOGTHIPDKYIF 

CPn_0608 699690 699016 

Uridine 5'-Monophosphate Synthase (Ump Synthase)-truncated?

VSFT.YFVnGRIU><M«NYEDAJaitGQAVjaLYOI^ 

ISSPXVLQTVATLIWPARPSFNSStACCVPYTAAT^^ 

SOAIKVGCLPTPOQTCLVXNQKVSSCKSX IETAVALEEUGLWREALVFX«ORWtEAO0PI* 
CPOCIXVSSVrrVPTLIKALIAYGXI^SCOLTUMOCXSeXtXIES 

CPn_0609 699672 699986

CT490 hypothetical protein 
0NTKN5URENMLIRLFI£ISLPKGFPLYI^PPLVIATFCCT 
NLNYHYTOEliYKAVPCNYKSI YREI PLI X FPEVLICSTPTOSTE 

CPn.0610 701450 700029 

rho-Transcription Termination Factor
RIFUirKGSIMKEERSSEIU>RVKET1CKHAY\^^ 

I AKIjORMC I EELMX LARCYCvKNIGS LTK50WFE IVKAKSERPOELLXCECVLEVX.POC 
raFLRSPTYOTLPSAEDIYVSPAQIRRFWJCKGOTIICTIRSPKEXEIC^ 
STPDKAXERVLFENLTPLYPNQR IVMEHGXDHLAEHVLDLTAP XCXGORGL1 VAPPRSGX 
TVILQS IAMAI AVNNPDIVLIVLL IDERPEEVTOMI R0VRCEWASTFOE0PERHIOVAE 
MV IEKARRLVEHGNDWTLLDS tTRLARAYKrVCPHSCX XLTCCVSASALKXPKRFFCAA 
RMIlIXMSLTILATALXCnCSRMDEVIFEEFKCTCNMELVlXRRl^DRRTYPAIOLIKSG 
TRKEEaYHPSELERVYLFRQAIADLTrXDA>WlU£RtK]CTOSNA 

CPn_0611 702133 701420

yacE-predicted phosphatase/kinase

RR>mRDAKTSERE0CISYDFlR5YSCnXNWRRLCRKLKIZJCVSXTCD 

OELGAYWSADEISHSrLIPHTRIGRRVIDLLCSDWVDGAFDAQAIAAK^ 

LEA XLHPEVCR I lEEQVKQS IO0CNY PLFVAEWIAYE IH YAXWrjSVI LVMANED t RR C 

RPMKIOCR3CEDFD0RCSRFUIVEEKLA0A0VWEHNGTKKELH0K IEEYFYALKGAL 

CPn_0612 7U4ti«H
polA-DNA Polymerase I

k< : 1 1 rr:;LurrvERf *rkvakkklfvluajof i frayfalpemkniiqcoatqavfcf r rsl 

NK L ( KEFS PEYM I l.VFB'.rNNKQIiRyA I YADVKSNP'jKK FED I PPQ [ALVKCYC£LIGLA 

Yi xte:veauw i a:> i akki\reenykv\Vv.tadkdujulvnokwawn pwaccwx: i se 
v i e*y»: i pr* ;m t toy laua &zr, on t PCLrxxrorwtAMiXKOFWErjuxNLOAVKGL 

.VTMLSEROETLKLltKkl I P I PVPI ESLTPPQHPVOEEKL IHFY lOQfiFKTLVP 

:;KVr£AATVDVO1 1 KOAK-'XTN [ IMLWVa^D t APAVA YTi 7ff tLLCf JCLECLALTOCSRV F 
F t At J££B .*PK I LI' I 1 JCMfl .HEDl ,'PFYUYNLK RDCT lALLKAf ItVEREt lIYGLALAEHLTN 
- XtV. IV.WM jLVNISJFTKTAJ IRFAKEHl^TnLF IChLPEOMWYFnEFVAYLP I IKOAl L 
Kl£ 1 1 CFMt H\\ I li :U I EMH .KKV LP.'MERA WrLDVEFA J\ T 1 ,KAt .FFTELAVLTEE I YDLS 



aPmiKJPrCL50tLYtlEL^D^^AE^ 
TYVKAl.IWTOHTORIHrcFtX^V^^^ 

OnffiELLFEVPEEEI EEMCP LVREKME3 AMTU7/P I WN til ^W*<AEC 



CPn_n#; t *» 



KTAPX IAV I EMKOVIAJ3KWTAKT ICN : U:Oh*EKA*'UCDHVKG IVXOKDCPCGIVFXIDR 

TYS^FWEIUCGmYIYVNCLL-ASC^^ 

Ec3«YCTOSLLTACKOIOkP»^ 

OCLVKTLCAR I FSPEXAXO ECY I PA'CATKECVICD I V'AVCK I EDNYRVXCSOCXZAMCR 
VASAAA5SPLVTCNXKK0XLPLSH0AAYX PPYtAL 

CPn_0614 707435 705793

ade-ADP/ATP Translocase

WIllHKVGXETWSI^PFSRLRAYlXPrnCSE^ 

T^VTVCSCACACV^PFUCVWCIV^ 

AVIIYPVCOSLHLNSlJVDKI^EIiPOClJlCFrVMVTlYW^ 

CI^rrTTTEACRTrALIWrCUILSSICACE rSYVMSKCTFVAYSFACDSWHSVMLKLT 
hXXTCSGLIMIWLYWIKKLTIDTSXPPSIUlVWEtCMTA^UCEiarCT 
IOSRYlXCLAIXVI^YNLVIHl^tVWiaJO^IYSSHVEFtCYKSR 
VliTWIRXWCim^VrPLVMI^SCSJJTCT 

QCaaWI AWb ' llJ L f* 1 UJI KEW IPL^PEPXNHCKAAIDJVfVSRICKSGCSLrYOCIXV 

IFS3UAA^UnaAXAUIXMVVWXAWAYlCSEYYSfUUa>AVATW 

CSVEOESMAVL 

CPn.0615 708149 707634 

pgsA-Glycerol-3-P Phosphatidyltransferase

LMCIMR0FO<II^IUMUa.YFCQE3a^ 

ItiJPITDICVFVrVCIT^YKIXJSl^IAKl^CAR^ 

FWaCIFTWDFI XXlJmAflQEIPWICLVPLVALCFLYFLEiaMITTlCKOFlJl 

CPn.0616 708704 710137 

dnaB-Replicative DNA Helicase

TtTNY*SSXilffilCSTCVPLPSPI»HSICESEMIVliXKLTCVH^ 

IRSXSIlJUO«STAJCEXEXIUUX0PKWAEAI^EAONSFFKIS0STSVS0rr^ 

LTTTTOCPYLVOWEROEXJWJMAOCDNKSFTTCII^ 

PAJt»AX>WAE}flXPWRI^ICXFSIJ>frVDOUHR^ 

DTORIVS\rttiEHOE>fTXLIDDOPCUCVSD^^ 

RATESRgTCISEISRMLKTIAJtEI^PIl^l^LSWCV ^^ 

SItt<VKFlXIWEYYl>PHDKPGTAEl.I IAIOMIUCSXCSVPLVFEXtlARPWIYSAnCXS 

CPn_0617 710481 712316

aldA-FAD-dependent oxidoreductase
L»mfPIAYt7VIVVGACKAOCEAAYCSAXieJVSVl^T^^^ 
IVRXZX>AXiXIMAEVrtOSCI0FRILtCmPAVRAPRA0VDX^ 
HXI^ATVESXIJDJCXOTrXSCVTTXlX^M^ 

SSOGI^EIJIJOaiCrPISRlJCrCTTPRXIASSINP^CMEEOPCOUWC^^ 
OLSCFITKTMEJaTAI ISANLHRSALYOCCIECVCPRYCPSI ECKIVU lULfMtfVFLE 
ofr-T umrr v AMm^gTSMPrn^mmHI RSVLCLEKA I ITRPAYA I Dfpy XHOWIHFttE 
SXLISUJTXXaiMrrTtrrEEAAAQCLl^ 
TCIZilEPYRilFTCRAIlfiUJWJIMACARUKYCYELCt^ 
RlJ3KTTROYt»SVV^lJUUU^SJlPlVSYT3KL^^ 
TOKXLX05X£W^LLIPEIXi}YX0XTAl^UAaEXLAXrrPRT^^ 
VLMIALKXHAHH 

CPn.0618 712300 713010 

lplA-Lipoate-Protein Ligase A

mPTTCXFLDLItCH5XLK0L«XEEAU*VA^ 

XSRAOADHI PX X RRYSOSTWXDSNIllfVStrcireSEASAOPOEL 

TPS IRENDYVUCHKKIOGNAOY IORMRWVWHTTFLWDIDLOKLSYYLPI PQOOPTYRNQR 

SHEEFLTTtJU*fFPSROOFt^IKASCSLI^TWEEFLI3NEl£EILAO 

CPn.0619 713462 7130X3 

ndk-Nucleoside-2-P Kinase

RRYAnH7<E0TLSIIKPOSVSKAHIGEIt^IFEOSCLRIAAMKKMHL£0TEA£CFYFVHR 
RPFFQE^VDFHVSCPVWLVLJXSANAVSRNRELiCATN PAEAASCT IRAKFCES XCVNAV 
HCSBfTLENAAVEIAYFTSKXEWKASKPLV — 

CPn_0620 714145 713519 

ruvA-Holliday Junction Helicase

DKKYDYIRCTLTYVMTCA XVI ECOGIC/H IAITERWAI EC tRALHODFLVTrHVIFRETE 
HU.YCFH5REERECrRILX5FSCICPKIJU^tU^PLKVLCSVVRSEDXRALASVSGXC 
KKTAEKLKVELKOKLPDLirLOSRVETSOTKTTSSCLEEC IQAI^AiXIYSKIAAEA 
A I KDLPECSSLTOI LP XAX.KXNFSCVNKO 

CPn.0621 714707 714144 

ruvC-Crossover Junction Endonuclease

LSRLCSSFKONXFKVF0E5IVSELX ICVOPCTXVACYAI lAVEORYOLRPYSYGAXRLSS 
OHPLPMRYKTLF EOT«SCXXfl0T0PNAMN^ETCFVWCNP03TKKLAHARflI VLLAAAOW 
L IFEYAPNVAKKAVVnKCllAJKROVOVMVSK tLWPEVLHPSNEOlACAFALAICMTHVA 
RSPLCCVR 

CPn_0622 ?l^7f.l 7147'U

*TMJ» hypothetical protein

u y :;vr ixj : lklhlf: :ut- wss t rrnv* :rt:hu illckwxdao ikemoo icn i lscv 

';"PH?X »Kt .V; U OKLTOOfSi T (OEHEA [ IILOYRrOl.'TALEErrRRREEAKtJOOLEXLOOENT 

wuy ift i j^jc i Ri to: m 1 1 pe i r. k ellvivvkte i :;et ;r p.LCYra i x t koleeqlqr yvs 

OJf ^tt: It: I EEI4U: v\AVAK IMP LXK!;L t DUA7EF.D I Y I KTT1 1: • R (AKLKEKLOROCCAO 

t: : ; £ ek ltivutdi -mjck k a [ alu&i» i vkloyi vi J'CU i KtacowtrcrmcixHU 
- u-u jrKi^EVtA/VK::K;:K;:i 

•:(*fi_')0 » *i 'till vit.ii. * 
•TVI4 tivihtrhM pt 

i rtrttn wftrdi v r rrv i r: :i» w nr RU'r/mma n^^ufM'FWKA 1 1 v i :;uai i* *kfrnc:d 
n::ri'Ki//rA\;r^Fr/HKVRi'rK i m.KA-/ r ;ujJf wk ia/jcheal ikXTFrrriuviDEKPLA 
u ; i +.w rn:PT: w •: * kkk k kak r 1 1 : vn ikkwk kk k r i w< m n i k e r a pot wyjK 1 1 .nrvK 
KEUrfEt «KMC : VEQKKFSLLPPPAJCL I5CVI3C - VPWTSADWCSIOALVRESSDL
INALX»'IAD0AlHFPETEEEPTSA3FEES3AMrrPET3SATCEE 

CPn_0624 7180U 717011

gapA-Glyceraldehyde-3-P Dehydrogenase

AMKW I NOTCH f-RLVUlOtUCRN3SVmjkiraLVPCOALTYLnCft3STHCRrPCr^ 
EAQHL I VHKRK IOFL3 EWNVOMLPWKDLSVDLVI ECTCL FTKKEDAOCH IQACAKKVLIa 

ArnKm r rTr.^ywpprPHprjror/irwAsrnrrLAP TAKVLLDNTTTTCCLKnvHA 

. • r . ,v • ' . **>*; ww > n- s ; ; i ;.w,v.T_:Lf uL>*' irLT'MAFRvr : ; -V • 
. . •.-J.r. , n:xi ..TTv; .kam^ -..ultvlk/ ; : - .YTCETv.v'.rsr :*^r;r J : fcala ; 

I ALNDft FFKLVAWYDNtTCJYATR X VDLLEYVEXNSK 

CPn_0625 718488 718060

rl17-L17 Ribosomal Protein

VKO*ARJUCFRVGRTS$HNRCHIJtt»aJCSLIHra 
UUUUUAIC3UifWYNia.TSXEAP£AK£X^W 

r z uaamicDNAOKci iotas 

CPn_0626 719670 718495 

rpoA-RNA Polymerase Alpha 
WLPAWCIttQSVVt£ICEia3MSI»We^YI^ 
GHGHTLOULttUUXXCLEAPAIISFAMIOTIK^^ 

PHQDSSI^RTTQVIXASISIDASDIJUWCOKgVTLO nT rflm n T E AVWPDQ VIFTVrOP 
I0LEWUlIAPSRCrrrPSERIVL£DKCVYXIVLOA^ 

LVLIVriTGRVTPKEAIJU^STOILrareiPTJWDEKKrVrEE^ , 

LCWnB-SVRSTNMSAiarrXGELVlHPXPIUXOnWCXI^ 

GMDLTOFC^UffAnCEItMCVOfAEXIRAKIfflCC 

CPn_0627 720059 7m«o

rs11-S11 Ribosomal Protein

FLIRSRVtVKNQAQAlUCSVKro 

VCTSGSRKSSAFMTVAXODAAJCrAMNSGLJCE^ 

IRDETPVPWCCRPRKRRRV 

CPn_0628 720461 720063 

rs13-S13 Ribosomal Protein

DAYTTUtSAORMPAIIGZDIPAKiaaJCISLTyiyCICSAJtSDEIIKiaJCI^ 

CEEVGRLNSU6SZnvrcCLRRRV0S0Xl^ 

RKCXRICrVACXXX 

CPn.0629 721881 720487 

secY-Translocase

KIRLfRPYmTU^OFTLITaJlOKlJTTrALLTACRVGVri PVPG INCELAVAYFJCQUX 

5GQNiJ0iJu?irsowAOHryiAijSwmsA^^ 

RIGRLXRIJTVALAV^OSUJAJffAIJBC^TIPtK^ 

riCILLLMWICEOISOnCIGNCZSLX IALCIX^FPSVZ/35XVraCZJlLS50DS5DLCLZ5 

iLii^vrvrvx^TTiLiiEcviutiPWAJUflrc^ 

ASSLLKPPATICOFXASZSSWKKRIAJUIAPGSLVy^tCYVLLJX 

iASDoaooiAri pcziioQcpniKnxrnttiRVTLiaajiJM 
SYPijQCTAMUvvGvvumfxovmAmiautyc 

CPn.0630 722316 721885 

rl15-L15 Ribosomal Protein
MIKLSSLFDZSOUCMUCXLLGRGPSSGHGlCrS^^ 
RVPTfCFSHKRFOKCVEEXTTtaOAELFOESeA 
ERTPVWQDtAWLSQGVQMUJS XT 

CPn_0631 722812 722312 

rs5-S5 Ribosomal Protein 
EEHSl^XIGHKEWLEBa^VVNRCSKWKOT 
TDAZRXQXAAlOOVUOCXEAIfZCSXPHCVLVKHD^ 
EKWIKDIVAKSFT^NNPMNOVWUkrKALTCl^PR^ 

CPn.0632 723354 722827 

rl18-L18 Ribosomal Protein
KGLISSWLVmXCWAPNVtXNLIK\rcEFVMraC^ 
KSLMXRRRAIJVAKWJGGSPTKPRI£Wm 
GLT^OTOEVAKVLCTQIAEUaCNLQUm^ 

CPn.0633 723760 723209 

rl6-L6 Ribosomal Protein 

SMSRKAREPIU.POCVtVSICO0XIIVXCPKCSLT0KSVKE^ITUWSIFVHAAPHW 
DRPSCMQCLYWALISNMVOGVHl^EKRLEHICVCniASVOCAnilLSXCVSH 

stu7vsvekwtlisvkclokolvcefaas xrakrppepyksxgiryenztfvrrxagkaak 
tgkx 

CPn_0634 724215 723787

rs8-S8 Ribosomal Protein

C3SIKRKR I YMCKTSOS XAOLLTRt RNALMAEKLYVDVEHSKMREA IVK t LKKKCFVAHY 
LVKEEKRKRAMRVFLOYSOORK PVIKOUCRVS K PSRRVYVSAAK I PYVFGNMG I SVLSTS 
QOVMECSLARSKNIOCELLCLVW 

CPn_0635 724763 724206 

rl5-L5 Ribosomal Protein

GEPJCAJMSRLKKFYTEE1RKSLFEKFGYANKMQ 1 PVLKK t VLSMCLAEAAXOKNLFOAKL 
EELTH I SGQX PLVTKARNS I ACF KLR ECOC ZCAKVTLRC IRKYDFHORFCN IVSPRIRDP 

RFKKAtf 

CPn_0636 725100 724750

rl24-L24 Ribosomal Protein

PKtXEVKKK^IRVrjOKVF riACNOKr»EX3KVLTXTEDKVWEr^NVRlKNIKRS0<>JPX 
' .Tfi 121 GAP ni!:M^LTIA<:EPAXL^VKVTCQCREU«RJtlI/n^LYRLVRCKKC 

'.'hi.Oii \1 /- M i47 1 /:!»i0*»'t 

rl14-L14 Ribosomal Protein

i k m lyuKWi jcvAfjwnyvKKVKi •FKvu:cr,RRRYA*rvrjw rvrm-RDVErf re; I KKGOV 
i tav i vimuai iTUKnr^ n-t jcFrrrwi v 1 1 dow WK'rriM ; wark wcDftfTF I k I SSL 

AJ-KVt 

MfiJW. <M /2*«77| 7J'»4'»« 
MXKOTW5HA5E?ra

CPn.0639 735979 725743 

rl29-L29 Ribosomal Protein

ASGWINMAAJOGLLTOLRCKSrCX^XYW 
KNtARALTVKOCPrWHn 

rl16-L16 Ribosomal Protein

T IKLKPKRTKFR lt<KK30r ACL5KCATTVt3FC Df A*CTIX31CWVT^R0 1 EACRVAI MRYt 

KRKXVWXRIFPDr^.TKKPACTRtCICClOGAFO^ 

RRAAAKLCHCrRP/KRVERV 

CPn.0641 727092 726409 

rs3-S3 Ribosomal Protein 

KQRRXItXKCCP ICFRTCVTIOWR5X^Y^ 

TATUOtECKiryrtOTflf mwBWBituw i mr pit w-fcwTTtrTriBtuaJg. 
VACMXAAOXDtRVSFRRAKKXAJCSVKD^VCTVKIOVSCR 
KTLRADIDYATACAErnfCI IGIKWINLCENSSSTTPTOiPAAPSAAA 

CPn_0642 727440 727096

rl22-L22 Ribosomal Protein
RIWSMnUlTAXYIRVOPRJUkRIWUU^nNX^CAfXSL^ 
A£UIIMIKRE>fLSVrCVRVtW«PVnrKRSK5KSRGn 

CPn_0643 727725 727450

rsl9-S19 Ribosomal Protein 

EI JUlCRSIJUCCPrVTOOCJJUCA/RAMNI EElOCTPXinwSRRSHITPEMXCKTPXVHKaCR 

rLTVFvs rmvcHKLcrFs ptr i ncsHPViacc 

CPn.0644 728594 727722 

rl2-L2 Ribosomal Protein

rT PiE f N S W IU CJK PVTPOTTOLVLPAFDELTTWCIXRCTKSXRSL^^ 

DTw/pi rrer^inmsAgaLVirvvDnatfacm 

YILAP10GIORGrWWSGffiSPriCSQCafrXK« 

QVIAXSPOYVTLIMPSGErWHIJttGCRATIGCVSMAX^ 

TAMKPVDH PtCUCCCRKNSY I PIH'PVCKVTKCZJCTRDXKKSKKiflVXXXtRX 

CPn.0645 726933 728598 

rl23-L23 Ribosomal Protein

i^otDrtwirnHr.TrrftfM; ^ g ^-^ujuu^csfciu)PKFvrivsHruTCTUAOAL 

EAIYVDXNVKVKSVNT INVTCPQPAiUiFRCRRJIOKrSCFIOUrVTFYOjOISW 
CPn_0646 729636 728950

rl4-L4 Ribosomal Protein

YREDLKVLt£KFI)FSGNKZGEVtVaD$LrA^ 

SEVSHSTIUlPf AUAL !G>WROOCIASP0FRflOCIV7X3PXPKJW0HVRIHRKEiaAAIRLL 

UU3KI0TWa.TVVDErrVFVDALTAPrrQS 

LSUWLTAVKCrVYCININCrrTJLASAW 

CPn_0647 730490 729657

rl3-L3 Ribosomal Protein 

YXXYTgYCWgfPLITCPFXFUlOroTLniSZSKIl^lU^I^^ 
RSICSVItaCKEgHIHirOKrCSLVACSVlTn^PKV^ 

AHTXTirosKPieLaajucAaGRvrf^^ 

OISXCKGFOCVKKKFCFItQQKSHSSaFHRilAaSZOO^STPCR^ 
VSSLtVXICVDLEXICVUVXCAIPGM^ZVI^ 

CPn.0648 73X636 730605 

CT529 hypothetical protein 

FTPXXPOCEVmATNAXIlSACSAASIOmXPVAXXPAAVSSFAOXCm 

AKrywTtfUJXcria^iuwsixrv^ 

LMSAVPS IVHSTORCYDTrRQATEliiSyTlffilUCTPCEYS KMLLTTO 
GATTYSATFCVLRPLIfLZra0UTAXPFL£XATVCKFt7TA 
CQK£^XfUUC£5LYriERCAUI«0$0LSa^LSAERALIlK£KVATljr^^ 
WOCVKLI PLPZTVACSAAZSCALTAAS ACXCL YS XWOICrXSCK 

CPn.0649 732672 73X7X0 

fmt-Methionyl tRNA Formyltransferase
LNUCWYFCTPTFAATVL00IXHHKIOXTA 

LLOPSlU^0POFXEaJMFNACVrXWAYGAILP«I>^XPRYGCYNLHACL^ 
PIQRClHSSATESCNWIRMDAOKPTCDMANITCVPICPPfrSCElADALASOG 
TLO0IESC0L0LVSOOAALATI APKLSKEICOVPWDKPAXEAYAHX RCVTPAPQAWrLTS 
FSEKAPKIU^IRKASLLAEACRYCAPGTVVVTORCELAIA^ 
SKSFLNCYPAKKLKIVFTtJM 

CPn.0650 733513 732665 

lpxA-Acyl-Carrier UDP-GlcNAc O-Acyltransferase

SRR>e4ASIHPTAIIEPGAKIGKDWIEPYWIKATVTLCDWWKSYAYIDCOTTICKCT 

TXWP3AMICKXPCCU(Y0CDCTYVriCDCEIR£FAIITSSTFICTTVSICNNCLIKPWA 

KVAtff4CTICNNVA>X£NMAOLACHVQVCt^AILaCMVCVHOF^ 

PPYTrCSCWPYOLACINKVCLORTOVPFATRLALIKAFKKXYRAOOCFrESt^CTLElW 
DIPCVKNFXEFCCtfPSKRCIERS XOKQALEECSAOKGC/LXES 

CPn.0651 733975 733517 

fabZ-Myristoyl-Acyl Carrier Dehydratase

MNOPCVIKLRELLTLLPMRYPFLLVDKVLSYDIEARS ITAOKNVTINEPFTMCMrPNAPl 

H PCVL X LEALAOAAJVL I C LVLOtORNX R X ALFLC IOKAKFRQAVR PCDVLTLQAOFCL t 
.in KyJK AWAO ARXTSQI .VTEAEL5FALVDK E3 1 

rpn.«H,S2 7MWIII 711990 

kkn: ; 1 1 Yi ;o:;i_ * •j-*%*Mr.FjrtvR*rLKR evr Ysr;vr; i h u:r~sriM lo paotntt: xvporq.i 

A;?»ffO*VPAl.IJ>iX'YTTfIR:rrTL3RCnAVMTN^ILMAALRSNNIf3NLl IOCnt*EEIPI 

*.;onr;:>iv*vKi.r[VAr: iceoedkvi; t apltp p'/yyohod x flaafpt ioelk i:rrrufVPo 
: :: rr rnvYK: ; lv i nee.*:k^k i ah ^tfaly rrELCFLMCKCL iooucloi*»\vvfkoti ; 1 1 
:;m»m*hpadkivrmk r uji.i* uiu*:i.v» !ri i fvamvlavc*>»hssniafiikk iumuu. 

<:i*fi_'|t.M 7U4Vi 7 I4H4H 
cutE-Apolipoprotein N-Acetyltransferase

. 7EPVLR irrn/ IttL IAFA0PDL3GFVS tlGAACCYGrFWYSLEPUtKPSLFLRTLFVS 
CrFWlFTI E3 IHFSWLSDOY ICKLI YLVWLTLIT ILSVLrsCrSCLLVAIVROKRTATL 
WSLPCVWVA f IMLRFYGirS(^mLCWPirrASAYCIU7TOPUWACOSrAVXAV»M5r 
YCLLXJCKPHAKHWn^TLiilPYTCSAJHYEYlJ^^ 

SPIWWtOLUJLVnPIOOP rDLLIFPCWVTTGXHROVYPYESCAHLJLSSrAPLPECKAf 
L3N7OCATALJ0HFXPVt ICLDtUNnCKENVLYWYNSAEVISHKCZSVCYDKRILVPOSE 

vrprKrnrr.r^p^FWM/^KRLP^RrsTrrvRaLPRTnTTT^tLiFUVKLO^ 

-'«rj u.TTif'afvi Kr:; :,; v.i rrr jr«u* i ^r'' wr» Vfc,v*7^/r.\Avn.;LvP.:LK 

KEIR 

CPn_0654 737051 736503 

vdlD/yciA-acyl-CoA Thioesterase
KKXXDFl£Vl»YYR*aEYPXKXt£VESTMLXKJCTO 
GUJISLLBRIJU.WAEIUn'ESVCVTArTO 
VKWAEtnYKOEIUUUTSAYrrFVAVNElTOP 

cue 

CPn.0655 737856 737101 

dnaQ-DNA Pol III Epsilon Chain

KEXMsxxKDrmcxicafiCLBvn^ 

QRVHHISNANLRISPKIAEWPOIKAJTXEXIDYrVGH^^ 

YTI IDTLRLAJCEYCDS PTWSLCSlJkVHFNVPYOGNHRAMKDVI IN INI FKKLCXRFBTLE 
QtJCOVtJUCPXnaCYHPUaCHXCRCFSCX PWY WWASKMDn5SOLlJSIRHEIKHROKCT 
CFSOVWPrMEL 

CPn.0656 737842 738048 

No robust homolog present in Genebank/EMBL as of 11/7/98

T>Q<ri£LPLSlXDILLTVEGn£L7LYrA5TO 

CXDTHENXD 

CPn_0657 738476 738051 

/)■£ (ATPase or Kinase)

PMQRYRRVSKSSOEIXLLCTELCGfVLVPCAVlJJJCOY^ 
EVASP5FSILHVYGNEPKRLCHYDLYRXR}m 

• DTXNXYITMQTMtEREI XICKR 

CPn_0658 739180 738455 

CT538 hypothetical protein 

KRVGHDX5CAVKQXXJ^FLSKQKKP£XIAT¥^ inCTPEUAV 

RIUX^^IWRTTEIOISSCKPOVKEI7ITRrriCPrTCXVrAI»W 

PONKEX00GVRIKIUT.V5EI»DWKIYAVPPKEPIXIC^^ 

SSYIAPKrLEEVQNOTKFOLESSFLSIAQ^ 

EE 

CPn.0659 739482 739838 

trxA-Thioredoxin

LQENt8U}SHSIFJlEQCLMVKXI5SX3IPt)SFXAS<^ 

ELPHVTICKINIDENSXPAETYEVSSI FTLItJTCX9ICVARVVGUCiacm.TMiaiOCHA 

CPn.0660 740327 739860 

spoU-rRNA Methylase

KRWI^PDIFQNTOTIGRTCVAI^ttEX^LVRPIfi^^ 

SIEEAIJ(OTEDOIFCLSTKKASYTE«LPSSGT^^ 

PMOODIRSLNLATSVCIVLYEVVROKTVALOraJPTV 

CPn.0661 741139 740327 

mip-FKBP-type peptidyl-prolyl cis-trans isomerase

HSRCLMIOBUWaCIRJa^yi^^ 

CI^WIO£RTFQ{XXARQLIUCSEtMnVXASVAICCLOA^ 

OKLVFE10CSK£NLSLAEJCFIJCENS»U^^ 

KtaFINCWVFSSSEZa0rcPIUJ»lXOTPCFAIOO3KXECCTWL 

PPNSLLIFEXNUOASASCVAAVPQSQNOGE 

CPn.0662 742938 741172 

aspS-Aspartyl tRNA Synthetase

SKCCYMKYRTHRCNELTSWIGEWOLACWWRYRNHOCTVVriDtJlDR 

QPELHORLnAVRSEWTC^GXVCPRIJlGMENPN^^ 

AOCHINWEEIJ*LmYU3KRJCTIXE^^ 

PECAJtDYLVPSR ry PGKFYALPQSPQIJKqiXMVgCLDHYrQ I ATCFTOEDUUORQPEF 
AQ IDIEMSFCBfPODLtPI I EOLVATl^ATOCICXPLPl^WffYOEAKDSYCTDKPOLRFD 
LKLKDCRDyAKRSSFSIFI^LAKCCTrKCFCVPOCATMSRK0LDCYTEFWRrcA>Cl.V 
W t KNQEGXVASN IAKFMDE£VFH£LF AYFDAKDOD I LLLIAAPESVAMOS LOHLRRL IAK 
E R ELYS DNOYNFVW ITDFP LFS LEDCK I VAEHH PFTAP LEXD I PLLCTOPLAVRSSSYDL 
VLNGYE IASCS0R I HNPDLQSQI FTI LK I S PES XOEK FCFF I KALSFCTPPHLC X ALCLO 
RLVKVLTAAESIREVIAFPKTOKA5DLJ01NAPSEIMSSOLJCELSIKVAF 

CPn_0663 7«4220 742901

hisS-Histidyl tRNA Synthetase

KSNHFERRHHVTVTLPKCVFOI FPYXAXlAXQLWRHTSlJfHSVEKAI HTVCMLYGFCEI RT 
PtFEKSEVFLJWCEESDVVKK£WSFl^KGRSKrLRPECTAA 

^^/ILPHFRYCTOOACRYROHHOF^VEAICVRHPLROAE^AJ^XWDFYSRVCWKWOIOL 
NFLOCS ETR FRYDKVLRAYUfrSMfirT iS A I SQQHFSTWLBI LDSKEPEDQ E I IRQAPP I 
LpYVSOEOLKYFNEILOALRVLEI PYA INPRLVRCLOYYSDLVFEATTTF0EVSYAU3CC 
f IR YOGL t SA FOGAJLPAOGFGVGLERA XOTLLAOKR X EPOFPHKLftL t PKEFDADOFCLE 
fC^>HLRRU; I PTEVOMSH K KVKCALKAASTEOVSP/CL ICERELXSCOL V I KNMSLRKEF 
K7TKEEVEORLLYEIONTFL 

CPn_0664 M477S 7445V7

No robust homolog present in Genebank/EMBL as of 11/7/98

I .WF.AHAHKKL I AL U1 1 FLVlMKCfnWKEWOAHATVLKAARAKYWJ'I^DVFPVHEVIEP 

i:;m>;lviiyb»v 

CPn_0665 /4<l'»<m 71i,ihS

uhpC-Hexosephosphate Transport

KWftWTKMVI IK P. [ KDUEWKKKYKYWR I R I FYTMF UVfl t-TYFTRKSFTFAXPTL 

i ai a* :pi»ka.ji * ; i r.*j nxr. vm inKFvncvMSDosripR ypka r< ;lm itcltk i fp^mss 

:: I VI.KAI M* 11 M MK> a*arfl'WTAI{LLTIftf/AIWEKn^^rpJIINtOCALIPtUTCF 

i ii *n« iamyvi i ; r u*ii MaLVLiNnLRDTiv::i/y-pp t ekykrdphhahi recKSASE 

' TfKK ( I *KI J TI1K I I^TWr.TNOWl WPLAAA'iFF 1 Y fVRKAVNOWf»ALFL [ ETWfYAAVK 



ANFCVSLFE ICCLTOILVAC -CX ISKl^PMWU^^ 

DCniJVICrTt.YCPQmiCLAVkA£l^MKItMCTAOTTC^ 

CPn.0666 746370 75010? 

dnaE-DNA Pol III Alpha

CFFLWI PLHCHS0Y5VLCAKSS IKTFVAKCOEFC I PALALTDHCNLYCAWrYKECTOK 
iltOPrrOCE^TAP^PFOFKKEXRSRAAHHLXUJCIWEO^P**tXXLTSXAFTBBr 

nnr^xjUK.-tZZ-z : v.: -■■.■»•!;•.:.: j.:^wr.L-L*"--.-v^-rr::uut 

jtiEEJiABfKEEKu:-.L-:v : * "'T-*.v:j.vr::^: : i t;xw.:\" ::ianucah 

EILXJ<VOSCE7^IAIC0l^tPNPKRICVYR5RrWKSPAOHAEt 
KICDFTFI)FSKKKYPtYVPCSUerUISYTEEI}RYOA5A 

XAIOCrPHRDPXOXVKEIWDHEHAII IPKCHCOrLLIWOI XKUAUUCXPVCPCRC8SAC 

SVlZ^tXCXTEXCPXRFIXFFERFINVERWPVIOIDXCItACIianr^^ 

AOirrrCTKJUUO<AV1tI7VCRTLr*^ 

VGMXJCVSLLCXJCTLTS INX AKSA XEXJCrCOSX^KATl^ISAT^rSLUfOGXTMOXFOKE 

S«X3nJU(KXJtPDLrXXIXAIt%Y1lPCPiaMXPSFINIUCHCK^ 

TYtXXMVYOeOVMOXA<»LASYSLmOVLIUUMaX^^ 

lATVXFT3IOIX3CrAAYCFMCSKAAAYCLITYTTAYIJtANY PKDCAALLTCSSDOXEXXCX 

LXREAOSKIXPXLPPiaNVSSNHrVATDECXRFAHCAXIGCICRCUESXVEZIIt^^ 

SIWIOR50UOCVSiaCSI£aiDACCFTX:ri)SNRIJU^SVE^ 

KirrrxauffiiuQJEVFXCLPiasxFriisw^ 

VLACETENLPHCSVVRTVFI lOKVmXSSKAQXKFAVUtVSOCXOSYlXJIWPDfY^ 

QEtUEOR^XYAXLVLOKRSXSSLftXSCRWaCDLSXVN^ 

IWSTSGKETRAKGNKPNE^GKTOALAPmStXLN&XRW 

vF i yiJNiJi vASMSPix>AYrv^Eoim.R Q ir:vTADu>vKvrrv 

CPn.0667 751097 750177 

No robust homolog present in Genebank/EMBL as of 11/7/98
NISUCMOKRYTTOaCULYrAATVASUOT 

Rr^ISCFEELOVTtW^PEOAJLAIJPrnuaacSFV^^ PHTLHHVRrSKEIPVIOC 
HI ISOEXXEItJgLVWCEHVLH W r vr r CSKOf RJCIJiJWCltOttlRVIOTIArUXnTSRX 
StAOAIJUJCNIRA£RVIKirOKXXLirA5CNOICTKrOOTO 

RKAAVTPAOYSEDRVRHLVKHI rCO^XJVRSSKVYVIVYX IS AX^SVRVTYINAVT 
CESFOW. 

CPn_0668 751176 752162 

CT547 hypothetical protein 

HRTVVVSPIUJHICrUYVPX^VLVSTOCmKPVSrEPrSCI ^ 1 W WPUH SAEPfTW 

QQEFX^KKCKFIOCALLCFSI X7^IKI^RJ3IlJlNOA0Y^ICVarrT0{»PDXJUXArA5YW 

PUUVSEZLFCKKYAIAORFAOCXRJUIICKL^^ 

QA0ALYS»yU^IVW^TEATTCriJCKLTt0FPLHILS5 

OYXJIFAXXJ<EIWaaU?Kf>NHPtJIEWSANVaA^ 

YRTAlTNYPtOTiVAJOCOJOttlJRISraTS 

CPn_0669 752140 752775 

CT548 hypothetical protein 

IEYlJIl^EINKRl^SLOTXWrSL^SCCCYSXLtSP1fW,SST^fnJflriTriA 

PXXEOTfCOXCSALTYXLSiaiSFAXSCRSSCACrTXJCVEII^ 

THSHF XVSNEE3IU<SLSAXVQLX}JNDTOEVLX DOCVAAESVDFTJTETDLQTAItAHIFALCaQ 

FO0iS£AXKSAIUULSXIUARXAQOVYYI)LF 

CPn_0670 752738 753196 

rsbW-sigma regulatory factor-histidine kinase

DpgTJAi u ^w ^ ■ »r.F TvrPAVT_grr^gi».riT Trp^vncvroopirT t rr nirtrrfm 

I ISYAY0CD*SPGTIAISCISKJCOLrWIKDKCPSFNPL^VSINI0rxu»lX01^^ 
CIFl^XS^ETLYAaEDHCNIVKlJCJaJOOHS 

CPn.0671 753440 753205 

CT550 hypothetical protein 
RrrXNORJCrTtCSXOFrEEFYHOSILNTOSFFEGYW 
KOFI IAESKD)aTUTlWyAIWLVPIXV0COA\rrRCYIA^ 
OSSLX LXAlOI^nJOXlQDTENAXJlSFRFNNDH 

CPn_0672 753723 755048

dacF(pbp5)-D-Ala-D-Ala Carboxypeptidase

TXKST40tK1lPFFTYl£XXrrcSCASl^U4ACUFPEVRCATAAVVKA^ 

VIYPASKIXtATAITIUOTYPTVt^LIKVKOnAIASrTPOAJOCQSCY^ 

TIQtJaJl£EU£WDLFHAU.VCSANZ>AANVIAMACCCSVOC^ 

FNNPHCLHHPNHYTTTRDLISIHRCALKCPPFRCVrSTTSYX IGATNLKCERILSPTNKL 

LLPCSTYHYPPALOC K VI Kl ACXNLIHAAEXNNRLLVT rATXTrSCPVSOLYOtVIALC 

CTVFTrEPLLRKELVPPSrcLQLEIAi<LCKUCPLPECLYY0FYA5EC»EPL5VSFXA^ 

AFPXEOCDUjCWWVryDDEfXKKXSSOPFYAPCRFERTIKPWKLYK^ 

LLMYFRIRKHRKYKNLXMYSKI 

CPn_0673 755242 755463

CT552 hypothetical protein
GKSTtGJUVYHCFUCQVSIALOTEEVWWPHHI^ril^ 
VSFLVLQEK X ATLK 

CPn_0674 75B«9 r i 755577 

Cmu-RNA Methyltransferase

RCXLYVTWPFRQHHAYQt^JCOUrrSAXSEADRVSYYFKOMSLCSKOROWXONIXFNIL 
RHRRLLETL X LOSCEOVTP CALVAXVNCCVLENLDSYSA I PWPVR YS I5DDLAHFLVQ0Y 
CEEQAEE I AX IWLTEAPIT t RVTnDX ISVKELQEXLEYPSSPCELPEAUf F5KRHPL0ST 
EAFRRCFFE t00ENSQR(fCCtCLT0KDXVlXFCAGAOCK3LXFAOKAKKVVINI)SRXAI 
LOTAXHRLUtACARNFSLAWLRLC.'lF.'^V IVOJAI<OCTO^RRHPE«ie*CFSKICUANY 
VRVTJKC I LKOASAYVC PRGRLV/ ITT:XUCfcIE>JEAHVAYHH5U>OCCVMRKTLPLCVCXC 
DAFFTSUFCK I 

rPn_Oft7S 7V/-"01 7M.7i.il 

7PUJM I LDFOFS I C,"YY LKVI IMS 7Vt< II*\7MtKRt 4,LOAWI VMI *l 'I .ITNYOTiIVST ( 
ftOV !HELF:M::A I ^Y:I I ft 1RI JiV 1 1 EU'U IKKK I VP arfl.Yfil^ KP::r/l| I ICK A I VOKLLM 

fksu LFtiKRPVDK i vyAAwr/mrr ;r : 3*r: :: ^3)pnib--rrv:tFVvrpi -V IKWULAA 

DA gUJHt tEALTTLLEt afTAYLI-U*:! JXU>0,«K I ' IKKAyHJfTtw :t:K: :YV! 44tEL ICLFSL 

AEVI^KVIVEXKLHVJKtUiNrrPEC/^ni V: 1 1 Uf/jNPAI JMOKH tTHI JilKWt Ji/VPROtn 
lAlAKAAe\*YNPIIPnFWROFrJ>LVrj^|.p 
CPn_0676 7S1221 758051

iicemioqou:; CO CT695 

UMrTTPISCNXDRffTISDPLEE3AAEBCD5DLn)RVS^AT0VICTrADTCIPCATPSrc 
TTCDLNJDLVDRVEYEARG3 LLTTMLAR I RKAVSQ IV»<HVKTKRH PKEQCVRSLCO I PCD 
LLKATRLPKETA£PPYrYAl^ALAXR5rTFHVFLR^ 

PEAAVAFAL I LR 5CCKWVATDAV0BCLPLEV I EEAGHYNAFSLCATTTVtEVSKRLSELL 

Y.^m tCXILANV^HK IITSFajCAC0CV5VVCNZJCrYDU 



•j' 1 *.\v-.Kf::jrr;: vkk 



CPn_0677 7604 10 759255 

no robust homolog present in Genebank/EMBL as of 11/7/98
RIAMSIKPSC^SPtJDWWGAOGOSSSTOC^ 

WQ7AmEmcnCSPDSS0CASGPAM0SPSCFTXRPTRPAPPPPrrC»ANMCRPATKroR 
APOPCTAGSSSCBGOPTAHSSEVAJaVSnJOAVHSHAKm 

MRCPDYLLHCYRVIARALOQTYTE0SMLI tI*TSy fU PVPQAVTVAXDAVTCTVRGX DCML 

DtPtCPCNDPOGVLMBATCSlXXEGPTLOPCraXG^ 

CSALDRVRENH PNEHPRIWIAIJUlEfcGXAVHSHATSVRI ANAGW©fIT(^^ 

LQaflCVLSVCAWANTKTVLIGDLfT 

CPn.0678 761329 760682 

no robust homolog present in Genebank/EMBL as of 11/7/98

KXXHSVOTSGNSJOTUfXTGAKDQHPIM^^ 

TCFFSRKSF1*SC»PRCSCOPSAPSAETVRSPLPGGB*W 

IPQVPOQCUORSSSSTTUCmPAPPPPKTOCTTtMCRPATKTCPAra 



CPn_0679 762936 761725

pgk-Phosphoglycerate Kinase

GYMDKLTVQDLS PEEKJCVLVRVDTNVPICOCKILDO IR I RSAKPT XNYUJOCHAAVI LMS 

RI^XGEEHPEXDPTFAASI^SYCCVYVNDAFCnSHRXHASV^^ 

ELEFCCRKIXTSPiatPrrAILC^MnSSXXGVXCAXXNaVDnil^^ 

IXWSLVEXSAIJlJUWVIJCIAJCSTOyT^ 

roi GW JT tEF XRXIKQSXTVnaCPWYWpn)S« 

AAAVVAXWXSTlCVSWSTGaUStET^^ 

CPn.0680 764254 762971 

ygt>4-Phosphate Permease
YSKt^tiFVLLCcrrTswiiQuwAMAW^ 

ORVAOTIESSIVSVTinWASGDYHYCHrAALI^^ 

GFXxvLtatmrywNsvcxxuswxLSpmxxwLxmxRRHXPn^ 
pnjua,\ranT^rrvMisocviLKvssTP*avficvLvccLLSYi rnnfwncKCSYisDT 

PKXGSLTinUJCERCXSKYCRXyLWERZfXYLOX XVACFMAFA 
OAYPASYTSYTURIJIAIGGIGLVICIJtfliCHJW 
XALASII^PXSTTHVVWVLGXGXJUCIRABlLHIira^ 
FAUtALFN 

CPn.0681 765001 764258 

CT691 hypothetical protein 
NarRsrocsrrarR<7VTiAKKMLMQTtJ^ 

UlO3RYCHXD«KLVSDlCEY0ADCIKNr»OWHLPA^ 
AED^U,TXRRI2irYPSKETLFFRFtAS<UAm.TMr^^ 

xraauEX 

CPn.0682 764912 765955 

dppD-ABC ATPase Dipeptide Transport

TSXttiUattLfRllNNLPKRSClCJUJIA^ 
QTIJttXGESraGKSTOAHAXUlLLPCPfr^ 

I FONPQASLNPVrri EQOFHC I IHTKLALTAEVAJCEXHLYALEETC rHD PRLCLNLYPHQ 
LSGtWWRICXAHALXXSPlQUJADEPTrAI^ 

M3WAETADIWLVLYACRMVBCAPAVCKF7<NPSKPYTROUJ^SRPSL0 PQQLGS FNPI PC 
OPPHYTAJ , PSCCRYMPrcS»ai^SAIAPEIYPVRBa{KVRCWLYI» 

CPn.0683 765936 766919 

dppF-ABC ATPase Dipeptide Transport 

GVWJ1I INrPOPLIOAT^TXHYYKRSFWFOCIO'IASRPVDrVSF^LYSlWVCLICESC 
SGKSTLAIALAS L LPLTSCrLTFWGTP IKLHSKHCRHQLRSQVRLVrQWPQASIJJPRKr X 
^SLQiSIiYHKLVPKEKVT>TVRTYt£LVCLSEEY FYKYPHQ LSGGQQQRVS I ARALLG 
VPQLIICDEIVSAIJX.SIQAOILNHLA£L0K}CI^TY^ 

Q XVEKGNTKR IFSOPOHSVTRMIXIttOLPCTTOROSKPirOTOXWEESCSTCCYFYK 
RCPOKOEACXSEXXPNOCDAKKrYRCtH 

CPn.0684 768056 767181 

spoJ/parB-Chromosome Partitioning Protein

EXSGOXVTEEXSKDTXXEV'AXDOIRVSPPOPRin^SNEtLOCLXASXKAVCLXHPPVVRC 

ICTOTRVLYYttXAGERRWRAHQLAGATTIPV^ 

04AEAFKRLIHVFGLT^DXVAYKVGKKRSTVAffl'lJtU^^ 

ILTLEDPILREXLNEI I IQEHLAVREAELIAKQLISEBGSS I ELKPTPLOWAESSKOHEE 
LQORWOLCCTKNWKTCSKATVSFra^^ 

CPn_0685 7680:6 768217

No robust homolog present in Genebank/EMBL as of 11/7/98

FPpSQYLUFPNRXLDLCAFEIUVOGMLTDgRW 

KTLK 

CPn_0686 768J73 768176

no robust homolog present in Genebank/EMBL as of 11/7/98

AKDSMKPOJRLFAW0EIJFFS*VWCttRRPRKLYPSL0H^ 
MFYMIIV 

• "Pn.n*«7 769501 7ri-J2l4 

• T4*u liypnr.hutiu.it procdin 

«K 1 HKTir AIIAYnFrrrPNt^t:;FH:Kt.VHN IWKKFY.TFSSA ( AlC IVLAJKL.1LK tWNTYK 
II WAOtt; I LLLTRAAbVAVS^V »FLPSK^ALS JLGQAYHt/JGCSHK PYAiIFLAIiCF'Y I UN 
Kl'U* lAYYAl ilAYWKOALCJLPUP rOKLLKeiSEAOAOOLYOVAUSKJYOUJOTAflSaPE 
VITf.'rFl.Tf.UO/ 1 ELKELU KWWDrAALKiSPtFMOFERKYKCCEVrLCKRFnKIOI 

'•It.JI'.Hh /».M17*» /7(Mi7 



SLKLlVLArRC^HSR.'JO. . JCNYLRLUCCNFA rKERT ^^..H ; TWASFT 

FYTWiFPrLEEor i PAVvrrvASRY trSKAAwQUtrsHRLKr: 

QNELXDlAlCPYICEAS3Cm«*»*»^^ 

SOPTSRKLAADK/FYSFLtCaTr rKWCtJCTWJ I YRLDTKPWO^" rWCPffwW I IKWIKE 
KSKQLYXJCKOLPK R 

CPn.0699 77(407 770 1*»7 

vfhO-HifS-reletert ^J^ 1 ". ft™ 

HHAfM^WEXACRRRCiLVKKI RVHDSJw rCLDOLCXLLNECAOFVJX PHV3W T0CVQ P 
LOOVAELVHRYOAYLAVtCAOCAPHLP XDVOLWDVDFYVFSSHK XYCPTCXGVtXU&UJt 
UttLPPVBXGXWAXYtKONPEYLPAm^ 
OKOALTTYXHKEXXXIPX^XLCPSXEEPWLkU 
GH0jCA0PAHER*«VGHVLKVSt£XYNDEEe XDQFH.VLCDSLDKIRR 

CPn_0690 772704 771436 

ABC Transporter Membrane Protein
LSVUCTKttVSXETFSSXASGSPVOKAAE^^ 
RY10ATCASeX>XXQHWUlM«SXAFECXLXNm 

MocrpvwwpijW^vcsEBRcmYXprocT^Pxr 

VXtJCX3RASA^X0X5HDVCtS<VOSSKTIVICVrEXJM;EC^ 

T X ATVDCBA ICRKTONLLXSCOCFCWrDNTSY IVCKRC HA£^VLVOSPWCTVV»tIJtS H 
DAEgTVSRQNXlCSILYSCHrifECTISISSOCCLSDAWOKHDTT f T SSr»RVSTFPRLC 
nT«VKASHC^TVCPU)POOIFY«RSRGKrEA£A0EKL 

orrs 

CPn_0691 773467 772685

CT691 hypothetical protein 

RCUSKIJaiOiLHASaffi^IIJDFNtWIOPCTWVr^ 

SSGEXAI4E0NU£MU>t£R5RA<3XVGrW 

SIDtnmiSTVlJTYrYiaTTOLFURNVNOT 

D5CI^ZlAX^CRVUlCYRXLKinPSSLCrVTHN 

SUOiELEAJCSYOEVTKFVAWR 

CPn_0692 774945 773461

ABC Transporter 
XOfXCATGXJCVMGESWCTlZERED^^ 

X XDPRI^YRYVOtOIitEPAWARXifYCPXAYTOrVYrSSPKOICKPTfTHT rpADPCXIgrTK 
Kli3IPU)EOKKIJJJVWAVDLVrDSVSICTTnc^^ 

YLCaWSHREKTTAAI^fcVrSICSFVYVPiaWKCPHDI STtTRXNNK£AO0FTRTX»XW 
EXflGYASYlJXKTAPAYSSIVXKAAVVXLVAMEHAVXRYSTVQMrYACta^^ 
VnCRCLCAfiYRSKI SHSOVCVCAAnVKYPSC I UCr^SVCErrSVALTSOTiOAIJICrK 
KLHMaCRTremSIGSXSSDSSIQm'RSLVSLClOXAI^ 

KIVVIKSTSKEKEATTSKLREOOIXYLRSRCLS PCSAVSLVX HCFCRE XXCOXfrAQ 
CA5XLUX10J96VS 

CPn_0693 776292 775240

TPR Repeats (O-Linked GlcNAc Transferase homolog)

UtSTWVLCEXSKEBAAICHLAXEFt£SG XNIJXSCEYEOA£XRXJCETtXXJttTAAXAYCY 

DOVBCWFSLC^nfHRIJOU^OtAJICro 
EVAVAXNPLYtO(AWVKU3XI^RSKR>OUkTEAYZ3tWOLR 

TRIJUJCAFQEALfLNACDAQAH FYVGIAHUJLROKREAYEAFHSAU XHLOfDIAKYLLO 
YUWIO^Tt^TKEtlJLOXWJSTFAPU^ 

CPn_0694 779635 776330

pbp2-PBP2-transglycolase/transpeptidase
FSKSIJWNXKSKKRnOCFTXYUXAOKTNRXLSCXVXAFAVIAUu^^ 
EAYKPOIRVLPQYVPtATIUJWH.Rl UkVWQLQYtVSVAYCAIRDXJ>TTUWRVT)Oa30O 

SK>OCLHVEAVVWWYPOEWASDILCYVCPISWEYXR\^ 
PIO^ECXJlSXDOVRAXX£SVESmYSIJULVCKHSVEACI^iaiUU 
FX0DfEaAVPEAfC7l7a^TLSAa^YAI)AIXLEYEXTCrFR» 
XKCmiXAIARTOCXlJUtASSPRYRNNDFVNAlCVAESSKAW 
DRKVPLIREWWPLTClTYIXIU>LTFTCFU5rtJ-PE^IKL0L^^ 
VTRLX^LFPYEEXRTPCSAXraVFTNEECKXLIOCVttSIjOEO^ 
- KEAUXTVFVXX^ANYl^ILYTOXLRtXVDPEXlFSPVLPSEVHR^ 
SAFSTmDAFX EVHFKSWRKS ErXjOYIAAKRQEEAJJUCORYPTPYVPYLtl lJl 1 Rtf < AH 
rCODO^FUYU-SXTPYXECl^PYYD I LDLW I NELDNCAHRALSWH EHYLttJCERVSH 
UEKLPAXJSTrREFNELORPLX£lCYPXSXVR>aCRQTEODUU^FYIW 
OAATLCSIFXLVSAYSVI^RXLWHNEEPANPLVIXOXNSFCYRSSKPKVCFFIOXTPX 
FTFPROGSLPGNTFMCRCFXOLVSAIXKSSNPYFSLLVGECljroPEOLACiAASLFCFCEX 
TCLCLPCEY ACRVPHDLAYKRSC LY ATA I CQHT LWTPLOTA VMLASLVNOCWYV PXLL 
LCEJrfBCEHVSYlJSKKKRTIFMPDAV/VEVLKTOfRNVIVCO 
[CIO , STAESlMRVTlI)Ri^CTWWDIWFAA\^rSD0DL5LPTIWIVnrU^^ 
PHAVXKXDMWEKXOORESFtRC 

CPn.0695 7R0201 7BU82 

homologous to CT495 

SLeVSHXKLLKSAlX5AAFACSVCSLOALPVGNPSDPSLI.XIX7TXWESAACOIC^ 
CDAISLRACFYCtVVTFDRItJCVDAPTCTrSMCAKPTGSAAA^ 
Q4FTNACFrAI^XWDRFDVFCTLCASNGYIRC>lSTAFKLVCLFCVKOT 
SNC7WCLYTDT5 Fi?WWjARtiALWECCCATLSAEFQ YAOSK P KVEELMVICNVSOFSVNK 
PKCYKCVAFPLFTOACVATATCTXSAT IHYHEWOVC A3LSYRLNSLVPY ICVOWSRATFD 
ADNIRI AQ PKLrTAVO^ILTAWKPS LLUNATALSTTO S F ZVH4Q IVSCQlNX PKSRKACGV 
TVnATLVDADKWSLTAEARLINERAAJrVSCOFRF 

CPn.Oli^ 7«t703 7R2S'»l 

CT*i'/t) nypnttvrtc^l pttir.#.ui 

wyTFYVRMPLLTY:WFE I CVO;:LF-TO: » KLT t K0t-M.'IA < *;AllF'3IOTnr(WWPKKKLY XFEE 
KNTILYl ttU-AICrU\fLWIAI,lt| [ f<KV lOUNKTVLFVf rTKKUAKt.ViniAMEAOETFIAE 
RWLfOfl.TWTTrRICMKTUJK [rjeW-IKNOAYhT^ 

KAf JLLWVDPlTEK I AVAKAKF t/ 1 f 1 VI J\r A*im« rwi'l- 1 m IV 1 It KUttlt.K:; fRL ItN 

vikexi i eaki iku: i f. rwi vk::i ,i vi -i x ^ :aki Ji^t-^ iukknh kki n.i akkm\^>n 

W *-tx-:w:»(h:n4CTLKvn' r/r:i,iri *ki-:ai.ivwi v»a ( KKA\Arn.kKi*;ij\:A*KKHR 
ctk w : 1 1 aaktoa** rrAL I wtmxt tvt ahha wu ki-v: » ii xni i i i .k y kvi n vkai ^ /jaa 
:5 vjur; :i * r/0F.LttAvm7iv : W n m I : ; h vav ki r Anc rrjr; r v : a r :» iktv ai .tmij * r: . 
TADSUUCD(AMIIWAA0R3rLSlfC3VPAEAUKt lASgiOCKPOr/tEKIVTCICUr 
fTOEAOiECPF t KNADU tQSLlUDrZKTSfiSSVA t ECF ILWX IGA 

<:pn_0M« 701-143 70420; 

^ tEALDOHJC I L I TTTACT^ PYLTTCTTnAALPACELNVUVT* I KATNWVTCVYWDP1RL 

Vri-.v. :-virrv:r,-v..Ki,-::, r«i 7<L;:M:^rp^T/^,r:„;-v> , ,\i.Kt.i 



CPn_0699 784170 784721



EWIJMWIRREANDIOJCKDSALTEDVV^ 
CPn_0700 785094 785609

CPTU0701 7 «"< 796673 

xtshfhki^^ 

QCFiWaIXYSICE!TO.XDEEVEIITSSII^^ 

SSlKVT^P^ 
ESF 

CPn_0702 789700 786929

yscC/gspD-Yop C/Gen Secretion Protein D

LlttMfVICrVtLHICRJULQGIKKIOOCKrCIl^ 

RggKLAACPKMSAASLSAKKSHTTOffTPCSI P SKVrSKTPX'rOUIir f U^l SflSA TPAJCPT 

TLmJBUCKPRPEIWTTADVICRSPRnTOEVEEWAASKBO 

AVNAlNWIiOCQLEEOTSTVrEJ^MCTOAT 

QOKSEEacwcn^TKRn.Tcn)ua*C¥T^ 

FNWWSHDPTSVDOLSTILLOVUCHHDUCV^^ 

TCIAVV\m*vn^YSVSPSAAVNIIOSTiSHDAIVSASEATR^ 

AALDCPCTSVafTEYEVKYAllPAALVSYCOOVtCTI^ 

lOTICW^YVrrAMDEDTIWTLNSIOWLEVIOIS IVI I® J 0®^'212SiSSSi£Mmr< 

YlE^LIU7TSU3CSWOFCTO^ALGDEOSKVXYASCLUanC 

XPtSpTOLICTSDKLHSSSAF^ 

ftTHfcflPrOQASFTVGOTVPTOnwri IQCTCTVtWItiYEDICVWLVVrSTyAWyyyTL 
OIEO^SELHS ASGSLTPVTDJCTYAATRLCI PC^^_ LV * < ^ GHI ^^^^^^Sf^Mu 

puiRaLrsirriXROKiwiKMFiicpxvissrEBcrrR\rtinc^^ 

APECOGPPSLQAZSDFKIIEIEAO 
CPn_0703 791205 789685

pkn5-S/T Protein Kinase

RKIGFMDCRGGI PLPEWIGCnrHVXXILSIOClJtSKVVHGt* 

TSRSVYNFUCEAOSUIOITHPNIVKFHRYCICWODCLY **H^Y1 EMSXJU^LAOTISLP 
QAIDXX FDIAQALEHLHSRNILJOCDIKPENI LXTPOGXX ^^^FCt^DJjfflnTCIORAHPSy 
XGTWMSPEOROCESHSPASDIYAXjGUAmiLGHUlfiRVFI^^ 
QPSI*mYSSTROTODIHHYKMSGOK3EX>U*IKDHTV^ 

FISGVLYKOGYPLYPHAYini<LECDVPNLHU*YSP ^SKATIMJVVTOU'COQIWRKi 
ORVCEINECUMOaPIDDCISILClilSKINKEl^ACCncrvr^ 
FSPGLtnaTSLOIRETKVAWEICO^VVCTt^I^ 
ESIHGGIOSROHGSNSPSTLISIJCRIR 

CPn_0704 792330 791209

fliN-Flagellar Motor Switch Domain/YscQ family

RYTKAVAADSSASWU(SPJim$SLGrTCOT^ 

FRC5 ITAVEATKEFCVHLLIOPMVVOWEVENliFLTSEEXJMCtMVAVFOI^ 

E2COT1XGFHYYTVAEACKLFEEL0WVPS LSWOA3CD AJ FTATSUJGSFOWD 1 5 l***-f*»!i 

MVRCRLUJEDTF^SCOKFFSCLKBOTUWrOQrrOQlSt^VEW 

SFIMLOSCLYOPEIXESCAlXTVOKHOFrOCRFLTPSSCEFTCITSYPNLTHEDPPLPENP 

^ASAAPLPCYSRLWEVAJlYSLAVSEriKLNLCSILSUSfWPAYCVOIIUX^lCVCflCEI 

lAWDVLGIRVLEV 

3Pn_0705 7? J 176 79J3J4 

CT671 hypothetical protein

FMELJtICTAESLY5A}CTDNHTVY0NSP£PRDS ROVKVrs LEOXQTRO ^^^SS KCNffRTES 
RKFADEEKRVCDEIAEVCSKEEEOESOEFCl>n^AFAC>4SLIDIAAACSAEAW^APIA 
VSSIDTOWIENt rL3TVESKVISEINCEnLVELVU)ASSSVPEArvCANLTLV0SC0M£ 
VKrSSFVnA7X>4AEAADLVTTWPSOl^SLVSALICCK0UTUCEFSVX»U.VOLPXI EEVQT 
PLfmiAJTIRHREEXOOROOfOCOKQODKBODSYKXEEARL 

CPn.070*i 79J68* 7*3 1B0 

CT670 hypothetical protein

YAVAXY PLEPVLA I KK DRVDRAEKWKEKRRLLE I EOEXLREKEAERDXVKNHYMOK t00 

LROXXCECTTSDAVLO I KSYI KVVAVQLSEEEOCVNKQK EWLAASKELEKAEVNLAKRR 

KEXEXTRLHKE£WMKEALKE£ARAXElCEOOE?1C0t»LFOLROWCKRESCCS 

'.'Pit 0707 / »Wis 7*13704 

yscN-Yop N (Flagellar-Type ATPase)

'/HMDOLTTOFf/TI i4JOU3)VNLTTVVCRITEVVCML r KAWPNVRVi'EVCLVKRMRHEPL 
•/TEVVOPT0::FAFU:PU;ELJCV-J PSSEVI PTTXPtH t RAaff^kVLNCLGEPIDVCT 
Klir WNlVTVt't FftAPfOrUIftAKLROtUm JVRC ICUNLr/AIUVK ICIPAUACVCKS 
.:UIMI AltNAK^UVMVIALU'.ERUREVREFlBUJL'XE'Wrn:^ 1 W:7n;O0tW0URLN 
AAYVrTA fAKYPH Uf.i W(0:nrntFARALnt ,, r:LAAfJEPt-AKAvnfT^VPSTLPnL 

r xn:/iA:;i)Kt rr rrAFvrvt.VA- inrwiErvAoevK.; r u* ;tt t vi .*:malaoayi iypa i ovla 
.: r ::wa,ta t wavrjuk t u ;kar evlakyk animl i ft 1';f.yw« u ;pk e t cfa i oh i okihr 

PI.NU(j[IIKKTriYR£AAJOUlArpR 



CPn.0701 "«0J * 1 "" 4 - 

rr^7 nvootriey.^i Br ? r .? l ?. K rr;. * rar.-S!-:. 

■ "IK***". r" ■■■"**?■' '* ..*.*■-••■■ 
0VIAicKlE5TRAU^-^W«I7rLVAI(JACPU- 



CPtt_0710 



7M482 79*210 



Sr^SSt^a^^^ 

SSSwESVSNILTAWTEMITMARAVKCS 

cpn-07ii , 7M79l i . 796416 

CPTL0712 799315 79<7B1 

S^R^ECPW/IFVIMISKSIGRWSAI© 

ToSgpgiciCI lllUi VM LLOAACLVUYLNI HFim^IXPOCVVVgWU^IAQflXQ 
^?p^RV^YGEISINVVVW»ILT^ 

icmot 

CPO.0713 iw 7 «" 2 

aawvMraftrscsTXYTOrvRHVEsn«tre 

CPn.0714 WU25 800091 

g^SSKERSpr^LJ^ 
TMWVGTSOIimiCVAAYXYOKCWITrCSPWTA^^ 

OTAOTSotscisiJ^iPiaixvronwRTFU^^ 

^S^UUaXLTCAAJaOWEIYtKWSHITOROIS^ 
CPn.0715 M1436 «034«2 

r^S^iiASOKOXSicoQCRaiPuatw 
ijcavnai^ifsvksvriockyhia™^ 

PLTOAVTS LRCK PMNVFS LXTTKHYTOID ELFYLAT ALG I T0NEI 0HLRYNKV X ^J^E!^ 
rraonRKLLITFFLXTLLPL\"ENNHLF I LETPtJTVronCTTTLYYYSlOEIO^OALDOFCX 
^^LE^TRF^LcEISPKEFAAFIGPElRLTPVTITSLXSISSI LOFY>CKNTX£RKOF 
IMEKLITDF 

CPn_0716 803466 804902

WRTCE^RTOr^AWILERAl PHIUXUCPVORRIXWTLFUffiDCKWKVANIAC 
n3^AfflOTD3REK»DILPAJCLPVLUJi^ 

SmSKTTW POFPsSlSoPSEYOOGLCS ITWASIDIINOICrLVVXOICPOSTTraiR 

^^^i ki ei klpkcsrakeklpllfxktecovi lyskptv : 

YESpVE^SEILK^AlOCYLEKn^^ 
l^oSt^IO^R«^^VLKGLLEWCHl£WTO 
CPn 0717 804«*6B 

^1^x55^?"^ 

RR^IEORCAMGOKKmiriOSILXSTFCLWeKOKDPAOKASItimP^ 
CPn_07lW ftO?300 R0S62A 

RT^.F^Ft^^^^^ 

VEV*EKTVt JP/.^LUC;iFU.TJ)L33UiU-Al.TK FE I LTUIDLYl.'AON t 
*PTi_M"/ I » »i|»*i177 MO«H'H) 

:;trJt iiTh^Mlniii i.lim* ::ynrh.ii:<0 

FDrrcvKicv i r. i;m-TVf::rTvr:i(»CTaiijoitYLTrnirrYCRAPYuni t u-yiLVOifTO 

i ( n-K/ATMi ir^ai i vr r otoKKreu jxlpka ni x*ri ec/;m u.vt KK(Kt»wvMPApc 

iinrn-i.viiAUJiPM/j-:KijtWMU'i^MiiviiiiiXpKr/r:p:[.irTAiPri^ 

rr w j'k: :yi -w* m»'jc i K-rrr 1 1 run ::io wnkhk hxr/r:: :»y :r. kavtih vvi a kmiklsfv 

ai ^ .HMiMrti r*?rr 1 1 / a « wt w nwr.w.i.urw i iay: .vi rt-ntrrrROF 

= : :r r a» ;i .1 « :i .i . : FKKitNKrr i Ihkm .t.C : 1 1 > 
LR tTHKEFMY ! t KNLVOR PEEVP tKEVOCTHTl . . ..^AKPOP^r: rCXECRTXKAXR 
TLLVWAjrpMMVRVSLElHEEK 

•:Pn.07:i <I07*7t qi84D9 

kdsA-KDO Synthetase

KRMVMFNNKMILIACrCVtECS ITLL XAGKL03 Il^PY^IOWFFKS-nJKANRSS^. 

:Ff^pr;LTECLnt:JVKvxrrr:\T;;LT:^™TPW 

^A.\.:x:>:L^-i .v*LLi txi/. : j t :: :.hy. "a 

CPn_0722 808477 808974

CT654 hypothetical protein 

YC LSMTKFLYCCLFYSI^tXVLAFCTHVA I IOVK3ICDV^CMNKKrO£3PPnJCIW(VWV 
SKOIC8PECRFFHCKID^CMELHFPOSSySClCCn»TRISCHILTONrEKOMOF1*OSGL 
UfrOOCSUfNmRFOVDPVP^CSPOKEDSSSQSMKTLyUlJIW 

CPn_0723 809978 809703

yhbG-ABC Transporter ATPase
ASHPXLS\OO.VKKYII!aCPVTNWFOZNPM 

IKIU^RCICILITOWAXELLSIADRCYLIIDGKIF^ 

SY 

CPn_0724 810602 809706

No robust homolog present in GenBank/EMBL as of 11/7/98

WSTWJ>fllSreiLSKlLPn*lJflMI^ 

NI PYrrKATRSCUWKnAYUlStiauraiAYOTXJa^ 

rFTCCSNIEDIX^OlRRPHRIUXtffSYCOKPKACPBCRFNDACRYDPSHPTCASCSICT 

MMMJJARKrrrmPTrcDIAlCKLKW 

GVC IRtTCRICNTTKAmjkERCVKPCVTILXECCFEVl^ 

CPn_0725 810829 810587

CT652.1 hypothetical protein

9CC2M»FFArLLYESlJU£UtKFTS>MQXl^^ 
GLTTIKAIAEEVLSDDEPLLD 

CPn_0726 813384 810880

CT620 hypothetical protein 

A^ItJmySTSirrTYXXI^LVSSKHSrAORHRESLEHIANYZrrTAIMILK^ 

RASERYRSAVEKLHKYEVS^TVWCSXPVMZHEXP^ 

YYWVKQiCKAOOLXVEIJm/MTTZMASVNSIWP 

FBTLYNTPEnPTAIOR AL/W lUjWllUr rWJLAaCfSNOATLTqTF AU J HV U^ KD ILT 
AVQGVLT PEQFT I FAX IATELQALADHVGNFDEACLQ RI ZQAGDOAAVrNSSDLTmDK 
IKFCOHITDLYSTOmLCSTOTLMXYVWHO^^ 
S(XmXS5AAUkGAICTAJlGLNSRFNSTA£0aXUN^ 

STWAUJFTATMDHVKAAI L£EAKQ<£liSSFOLASS IKSAKTSIVKSSCSFSVTVNSSTL 
OYTIYSEKNGKVXINOXU^STGFWEZTKIJUCTKAE^ 
NKXnJLQSQLQOFTrMKTEIJDeniZSQASEL^ 
YXXLYYSNU^XCraiXIttXSOYVNCOTYFNF^^ 

SLVLLQNYLQ PLS IACCSVAOT Y ^VKiJQ0E0WQARL0ILEEALV5GLVCNHINOGMrPL0 
CTXQSI*WFMHX>NFOIiaLaHKLTSNaJE*W 

CPn_0727 813559 816192

CT619 hypothetical protein 

KYYLTSMSTFSIONRUrTISCESTR I IKLOKXYSGFOPRSVPAINLEELNSG IYALRHLH 
NALOSENTWAAUKPNNT I FPTTSVrDYKKSRPOASSPRAPSSQTPTD IVSAAALALVL 
VI DGCLAZLVASVT DDLCALST I STVJIQ U4ASYLGLTTLTAEQEXWTSSSYVPS EKNL 
LEHVIC0E1WU^QAK0E£XXAVT£AXGVST££XEA^ 
RAJCVGAPIQDOlENAZOtAPTPPAttPIWVNrVNCIWIlCT^ 

EX XTXLQTLVPLVDKTTFTXACfDL lYTATOLPOTASIJCLYLTiniOIAEYRCKrnCVYON 

siqnl5eticrvviknrshlitoi£mfo0aqncfv^ 

lctavtsooo i kaylgtrg wefxatrhffhn i gdqky^f anftvfoclttanga i qpd l 

ggf xreaktnvctveaoyvsnaor i lnefnt aat ahvlq iq lq xaslowcaddldpgkas 

ftenrkfavaaw itseslcdal ismi ihsolpkoeaflkplx eexnfnnlaanalnsllq 

itnefsttswslssylvosktgowjagd™ 

ncllqk inslpcatsaqkoeml^ttyyoyslsvtt^ltv^ 

yoksvfk xesfodwi ptlaalesfl7scfpni satccicplrrqvqsdootytsocotoo 

UajONOKrTXQQEVrLVSTSMQVLNSILSOUACAIYSM 

CPn_0728 818483 816525 

CHLPN 76kDa Homolog (CT622)

'/FMVNPICPCPIDrrD?TPPADLSAOClXA5AANKSA£AORIACAEAXPKESIfTDSVERW 
3 ILRSAVNALMSLAOKLCI ASSNSSSSTSRSAOVDSTTATAPTPPPPTFDDYKT0AOTAY 
OTIPrrTSIJ^tOAALVSLCOAVTNrKCn'AATDECTAIAAIWETKNAr^ 
KYA30NOAI LOSU^TCFt^AALLOSVANNNKAAELLKEMODNPWPCKTPAIAOSL 
VOOTOATATC I EKDCNAI RDAYFAGQNASG AVENAJCSNNS X SN XD5AKAA I ATAKTOX AE 
AOKJCFPDSP XLOEAEQMV XOAEXOLXNXKPAOCSIjVPNPGTTVGSSKOOCSS XG S IRVSM 
LLDCAENETAS I LMSGFRtJM IHMPmElJPDSQAAQQEIJUkOAiOVAXAACDPSAAAALADA 
QKA^EAAI£KACQQCCILNAL50IASAAWSAG\TPA 

ktoisacydaykstndaycrarnqatrdviwvstpaltrsvprartearcpektooala 

KV ICGMSRTLCOVYSOVSALOS VMQ I XOSNPQANNEE I R0KLT2AVTKPP0FI3YPYV0LS 
MDSTOKnAKLESLFABCSRTAAEIKALSFETNSLr XOOVLVNXCSLYSCYLQ 

t:rtU072'i 91*905 *l«92 

CHLPN 76kDa Homolog (CT623)

PAWT^VuTLN I arKDTMKKOVYOWLAJWLLALT ISOYAELT'UiECKVKsJHTYTTLOEVK 
DYL.'<KRfjFVETf < KODCVLR I ACUVRARWLYFR ED ir.UPSDKDKYNrLFVNRYkSEFYlY I 
^JYRAFJ^IWIJT:/*KMNWAtAlSTI>^^AAWDt^lRAFUJYRFYK^II'tTf^TFFME inRJGUJO 
I a-'EItVUTV^FDlIUI t YWTREL:»D\'PYOVI VI(Cr;PFVVNMTKKI IYAWWB* I LNP LPK 

• IAFI JOU'I JVKATKTTI^ir IKKNI.AWK lit !TLf j;UlKA' ItatttATWYKYVEAUJVPi; t DV» : 
I * TO WUi: WAOAI AANYPPKPJW* IFTMYKt IK.IALYKY. J ITU II^KKAYflAYUKPAfiOk* 

iiL*:riFTKt'Knu*:ir::Af 

•.'hijiv to >ijir, it, si*rn. i 

uivjN ltiiiiii.il Mi<nhi.u»< |'i,*r.Mfi 

:a kk» > 3 :i «:RKt*HKV;:iAi<;: TFN tiw/rri^ f 1* : I Kfe R t amaty i-tiadi* I v aai-v 



a;FrrVFFLPK:U7;L:L^.. .PHFEFUWC^^FFFRPF^^r ? . ;^" E 

A'/tjAWLOir/Etcr/^ :litn ;LL.r^Fi^>i^a^ 
w i rrvrAARHSPffiar^i^vAL^ 

APLSXS tLTSS IFCUILlJOICWJrrVKtTCFLYtKYJIJa YCCTT WsrcWWllPA 
rSRCVOREDHERCULMKFVLT-wTMSVHX I^ACLU^LPnVRVl.YEHCLFPOSAVYA: 

vRvucryc i r pmalapl vrvLn'AORCY avplf : - :3taiak:v^,,-lcrwlcvs 

t .T^AT -TTAW/C LYF^ Y-SKRLPKT J K LLWEJ I RRStKVMCTT?CACMtTLCUf Xl.t 

TrrYvr flnfltpi aw?*-.".: tTAC* r Ar:.-F-r r n^FLmFAiaxRVEOL iklastcw 

CPn 0731 -*il4J4 ^JWoO 

No robust homolog present in GenBank/EMBL as of 11/7/98
VAIAIS RNI PV lRLCr/PDNILKXERAItrrSLSFU.1 KPFSPPPUCCOYLTOXSPrTSSt 
rTIOCSYFXLNXASLCSrrLRLRSISIIS 

CPn_0732 822092 822976

nfo-Endonuclease IV

MFKKVLPPPSXPUjGAKr^ACSUa^IY^rAOXCASr^ 

DFKAAIJCITra^YIMSHACYLINPGAPDmLEKSRICIYCE:^ 

AUCSSKEXXM«XVS3rS05APLF1S5PPLVVU^AC k OCTLXC5NTmCY^^ 

OIPIO^^rrCHIFAACYtJITSPOaWEtVT^FOEYVC^^ 

KAPU^CYICKESFKFUfTDERTRKlPICYLrrPGCPD*^KEICEUJ^ 

CPn_0733 823739 823101

rs4-S4 Ribosomal Protein 
OJCYMAirrCCPKOTVAttlFGANircRSR^ 

LXACVQ1XMEK0LVKAFXZVIHX0CKVAOMFX.EKFECRX^ X FAAQOLVA 

HGHILNffCRJnmRRSFFIJtPOIOISIJCEXSXRLQSVXriar FSIftirSSLPgYISLDICTCFX 
CEXiVS PBQD0 XEAOLPI-PI MISWCEfLSHRT 

CPn_0734 823863 824915

yceA 

OWTTEHFSSNGNFLOCNYFODTVRWIMEKrYYALAY^ 

CVSCRIYXSEa;ilC0rSCYCPKA£LYH>rUC£IUt7SKXKF^ 

KEIJUUjOCEVI3I^XOAJ«ISPCEWKElCXOE)mc^ 

RZFWADOAOECDPnTPVHMYCrOCIRCiaYSPVUJK*^^ 

CTtnOVU^FVFI*ORIJkIPIDESDPCVAPIAirCHCOTPS 

CIHOHQGCCCEraOSPRVWCTOSSRGNKPFWUkHLCnSTCr^^ 

CPn_0735 825680 825003

Uridine Kinase (Uridine Monophosphokinase) (Pyrimidine Ribonucleotide Kinase)

CEXniLWaJWIICXTOGSCACKTTLTOWIIgirC^XTWXCO 
LITOTPDAFt^U.ISDIICRIJOCrciVOAPVFDFVLGN^^ 

tpjcyasi rvKomtownjiLSoruKOT^ 

CPn_0736 827731 825992

ygeD-Efflux Protein

RCEXXKJWX^VAFtfTVSViaOCSFRALVTTHFLTI XNXMLYICFUAmXIXBCXtmA 

KIl^CVSFTFALPTLLlJU'LACSIJkDRFOKRNIIWTTlFIEILCTII^^ 

YVVLZLH>CHmPGPAXX£ILPEX4X^Sf»LSCANSIIffM 

KRIX^mSYVWPTLMCVZVS X ISTLISFCIRPSNVlOWKOKXTLVSFTOLWrmjriTr^ 
YtTVSX FXCSFFlilGAYTOLE Z X PrraTUCYHWYCAYWrVMXNUH*S* ltUUS 
aa3IKICYVPlAAICLAL\TH^YAFACSILT^FUAlCFU3^^ 

pqcKCOiiJUWwrLDiTOrtyAACVig 

EHNnriaiXSZXLmLCYYLKZHOSSSPICCYFV^ 

OKLVPWIUUaUUCVPTV^SVRI»^ 

KNVCRFTSILOEOCIDI^PIOLVOKIinaCRVlYTLVFPHA 

CPn_0737 827469 830756

recC-Exodeoxyribonuclease V, Gamma
reRSAXLPASGASKRKSUUOaCLTOERIFAF 

SAI>fDFFLTrrVMNATIOCRASFSNSPRHUJWLJ^ITST^ LVANATTOH 
WI lO^OLVHVLSDHIFJCSriFTASDS IVKHLFLCSCCSOPNI POYLTLPLUHMZLIIIS 
KASKFET«REFl^PPTYrPrKXIJkAAFlCOFHTFSORPTXNASH^ 
HFTTXLNNRT0EEX3CSLK X FGYAHLPKHIAEFF XNLSTYFPVYFYC FSPCftEYTOOlXSD 
RAZOFn«f0LPOSPXaiAHEMYVUOROAUANlAHKS0SSQ^ 
HOS5£^ZONSXU3UCPTSPOOFSQTKOTXCXYRJaiIXPR£VOevrat^^ 
EIFXLS5HIE5YKVHLNAIFHPHVPXYrT1)EVDPRAXDU»0CILIX^ 
OLLTHP0L0OP IDONXVPYLIKXLSSEHCK ISSKDRASCOOMIWU^LILEEYPFKOeCC 
RVSOVEVWrPTVPLXYF lOERINLYtSSSOHSYCOLFOWrSCLEXIFVLSPErrSFITT 
LRNSUPTFOSCSLl^rTDFCUFLUFHKPSPLYOKPCPYtCSl^L^ 

LPSPFt/MXKTTL>OLP\TTLPT0PYL5AFFKNXACLHTSOEYNYSl^ 

Ft PTVKOVNLPOHLS LNEX XKC I FSPLOLFUOTIYNLRI SYPEHLKKCCKLFPTKHOIED 

FWNECFVDKEKDI.I T$IS PHAEELTTYYREKT I LLRMCLDKO PKHS PYTVTFSSS XPXER 

PYHE^IJPri^r^FOCNP\^rHCTIHCVrNEgLYI^SrDPRDSLI CKt rRTL SSLPCTSS 

EOKOLLERYVALAVLOMSOKCfSSClSAL X KLTS FHTKENHH PPFSOPEOYUUCVlXVYKlil 

SSQPXPU^PLOm^EEKFHOAVl^XSEEAKNPSLPXFiQFKr^ia 

ERLKXLSLFRCPCEAV 

CPn_0738 8)07n 833895

recB-Exodeoxyribonuclease V, Beta

K FYtFCEVPVXPFN t FOSNCS IOCKFFLEASACTCKTFTI EQ XVLRALI ECSLTHVZHAL 
A ITTrKASTKEUCVRI KDKUOTUEUCAVLNSOPAStPraOXN^ 
TL00K3t^IHCFtrNF\1^0YFPrTRLIHr«PALTHS0L*/l^rr^ 
OFHU>VRYNITt;Wr:wLVDK:iXAnYTOPi:^Fr:r,R7E^EX?rCLhW0OIYNSUXIP 
KOVFt^LTAHIGCFKKOPF;;iLCCLHllFVDLLYTSET1C3LrSFFKIAETT>IFKKRLAR 
YK PCAAFTVLENMIWV'KKTI J>X.TILX»R I ►VTLLVDU»EYLKOMYTPWU; POESVFALDCL 
I^SCAOPVVVAZ JtFOVOI.VL r DF.FOOTDKC/AC X FaiLFI3rKFTC:;LFLICOPieOS IY 
WR^Lm'l.TAKj:.-:r ; ;EDKOl^LVIffr/R.'-Tri' LMEA ttfQfFCKX i;rFLEI PUYt^ICY 
llAIHIO^IKTFENPniAI'l MFKFYrr tr.OrjAt.WIFr.EALPLOKEOK [ PU*MMWLVSDSN 
'JAFELX-YAT tPVWKHK:: t PIILTETI 1 1 1 .TTAU.EA I LIIPENYOC ISK IIT.'-.TLFCLSL 

myrma^rr t ykom jk:y t .:i ii r ;t.i Arr^'rtfnvar/u^L*:ri*( :n t iFVD(CXLCCY 
: 4tt i:::;YrYt vLUtUNr ;Frn »w» a r ;:;r/::uuu:rur. itt i i i^'jc. JiiYOtvrcPG 
i Eic;KKriKS^:EU JiOftVA* vpaitoi.y u- t :.tqpi^i/^ J:yvLWYVKLinroniiAYO 

I^IIKJK^FJlPOLFUY:UJ1(rai(*HA"n"/liJtJl,LGTFAl^*/TPPKTIl^K;t:rTKFLlJTT1(K 

ix x>::tPY::KLi- 1 :«c\>U'U :tarr; i r. fur t ri::: tOK:;u.or/rEY[«!T rmtp txirrtiLCR 

KEKT I LKU^:KThT:TI .TF:-jXTTF::t &j VI .HIF I FHPrnPLFLEMJttia.1 V t OLFFEHE 

i kyyi [r*ocr.Tu:t:rNjDY::K;ait-: i ■/ 1 kwijw* t tvk avkkfuokf. ( nocA/et* 
'TVfFlVt'' LTjdNCFFAL.'nr.EO I PNHNPKA t'jt /H 

:;pn.mr' "J43V2 =03**1 

•7T hvt>orry»c ioj : protein 

^r/LrxucrnLRNKrrKrr^ ; r ialciufrsi rcr/YDK iRcsrvjLKvxrrpKiKO 

APCnHLA'JLCLCNLVTJCER-/A3l^UtLYEVaiKTPPLrPE:LTPYFHKLVBCiemra 

n-HwssscwvwOTniwwsr/u^vi^ 

-.g rosww i kiclrel r poveo rsHAY : lekokydci sowelklxscegenoallrc :l 

i\tw- . : . : :• :: : " — * :kti:.i • :; • y.tt *Li* .: :s.\i>'rr-.r,\i P : 

w nrrrv.. rr.K«*r;: :::.j-rr :r»;L.Lwr. 

CPn_0740 836054 834864

tyrB-Aromatic AA Aminotransferase
SYMSFrNHIPTFSPOAIUJlCW^ADXRPEXVNLVIC^^ 

1EEGQHXSYLPISSL0 IFXJIMRELVrGAVPPSAXVGPOSXZjCItlArMIfiARr FWAKSS 
GKVYVTEarvSNHXRIFSOECLEVXRYPYYSXITO 

CHNPTCVDTrCOKWKELAXIilXIRQ.1 PFFDTAYQGFAHC I ELORXPXE XFX SEGNTVLV 
AASSSXNFALYCERVCYFAVHSTrrOCLVKXHSFtEEXtRCCYSSPOIWCVttVSTILSN 
PYXJCEDgQSEt«FIRESLGia*RT*rWAlJlW^ 
RCOHAVYTTAGCRmUCZTEKNICHWOSFIQAyEL 

CPn_0741 838383 836185 

greA-Transcription Elongation Factor
EYIFTlLJCT^r/I^EXLQV^IEI^S^ 

AS LFGK IVOTWPLWEX I PESKDKDRVLQLILDLQTSNSOMFFDIA77EYVNKKYSCCCKF 
NEAXJWVCLRXXjRI>FOFSl£RFDfUIHMKKr^^ 

CIMSAKDISFCTAJTCSLTPL^DHFl^RRrcDPDGFL\FAKQfPI£VVZIIXRDI^PKTA 

KEXJCDttVDLVIPEAlMatt^SAICrKIlCKOT 

L£L3UtSAEX:SUYKFXK£OS£UCNIEX^^ 

YLCXIO^IOXEYZTSX^EDCTSRU^MPIVAlAKSrULVRJCySSFWOOV^W 

TSPTMRDFvyrriKiropssvE\ajacRiiDSAHOP>#<FPnjwr 

KDUU:FXX5AUIFHyCVAST7KICELGXKU{HYt 

STK^FSSSCIJWiaSLAEVVgiTtJCXH 

KVE»t*KEIEDAR5U£UlENSEY1TAUKRAJU£Q^ 

CVCCmUOSOACeVVEYTII^PWDADPDS^ 

ISRXQSIWEEHGA 

CPn_0742 838443 838888

CT635 hypothetical protein 

TKMMVXVKNSKSAOX:i3SrKCILTIYNIDFT5PSTGSSLSSDSWJ^r^ITTO 

IJKRAQEILTCTCWSXECHI^ANNPr^S^^ 

OLHPTXESXRPKOKLSSTXXNKXmWZPL 

CPn_0743 838936 840362

nqrA-Ubiquinone Oxidoreductase, Alpha

XFM1UTVNRGLOLSL0CSPKESCFYNKZOPEFV5I DLRPFQFLSLXL1CVEQGOAVCSGAP 
IAEYKHFPNTYTTSHV5GVVTAIRR£NKRSLlXn/IIK^ 

EIFKENGLFALIKQRPFDI PAI P TO TPRD ^DJIJtf^Pn'PSPD(HIJ^SSR£ECrifV 
FWOVRAZAKZJIStJtPH IVFR0RL7t*PTQELKTIAHL>(TVSC PFPSCSPS ZHI KSVAPZT 
NEXiVVrrLSFOXMTXGMlJl^iaiJfEGyTA^ 
LraiSDNm^ISGDPLTCRLCXjaXEPFUIFRriH^ 

TmUGTFIOCICRrYTOPtmiLHCETRPZ ir7TDIYDKVMPHRIPVVPI.IKAVITKNFOLA 
NELCFLEVCCEDFAlJTLIDPSKTQiLTIVKE5LXEYAKZSCILTPH0D 

CPn_0744 841387 840389

hemB-Porphobilinogen Synthase
EHSSLTLSRRPRRNRKTAAIRDLLAETKXSP1CDUAPF 
LDLLLKEZZRLCTYCUMVMLFPZZPIXSLICIAYCSYSS^ 
ISOZAIJJPYrrHCHOCXFLNCZVl^DESVRXFCHZATLKAEHCASZVA 
RSXLDOSGYOTSmSYSWASCLYSPFRDAl^SHVTSC^QYQ^ 
LDETEGADILHWPAGLYXJ^YRIRQNTCLPLAAYO^ 
KESLIA I KRACAWI X5Y5APFILELLH0CFEF 

CPn_0745 841903 841742

No robust homolog present in GenBank/EMBL as of 11/7/98
VDSCFDIWRASSLQCSTTYNVAYDPXKTIAYCFrNOVSVKKFKLXPPKSO 1* 

CPn_0746 841939 843567

CT632 hypothetical protein 

FSCRCPFSFEVFMI^KEEEFTCKOKOCLSHFVTNLTSDVFAIJOJLP EWKGALFSKYSRS 
VIJUtAIALXEri^NEErOTVCOEAYOFITtTO 

MENVS I LiAAKVl^DARZOCSPLEKSTRYVYFOOICVRCEYLYYROPI LMTSAFKDHFLGTC 

OFl^OTSALXPgVRAYrEXLYPlCDSCTFASAYATSLIUU^^ 

GNCRIWtt.IHXLOGKNlAEUlRl^a)ESLTELHICVIW 

KEOLJCLAEOATFSEEMSSSPSVOLVYGDPOCIYKVAACFLFPYSNRSLTDLIDYCKXHP 

HEDLVOILES^/SARENRRHKSPRCLECVn^riirLADFCAYROLORHRTLTOEROLI^ 

MHGYNrPVFXLDTPHOCSYRLAMERAKCTYNEIV^IFPEEAOYHVPMAYNIRWFFH^^ 

AWWICEUSOPQGHQrATlTXATCLVREMn^ 

PTT 

CPn_0747 843949 844053

CTttM hypothec teal protein 
RTCHCCKCAEVO I LSSRSLSCMKX LS5SLFYKKFC 

TPn_074R «44i»«»6 844121 

ispA-Geranyl Transtransferase

CTUTJIAUTn-RPC I ESAI EXALECFCPICHP I RSP7EYAL£OOCKRLRPCLVCmA0CL 
CLMtDVMOSALAVEFVIrrJTL I AD0LPCMDND0ERP/WPTVHKAF0EATALLA3YAL I PA 
AY.*<t (LALNAKKLKEOCCDPRE ID I AYNI ICD ITOKJ J IGC3CVLCCQYDDMFF3NRG0EHV 
>.** INIKKTt::7LFEIACldCWLFnOCDP0FAPt tTIF.'SNNFnLLFO 1 KDDF3OL0KDSQO I 
• II J/Y ALLfT/IEKAAL^UAHSCWCLELLDRLS/VJCLKJCSEFET I tS.TLC.*F 

<:ili_U7V •*4'ip1« 8 '100(1.- 

ttlmtl UW»-.:i.jia.: lyt ».u**>npnf>ry t.»i>.' 

w:ykty(a:^:i r::ir.nn.YiF.i ( : ikahytwd 1 t .tlhi/jmleni tw: ? : 1 1 if 7TVEr«(jVTLKN 
i ek i ci AEr»AYvr :i jay i v ;n: i u;: vrtVRjuiWLWMv tTt:::p.t wi ;kcte t knsylc 

i II ITKAAIIKAVI * IC*: .VI ^ 'IVNLt W WRi.'AMFRIXmpri t YVR:.T::iW HKK I DTCRRKLCAF 

u :ki iVAi'Si -WV I Nil vt i I t.nrrft t fcr* hjv t 



tctD/cpxR-HTH Transcriptional Regulatory Protein, Receiver Domain

KrroFiuiXHSYsiLFcnm«OKi yjww tmxus qo/ywLW 

ESVA IFCEYLLLPEO IFSFOI FPEEDC IVtTt7TTQcI^TKVLIIQCA7\*l liRPTTWCVL 
OAVIRAFLROHEVLEHS ! PtTTKTFCDKTFRVLNLV r ESPECSTCTPStAGXIJCj&IJNR 

ghlclrkkllae: kcntke: xarnw^ia^lrkki^pycsk^tirgvcwscccsip 
lqnkoktahpnee 

iv. " . *..*■;... : ! v ^^ y *^M 

MT RCX tFG XFLLTCFsi^r-YYUX* JHOV J ICPKEKSRSVW IEEfJt£FTO»*VLHMLP50 
HOMUILCFOCFL^KXKr^AEXIFSKVVSEAOOCPTUTEr^^RU^rTIJXTO 
VMETIUUJ*QRCPNSPYYHLFKAXVCYKOKLYK 

lXLVB3XSADIYFTnYDra^LXKrYXLE0C?YACX^PEE£U^XKEM^ 

PLX0I^MW0KKYVHPNSSLvV0XL\TRF5T10<ECAXRFCI^ 

Em^NKVQOXKT^EAXOC/AUJiXUPSXSXSEXLAI^SSTLSNXV'SS:^^ 

LDLWIAIOSYOI DTOLVHHLmAXOIMUCGS^EKAlKU^l^lVTSYPlEaSW 

LFHOOAYKOAtiSHAI ARLUCLQCF I SEAKZ PSXVI SEA£KAHFLAOAEYl<FAKXSYDXC 

YX«YS>*C.TKVAPSP0SYRl>Cl^U!ENI0RY0EAIXFlOiI^ 

HOSKORAAS 

CPn_0752 848595 850082

recD-Exodeoxyribonuclease V, Alpha

GWAUfT EFAPFl.EOX«VHO0VX S PU) XAFASXHX SSDFEESFVFLAVSSALWRYCHPrLSI. 

Ea^XRPSl^ISrTPLYRCFWLPKIJUtDXLFVWSSRX,\XRSLYTXRSr JTVIVir 

SATTNYTPP5XO5STI^EE0^XFTlKrrOOCF5IVSOCPCTCXmAA0^XU 

LR IAXVS PTCKATSH XROZLMCrlfXFOtMVtJlQTVHHFl^JEYAYRRYNS ZOVXXVX3ECSM 

VTTI^YSLVr/rL0CYllO3KKLYTSSLIILCimCLPPXCXC\TSWLCX^ 

HEPIDSWa^OVVYNYVKSVHKSOCSEYDEVIVI I PW^EVFCVSILYTAXTRAICYRVSV 
WGDPETLKXXXKKSNY 

CPn_0753 851009 850161

No robust homolog present in GenBank/EMBL as of 11/7/98
I)UTAHXXXROALXJ{LR5WrPAXRASGNLriU»S}C5IJ01NW 

liSSSHYAHAAI^KTIXFLC^AIXTV^AVA^HIJCO LLNG5H X I fl ULll^C JW CW tAP 
AECXHIX}iaQRR£AI,TXKaCVAJUASlCrCT 

VATOCSLTESSXSLYRZLSTRPETZSTlPINRNirpSAEFAAJlSKAZIlNArXAW^^ 
CIUIOT^FXPAIL^^VLIMAII^LISCVIOTVXDYAKIC 

CPn_0754 851381 851040

rs20-S20 Ribosomal Protein

FCASXJOOOTOATX^NLOSVVSVVDIUWK^ 

CPn_0755 851379 852799

CT616 hypothetical protein 
YKBLm<LL\nxuurrcrKY>iiY^ 

FTAIJUl&ENOLKTAAVOlJjQTCXRJa.^^ 

LF10^y^VlCTTCTO)KLETXJLLKR7mw^^ 

IAYOOAI>©EYOATFSEOR>#CJiaOIYIGXLEKKVOD^EIRN^ 

ESNAVTOlX5UUSEXJ0CXAF)CA£KXEAASSLTASRYLKTt7TSV)<NYS 

EENtIMl^VYARO50RAVFAKAXJ7nWXUYCA£0FlJCFCS0 XVI SGGHX>MEDUt&8XSX 

CSCRI,VIKTKSROTLPFTnfaj4AIlDCCPirYHNrt^ 

CPn_0756 853889 854676

rpoD-RNA Polymerase Sigma-66
XSYWTICLSSKAIWLVt^VRKXJM^ra^OATEV^ 

TYtEXNEXUHSTXTTFEQIIXT^rLTMIOVW FfTTiAllRTE 

GTITOPVRMYIJCEK3T^II.TREECVEXSKRIEKA0^XERI 

XSQCERFX^XISCCEVEDKTOFXJCLUKLZTI^JCEECTY^ 

NDSLEKCRI RTO AYUICFHCRHNVTEDFC EWTKAYDS FU<LEQO INDIJCVRA1RMKFAA 

AKIAAAXRXLYXREVAACRT1XEFXICGVTO(Xj0RWKDXSOEAI^^ 

TNRCLSIXDLZOECMClJOCAVtKFEYRRCYXFSTYATW 

HMZETIlJmiWAJCKL>WETCaCEPTPEEXAEEl^TPt)RVRE I YKZAOHPZSlflAEVCEC 
S ESSFCOFt*£OT AVESPAEATT7YSXLJCX1XMXE 

EVCS ATNVTRER I RO I EAXALRXMRH P I RSKO LAAJXDLL EEEKTCTSXVKSLKSK 

CPn_0757 854709 855134 

folX-Dihydroneopterin Aldolase

PC XKKXALV1 AI ERYQL 1 1 5KFRHWL FLCCSVEERHFKO PVL I SVTFSYNEVPSACLSOK 

LSDACCYUVTSL I EEXANTKP Y ALI EKLANELFDSLVI SFCOKASK I DLEVXKERPPVp 

NLLNPIKFT15XELCPSPVLSA 

CPn_0758 855104 856459

folP/dhps-Dihydropteroate Synthase

RA>«EPRFVCLSLC3hOJC3mFXNWIAKTLLCEOAVU:LRSSVILETtv^^ 
LPYFKSVLVCETr^LREU.\TIKQIEKWCRAE^ 

TEITIPI^fCXSRPFLlAtIASL£PYRJlFrTOCSPYHNFTrCELAHHLPSPP«IRRSLS 
PDTMUC^™rrND^SIX^MFLDPEKAVAC^EKLrrECA^ 

MEWERLEPVLRLLKETUSNRKOYPI ISLDTFYPEI ILRAMDIYPIOWINDVStaCSOSHA 
EVMOXLSLVMNHSZZUWtnilLSrSVPlGZQLLSWW Ft) 
PC ICFCKGAAQSUATLYEX AXFKRLCCP I UCHSRRSFL5LFCNH0PK0R0WETVCLSIL 
CAQ0CVOYI*RVHNVAAHQKALSVAACEACAP I 

'.•Fn_0759 !*S*4!4 1S6n'»7 

folA-Dihydrofolate Reductase

LLVKpyilPSNFEKPl/j-VEHJICHP/r/Pf; r/ACDPRG*/ t'-'LECKLrW] r.TEDLOFFSETIOK 
FP lVHnRKTWETl.PPKYF\TRA*//VF::HF.KRv.S JVH'JE I WVTi*Lr£FLL!JX. , »:*PTFLICC 
'".ELY^LFLENOlVRDFFt;:!! IFrrM':WFFPI-::LLE7VrKTVIJ»ntX»K rTTCYYENHMJi 
(JffTCTIt--L 

i.WlJ»7r.O -*<...'..,; v.7i.-M 

*rr*. t J tiyvnrn*:t. cm I ri'.f<*ut 

RtfCPKlXTLE I PKPr^kVTHK ITT/KTI*C I Yf •YUfJl.Y.'! t LC^.K.PK I MVS',:'. I WtTUK IVH 

l^; wyiWELKKv.-.-Kum. t kokalayv+-/kkv- ; i ytTKKwn i l f r:yu r UKinvn ^kvly 
i ♦DFLU'M/ru :lwlrnfyiii.fj k;;iii: :r* :i met mp< rtMnu ;u'wwr :pvn .-/nwi « r 
Dr>vmALKKTY!;NixDt;L::AAA7i/ wjhrifta^rt' iai ieeapk iTFit:::;rTri4«KrruA 
iAKDGDLYf;pixo:>*AWETrAKr:: 
NUUCREOTLLOVMmXPKl 



:PnJ7M MV7*-1H 958)7$ 

:TTMD nvt» ,p n"C tc.il pror.mn 

- tKTStf t ELLOXOl CDOHMUCHCrYORWSBCKLCKOOLOAYAKDinfLHI KAFFCYLSALH 
ARCOOLO IRROI LENLMDEEAGWPNH tOLWRQFALSLCVSEECLAKKEFSQAAOQKVATF 
R RLCOMPCtJ^VfJIJGALYTYE 1 0 1 P^VC/EK IRGlJtEYTCVSARGYAYFTVHOEAD X KHAS 
SEXEMl^^n^POAVl^SOEVUTaWNFLSSFINSTEPCSCX 



LSLLYFSRFSEK0NYSU3MET1CRSI YMNLPBRXKAL£AAVAYIEK0FCAC3 XMSLGRHS 
ATKEI ST r KTCAI-SLOLALCIKGVPKCRVI EI FGPESSGIOTLATW IVAKAOKK3GVAAY 
I DAEHALOPSYASLICVmOXtti<I$OPtgGlDAtSXAm 

3O«EGPICtfl/KVGLQARH<S0ALRKLTATL£^ 1'IU* 
RAUCTYSS XSLCX RRXGS XKCSDNSOICNMICVKVAHMCLAP PFRIAEFDILFNEOISSA 
CC I LDLAVEYNI IOOeGSWnTOOUCLCOGRErVREEX^^ 
TPSVHANITPOCVPAOTVCA 

CPn_0763 860530 859972

ygfA-Formyltetrahydrofolate Cycloligase
HFPWTDPKXEKSAiJWLFISIRJHJLSEEWCHEASSAVX^ 

CKCCAIfflXLXOKCTtJ^X:DOE3^YWLXPStOCLX5VVHP1CDPFSKOTPXSSm 
LVPCLAFT)QC<rYRUraHGrYDRWlAQHPYPSIRT^ 

YLC 

CPn_0764 861819 860524

CT648 hypothetical protein 

CYKSHDXKKUCLFLCSSLtAMSPIYCR'IltUY EXL TLTCXNI IQRNSLSET ICSKEXIJQC 

VTfCVDnJVPQPYOKVMRHYTOKRCIKVSCLTAYHTFO^ 

(»XiaOA£VXGGXADU<PSA£SCMLFXCrTPAY>a}ZSXIXAAXVrEKCLLK 

GNXWCECPYHKC^rPOGXFLTYTSSCKXIJCSOKVQQGKtQfCU 

ECIUJJCMCVIJIFQTHEtYATlHECNGXOAXYSKYAVXCTRAFYWS^^ 

IVCTmX0GAXKCEEITFY7CTX2XP1CLLX/MHBnX^I 

KSGIiTrm>BW3IMATOYI»raiIKCrmiPCDRHPYSXXX}I^ 

KXPYQDGKPLLN 

CPn_0765 862415 861801

CT647 hypothetical protein 
TriYI1GLl£Wjaa0rfXSXLXt£Fl£Ll£ILSVIJOT 
DHEDQVlKMlARIgttDRMNI^lESLNASCKO^ 

XRAI^QCSNKOLVWNCEOHKNPFAfVRLEOATOCCKEDX ESLTS LTNPINPVAPLV J- ITLW 
lOrfTROTTPLCNEWLTKAEAXSRWI 

CPn_0766 863785 862394

CT646 hypothetical protein 
AMNFTOJWHIGLTl<AI2WIXUILQKTaC^ 

S LQCSDI LVKSS S SSLKKRKNI UCVALTJAXASIJ^PV^LIVOPQLOI^rUittSrrWLTL 

WXA0»niJaCEU7l^0A0XFP0KLSCRIUU)XFP1AEXlSPXJCSI^ 

FVKMKAXAVARS F^KKSTXXSCOOXHATL0YXOCTFP<7TVIf AXHVAOX SPNLOXXLEOX 

LSLPlJVVCQSfTYGVEXlEDWEXYCOTXAAAHHGJUniRPL 

SXAJGlCYAlJttTVVVSLCKVXJOUCSr^SSASOT 

KNSASNYFXI^IPTSEOTLmiAI/MSre 

VK S Q GQ PEPX PCFUOCISSHPKLOHVSCSLEDORSrXLOrrLSS 

CPn_0767 863876 864177

CT645 hypothetical protein 

NXKLSYUATAXNSnrSFULAYXFASWPOOOSARWyOLVSKCVD^^ 
IDPSPfVClXCLSXLPFVXLRVLRFX ILNXFHSPWLLOYL 

CPn_0768 864144 865163

yohI/nifR3-predicted oxidoreductase

YF5FSMAAF I F X KNXLLRSS rVTf APLAflrSOYPYFCMSALYOPCLMFCEMVICVBS X LYAP 

ERTSKXOJfYNENMRTIGAOLCGSNPrrSCEAAKILECLGro 

SGLIJCrPELICRILDKlXNSTOIPVrVKIRSGTOlEHINvtlTTO 

TRAQGYHCPSKOrYX SRAKAAAGKEFPVFGN5DX F5 PEAAQAKLTTGCDGVLVARCTLGA 

PWICKQIODYLTrcSYEKIPFIKRJCAATLZHKRLVEiy^ 

AKVRTUISSLAKATSYQEWOLVNDYEEADDSSLETFVKC 

CPn_0769 867763 865121

topA-DNA Topoisomerase I-Fused to SWI Domain
SrOGPHAIRLHKKSLI XVESPAKIXTtAKXXCSEFVFASSXCHXVDLPAKEFCIOVDHDF 
EPOY0VLPDKOEVINH XPJCIAAJCCEXVYUP0PDACSCAIAWHIAN0LPD5PLX0RVSFN 
A ITIOIAVTEAIJCHPRT IDMALVNACAARRXXDR XVGYKX SP XLSRKLOQRSG X SAORVQS 
VALKLWDREKA IDAFVWEYWLRVUCDPKTTK7TWAHLYAVnOCKKWrKEI PECXTEN 
D\AXINSEEKARHYAEU^KSSYTrrRVEAKAXRRFAPPPFITS7X00EA5RKFWfSASR 
TMSIAQTLYECVT&XSEDSTCLITYMRTDSVRVDPEALTIVREYIQQTFG^ 
YTTKKMTODAHEA XRPTDXmTPDKLXtOa^OOQFXVYNLIWICI^ASOXTPAXYDTlAV 
OnTDTEIDUlASCSLXJCFKCFLAVYEEXOOOI^TOEEDHPLPPLHAODALIKEEVSOEQ 
ArrKPLPRFTEASLVKEL£JCSCICRPSTYATIMM(I0SRrnTKEN0RI*RPTELCKIISO 
FLETTIFPR XKO ICfTALHEOELELIAXINKKPHKIIXCCFWtTFLPVVXTAEKEAV X PR XL 
TNIECSKCHKGKLVKXWSKNSYFYG€SEYPECI)YRTSEEELAFNKEZW^ 
CCOVMKVFHGRYCTFLGCEXYPECRGTI S X HKKCEEXEOCEPX PCPAICCNCK X FKKRSR 
YHKIFYSCSEYPECSVICNSIDAVXTRYSGTEXXPYIOXICTJ^^ 
CKAKSSVKF.SSEXrrcPlTLPSPttAKMICNEPVSR<^^ 
LVPDM4LAT t tGPNPI£W)LSKHLSQHLTlCVSNDX3SASS 

:Pn_0770 468122 rtii-i>131 

T r ,42 hypothetical protein 

Y. PFTRNVEKLEFVTCLCSPDDOL [TFNKOCLIAOPEEEKVAFLVRSNAMLOACPETPASF 
i'E.'JLREGFB t FPnrVEVLY^NECLDVWEACCTWI Ufl IEVT tOLRKHHRKASRWLGMYSRD 
LVUMIEAVI IAVRMKFH EPVFESVLAYOTSRWfWRRFFnPLFRS POESVLLLF FTILCLC1 
::t.WT/(>ACtLlKLVLt^/FliffiU>lACSYLYRAMKKtPK>tU^PPU«fVLLRLTDKEIKMFA 
FKM PVLT3r/AI»KRKLENVkWKO tYQSYFV 

HHA ftllytfu-t • »:'.»* ::uni''jl 
r KK' JJJIKP l./li: !: lAIJJMJ-WKyKL.It.KYL P^JtMOC/lLOMlOSPLTELSSYV* /*3EI IDNP 

i -'KM t jossm: : n -/ p rrw :tfcylnott( ; rot; lvtp.llpo i eeafvtaeerf : aho i 
•vjh: :t)i-7 ;i j*i j* npkgfacklelplek ihkvwpt rvt ilsmc {a£pslqsywmkllrh5S 
■ K'j* r. iwurrrt rum* .-epap t mkkf3L:;u:euv: i lkkalos t pwcpaaactvkpmvs 
ti i ,i 1 * t vi j-'v: w :;;wk. * w:rn»ou>:: i kuiketftifyqilpkeeoknUQO i lsajcwl t k 



OK t PAPYrtJ r KCUACSLJFIIE.T7 ! FP A t ENKAVA 

zzzrs t JDR :takoiptar 



APrCtrPUCMLFPRCIIICOMHSrENVUJWtRWrATECTP: 
RTVAJCYRAOLK Il;fAI(WR»CFY« K3JI4^r«E»CrW 

CPh.0772 <I72400 ^046» 

KLCLX^Isia^RKAVTAPLJ^ 

VTTTOOUkRXLKEP r.T^A^TNEFDVPMyCTrKSI^ I LRRS II^^DI^rYOOS 

r Kv/v-"r^ ;7.;*r'— ' i:j^.*rv*;n:.v::. : ;w v:.: . . yr; . * -r:»iAvrr:--CLil:BCHF 

MVFAVCOPOOS X YiT*VJAN i KW X L/IFENO Y WiAKVL^*l^£Wlu> YTJKXt>iAAMALX KMiA 

SRl^X£lRSVJCPCIXIRUTJC^Tt«EEAX>rVA 

RTTOAllRIUtXPYEIXCCUrnCRXCXOO 

TT t FALTOYA I AOCLP X UCACOQAXJnTDVIg^KKQOESI^EY XALrP O X EHAYK7LSLR 
OF XCSWRXTCYLEX Lm>AX7rFISRXSNLEEX,YNXAX^ECO^ 

CrnUWDliYLTAACVRSIMrryWfHKPSRFUCEI PXOYNXQVR 

CPIU0773 872485 87U95 

ung-Uracil DNA Glycosylase

F»O4ATX00LPVSUCX^LPLCWRE0LX£!MSKinrm 
AIJ^FTIXA'RWXU»DSnrPCXCOAHCI£FSVPEXX 
CCLOSUMCCXUilRVLTAnvCEPFSKACXCV^^ 
RJCKC£XXFNS10^0KAVL5SPHPSPlJUU(RCrFCrSHFSXXr^ 

CPn.0774 873183 87T425 

CT606.1 hypothetical protein 
LEAFKTOIHSVCFQICrPRI.TAXSWSKE3<LLTT^ 

AEHREiRGCi.nynjm.vrsE 

CPn_0775 874040 873414

yggV family

D^IVXASSHCYKXItrnCXTLJO^LCOroXFSI^FI*^^ 

AYfimnOA/SPICEIFlCrYCXCECYXSHOEXGSSGFro 
rCVSHFAKAWKIAPKLOSLFTKHlXTRD 

CPn_0776 874180 875487 

CT605 hypothetical protein

nrVXJOHFY7CIJXFFQFUFTHXXIFY^F\U^CXFTV^ 

CIQGJOUALISHSAAINSRGQDALSVT^rSRXHDCTVZILCTLE 

RYfMLR5VSLY<^nCEVPICEVAEMCDVF\nri7^ 

vxjau>imfOTivixrpuwmGsiJ^ip 

Wfl ttH f # U1IGLIWMPTSPOWPDPQS P FTYAATG I LCALSVAS ICVmfTLPFICVlflAWM 

DGEJCVADIlWRKiaJO^X^FFYXPFTCl^ 

VLXAXYPICQVZXTTWXERXPAJUISSXCKIJXCS 

FHOLRSSCLLSEYAXS 

CPn_0777 875586 877178

groEL_2-heat shock protein-60

AmiLOnATESLGV)OAKEAIilCVVEOT^ 

KAaXL££VEMWOOLOR0AXEL0SPKI)VLHV^ 

SXDSGISJCTRCUaOWKSCYl^PYrVTPJ»ITKI^^ 

LXSEO^PLVXXAESrmNVUrrLXLNKXJtN^^ 

ATXJOOESElCCXPVSXJMiaiVKQfSmiTXE^^ 

SESEOOtLCERLAIFXGSX P^rrAXmTrTORCROFOLESAXJUTICAAiexCCXVPaCOy 
ATUUUHAX BVPANliSCKTraFTTLLOAVRT PlOCVlJ^rcCRSSEEVTOTILSKIllPW' 
G Y MJ Cl' l/ l F LL^VDAGICPPLrVTrSSUCCAN^SCU^TSSrriSSRTTCT 

CPn_0778 877400 878092 

tsa/ahpC-Thio-specific Antioxidant (TSA) Peroxidase
APVAOSDRVPGYEPGGORFESS LVRNNXRVEEEVTKTLS LVGXEA P DFV AQA WNGETCT 
VSXJXX7YlXSKYVVI^FYPf03FTYVCFTEXJiAFOX)AX£ 

ATlCXJCOOCIECITYPU^n^ISRSYKVUCPEEEI^FRCWLIDKaCr XRHLWHDLP 
USUI EEELJmXUa.IFrrTtCLVCPAMWH£GERAMAP^E^ D 

CPn.0779 878502 878095 

CT602 hypothetical protein 

RTOLXFOHXTTVAtiFGEAEKCSYITrAYFCRSLVDLHKYIjCOVSSPGX^ 
VYFYnmXCTrCVDSYFFGLHFIinCmJCNI XA XGLPCVGNQH X XEASRSLCQXKNSLLL 
FFDHOLYOLLTTNOPF 

CPn_0780 879241 878591

papO/amiB-N-Acetylmuramoyl-L-Ala Amidase

HGNKIAVOSLRFTOtWCL^FFILLSLLFSGIDCSPI^AACRSPSU^VlAEIEDISAKXAS 
HEVEXVML5 ERLOE0O5KC0KWTAAK PETLAOK X RELESOOKALAKTLAVLTTSVKOIOT 
NlOSKLOEIOKDHRAI^DLALVRRSLLALVDSrSPGAYAOFSDPVPENIYIVRECOSLS 
K XAKKYKUVTELKK tNKLDSDAI YAGORITLORNXO 

CPn_0781 879851 879199 

pal-Peptidoglycan-Associated Lipoprotein

OI^RSRRKTVPUjCCrPSATOKEKrWtlHSLWKJjCTLLJkUJ^PA^ 
TCHHTRRKX PSSFCFVPLYTEEC FNPNFTFGFfDSKEEK^ YKSSOVAAFRN ITFATDSYT 
t KG EENLA t LTNLVHYMIOXNPKATLY XEX^TTD £P.GAA5*^tILALGAR RANA I K EHLRJCQG t 
SAOflLST t SYGKEH rLffcSCHNELAWOOMRRTEFK IMAR 

CPri_0782 P81077 979771 

roiu pr»iy ujcch.li- iiV t ransporcer 

« ;ni'JKU<OU:FOVFFFt"FAJLVYAE£Lr/ , /VR.':Di tTLPt EV:S?OTOTKDPK {QKYLtZCL 
TF.lP^OIAUIIXrtOFTiVVJKCSSSPtAISLJlLMVPOLST/LLyf^ 

u:vu"jr ( iniAAfmifYALrmi pc cca'jkivfalsslgf WKUcor;EiimrtYu :kni,ap 

[.TTFT r.l ITntWVWiWFrYLYVS YKYHVPF t FLCSLDrrERKKVLPIJtl INUI XITP^ 

rkKri J m vai rr^ »n n.F I oi*n:LTmpiCR pprllnei ircnxwr:;FHrp)n: w.vf rr; 
nki/ j'i n.v im:uMr» vAfRi.LTKKYPN;;rcPAwsptr;rr. r AFt::.vi w nn<Mtc:i yi»j: 

: W/YOI .TT: 'ITNkT;* t ': 1 1RM t .VF " Ai WACESELYL tCLVTKKTNK 1AM VIKKHK 

<'l*ri.(|7Hi MM | ItHH hHtlll'j 

t7r'.*i»: tty|>irhi*i i»'.n pi nr., mm 

tHMKYU *V !A ITAI *l Ih : II 4 A.VKA; :M.PKKRI/^PKAFOEKLVT lOMCfl VtTI * WVt il» 
aktirp: watopokoakcgppoowoicalokp IPV ..TEPPKPSF aptvak kttatexp 

^rTTKfO/TOl-'IJCTQLOTLJ EVAOALSLHVDX IEK3CT3LKKCSWP5TAOLTHHSEIJCAT 

KF LEKYKV3KN I3rH I KLV^NET 

CPn 07*4 M23S7 891972 

exbD-Biopolymer Transport Protein

:..-/;A;.\i-;v ...v:..:.:;^;::Av:r;wu l ::r:.NM'; : -^:.t:.:-ikav; --.r 
n u^^i«;irr, ;i- k:v-. : rvr; ; r.\A ;•--) iku iva*,.*.:: 

CPIU0795 983039 R92296 

exfaB/ to 10- polysaccharide transporter „«»r it 

DHLYFETLSVNKDFY SMVMrSHHP 1 10 AYTEADFFG KS IFrCLULSVCTVTVLKQI^I 

QKNFXJCAGKStJ(DFLIIQWHAW£LDIHPEiSPrAX^ 

PILSSIDIOSIXItieAXNPKYKAlIJaWSFXPAmS^ 

SGSSGNSAIKEOATAIZnTII^FVArPSL^ 

VKYRNTNL 

CPn.0786 883137 88S293 

dsbD/xprA-Thiol:disulfide Interchange Protein

WK7^IJn(naYLQTALIAJ>F7SFPALSCSrSSI0AgEITC0V»WPGA 

OTnUjGIKITASKGSHIWKNPCEICSPLKISWOLPKCPVVEEDWPTPICV 

GYEDSALrVADVRAPECTrTPGOEVrUlAO^ 

POTHAirTKTLHAOPRVUIOTSVOVAOCKCHII I tW XSKXIMATKAWFVSEXAOQJAV 

ACTSYSGCTCTAWRLKVXNLSCVr^ 

ACtOTITILIHArLCSCS^IMPCVLPLVrUCW 

rcr*CLACVA7IUCVU^IGWCn3LOTPMFVATLI IVmJALSSLGLFEHrrKTANLG 
CKLOSSDflCSSNNXAVCArrNC ILATL\mTCTGPrLGSVLCLVMSLSrLQ QU.IfTA lG 
LCKASPYLvTSVFPtoUJvIiPKPGGWMSTF^ 

ujosu*iJ«aj^iiCRwcrrpvspxitORVC^ 

SVNSJSLWFTSLEKI^LJUkOGRiVri/NFTAJn^ 
TLBWmraPGrTEELAJUJBRAW 

CPn.0787 885604 886401 

yabD/ycfH-PHP superfamily (urease/pyrimidinase) hydrolase
. TRROPVDLAflAHVHl^CrjlAFEroiNSVLO^ 
XIRFCH V U*TPP0 DV P0OIE£DYRJJFKAAAHSiqaJUaCP^^ 
QRYUU^SLECEI^VVHCIU»FNDFFRMUX7yVHN 

gwfis i scxvTFicw>LiuxwEXPLEHLLiETOAFr 

VANVKCKFPOELAALAYIOyVLRFLHC 

CPn_0788 886521 887432

sdhC -Succinate Dehydrogenase 

SLVKSLRMSRHE ICPEVSHKKSKYYST FT FRCIKSX*AGIAFTFfICEKtJTWMLASSYFS 
QGKCFVAMVNGFKXI PCUCI IEVACLVLPFLCHAI XGIWLFQGKSNCYSGDGSRPKUtY 
AXWSYTWRWTAWIUJ^IArHVVHLRFIRYPVHVDIHCTTYYAVD 
WCTLTIJttJMTEAS5IEVStUfl)[/y>APAAr I SgiWSYLLTPSACTArtYVVRiaAIgSLFI 
• AIAYTILVIAAAFHGFNGLWTFCCRWJWSLR^ 
YSVA 

CPn.0789 887436 889316 

sdhA-Succinate Dehydrogenase 

OMDB«lcmVGgnACLSAAJtO^AWLSI rra-VSLTKVKlSHSVCACCCINAALNLrPE 
EnJSPYVHAYOTIKCCDFLADOPPVLEMCLAAPRI IKKUMFOCPFKRSPSStlLDVMFG 
CTLYHRTVrCGA5TCOOXMYTLDEOVRJlREHACRV I KRZNHETVRLVTDHSGRACG 1 1 LM 
NUTOmiXIU^DAVIXATOCPGVIFKMSTNSTFCn^A^ 
TAIPCITOQ^RIJSESVRCTCgRVWvTCDSSlffl 
GNLVSIUNGAJUULRVCEAGLGIDGRl^^ 

TVPIO\XFPAVHYSHGCAWVtWPAADO PORDSRFRQKTNI PCCFNJCCESDFOYHGAHRIfiA 
NSLLSCLFAGLVSGDEA5RFI LAFtlASOATSSDFDRALQOEXEENARIXSASCKZMX FVL 
HEEIAJUMVIWVTWWNNRI^ETNDKIJC 
LEIJUAITKCAlXRNEFRGSHYKPErPERDOEJ^^ 
PTUU7YTCSTCXIELTNIPDNTRLFI 

CPn_0790 889279 890103 

sdhB-Succinate Dehydrogenase 
NSRIFLIISWPYRKREHKENI^FILJCIYRGVPGKQYVESF^ 
KRPVNILGEXVNPVVWEOCCLEEVrXSCSILV^^ 
L7KFPLIRELnrtmSlMFT*fl^IOGVrt/AADircCTrc 

rrEACPOI[XOCSDFXCPAAISOARYFtlTYPCDKRSKKRWRAt>CKCGXECCOOAKNCVRV 
CPKXLPLTE5XSAVCREX5KFSLRSLFSALFXKXK 

CPn.0791 893104 890111 

CT590 hypothetical protein

TCLRSSRK I WED ISDRNKYSCYSRG ISHNYLLH PMSRLO I FVFOSLI ANQDONIXEEI F 
CSEDIVl^XAYlrrTAWPLAAKNl^XARKVANYXLAOCCEIDT^ 
USPHRHNEACOREHLU01LKALKENPKLKES X KTLFVPSYST XQNLIRKTXAUIPOT XU 
T I HWQAALTALFTYLRCDVGSC F ATAPA I L I KQ EY PER FLKDLNO LI SSCKL5R IVNQR 
EIAWIia^CCICELFKPLRILOLYPD^LVKLSSSPGUCKAFSAANLIETLCOSEAOIOO 
LLSKOYLHOKLONVHETLTANDI IKSTLLHYYQL0E5TVRAI FFKECLFSKECVAFSTOH 
PP.EI^IORvYOTLHAYEEAKSAFIHDTONPLLKAWEt^ 
WKSEDPHSLVSLvTHFVEEEvTNIRILV^OCECTYH 

uTMDHMRFT^El>D<ALYD^SAOEKAKKFLHLPEFLt^PYTKOIPLYFRSSYDAFIOEFA 
HLYANAPACFRtLFTHCRTHPflTVSPXYSXNEFtRFl^EFTTSTESEtXCXHAVXHLEKE 
TGRLVWIITAHUnWFCEAlXTR ILEAYOLPVPPS lUIHLOOWrPWVYVSOCTVDTL 
LLDY FE^£I»LTLTEKHPEWPHELAAFYA0A1JCDLPTC I KSYLEECSHd LLSSSPTHVFS 
( I ACS PLF?.EAWWDWSYTWUtDVWVT(0HODrLQDT I LPOLS I YAFI ENFCNXYALCHV 
■/1IDFHDFC-DHSLTLPEL\*DKCSRFLSSLFTKDKTVAL C Y I RRLLYLMVREVPYVSEOOL 

iv/iDNVzzrfuz ityekfrsl i eet ipkmtlujoaolrh i ykollhusyok t ytee 

^LnLTTAMniriINLAVrAPLLFADrJA*P SI YFCF t Lf irCTTEXDLWKFNYACLCCOPU) 
t! t OELFAT::nrWTI.YANP I OYtMPPPPCYRSRLPKEFF 

•:in_U7'ii »**MV5S tJMlOH 

■ .T'.h** hy ( t »t in»t. u.%* I pr oce i n 

MIHT.I H Ir'JfRt KKl PTFTKRVLFFFFLV I P I PLUJ«i4VWFF::K::AAKANLV0VUrrRA 
TtlL.1l enyKLT IUKLFL^LAim^LKSYAUPaAEPYAOAYNl->1MAU;NTOrJLCLIOP 
KIV33VRTFH P< *DPF I RYLKOHPEHXKKLSAAVUKAFI.LT t l*:KPU.l!YL I LVEOVASWDS 
rTrKLLVriWPMJ'KWKOLTOSUItTKCN^ 

\<JFQM2KM E ICKAiVtUiCENL (TVS INKKRYLGLVr^K I P I lXTYTL;;LVPV50L I 

y■;AI^PUlteFFwtJ^FUi^wwtrcKt^mc^^ 



TF.130HLJlRR0L3CMITWr^£X ^CTTJJZ : ISLAi X ML. • ,^\-,AP,.LF>A/ASSp 
VTJLOKISKDTAD^^ f T tf^VVAKTCT A'EJQ^XT 

RLPLETHOALOPCOPXTr-TXEDI WTFWLP I EXLDCTPt-MFlJfl l*L!t2>LiMHLNN 
ETEHSADCTLT I LSFTT 

CPn 0771 o-ikhjh 4*?4H'» 

rDsil-einiM re-j-ti it-«rv t imilv pr*rt^in-PF:c cnospnatase IRSBH 

•» * ••«.*:.. " : 

WTLTO IVPLNWVLCU^CVI^LQAJ 1 1 ETf NVL^NEMtfKVFOG I YNE X3HKVFWCD 

KIVVASSIPEHUIEWNKKlDIPKNTPFLAAiJCOSPIO^EVrr^AWV^^ 

LYTTrSACIXKCLUraCOraTVKTAtLSKYOT 

riJIIlWCPIDSEXjCPLTtJPl^IGENFYSFKXKIJTEIKJCXENWSIDIAVlSYAItKCK 
FAPLWRRARMYTAYFFCXLLCSLXAF XVARRX^LP XRKLATAMX ESRXNKNCLYTDOSIC 
FEIlttK^Xn«AMN^UCKOO>tl^aiffI}OXIKA^ E 
UUCAYXPAXTVOWFtVFVVCEXSXAIlX.FLXVACASCKCVNAC^ 
SSSLWIOCTSKJTJWTIOVSOffVrLC/YCY*^^ 
WLFHPCMAl^FtPEVANITSKLFMPKPCSLJ^YStJCXTEAHNWiNOKF 
LTC3CSAADAVMRli(I^VKTruCNSH0KOO XTti.1 LXVLE5 

CPn.0794 897123 898004 

No robust homolog present in Genebank/EMBL as of 11/7/98

KSSlQOlSflXiaCSaOCWYOICMilNSOIJCKSLCYSW 

GLDPSVECLSGDGVSVCYFTKACSTTVtYW 

YTCTlTMrrCSLCACICr>CAJCHSADCTt.TPLTG rtCGTSKTEARAISXCTOy I ECFSYOA 
SGOPKAVCWASCATTVTOLADISCCSRSSYAYAIiCDCT I IVSSKEST XTRJCTTAVKWVN 
NVPTfl^TLOCOASTCLY I SCDCTVrVCAANTATVTTCNO ESKAYMYXDfOOtD 

CPn_0795 898008 899195

No robust homolog present in Genebank/EMBL as of 11/7/98

ffTLOCAl^ATC^SICSVrVCOAOTAiaSVHArOYYNCEKKC 

aC\roCK»IAXX^WHAFHOTTDFSS»ft^ 

ASWmVCIuWIALAACLYVHALOIIXJSNLAAa^ 

VPNHPIfVSfOffu^NW^XGICOSXlAl^ 

lOVAAOIECRYCKSL0CH\nrWPn^0FVHI^^ 

CXGSHIJuVVDSLHVCnOOffiONTAAHTDRFSCSXASIGNFW 

YtUWSU^IUlVyJCOPWCVMCFSSOLRYAljCF 

CPn_0796 899280 901340 

no robust homolog present in Genebank/EMBL as of 11/7/98
SEt^SSYUPCLNMSXVIu^SAX^LPCURSETFXlCVRSKMXFN^ 
U.TAXPGSFAHTLVDX A<ffPIWAAOATCVSGOCK W IQOWPIJOPFAITVCFOTIXXBIIX) 
PXXAVRKJCSVVPICnPOCnVIVtTrNYAICatWAViei^^ 

vsAxxatvxaa^xMXjauWAviavEDi^TOLPsi^DA^^ rvowvxjv 

SWWffAVOWIODQLSVICTLOCTTSVASA ISTDCTTVIVDCSENADSOTHAYAYnOVMSD 

DCICV7VCPJWSC0UHAFX£FF0APSPAPVIKX2ST^A^^ 

QCnjQRIXIOHSAICVCSVSSSA^SFTSViaMXSKQSPAVONS^ 

OOLLtCAfWaXJtfAmCGFXVNJffCS^ 

RYTTOjCrTVVLQPFManW4L5RXSYSEXNVRFFVSTO 

POtSTAATLCVPqJLMSHIIiOTCSVSAHaJr^^ 

TLSWMNQQPLTCTLSLVS05SYML5F 

CPn.0797 901552 902694 

No robust homolog present in Genebank/EMBL as of 11/7/98
vLXX.wxi^TKXjCL»(siaciic\ajCHLnrrt^ 

EEWKOYTFT DLELLSKECWS EAHAVSCNGSR I VGASGACOCSVTAV IVn^HLXJOttJOTLO 

ciAssAtaisimavvvwsrjmrrriwvra^^ x 

VGVSATARCEDYGUQVCnyiCWCXCKXXOIJCXXPO^ 
VAVKHMQUWSlCTXa;5VASA£AXSANGKVXW 
OOCFSVATC^SAIX^XVCFSAVICTCEXHA^rrAmEMEOLTTl^ 
X ICSirTDACAERAYLFKXHX 

CPn_0798 902810 903856 

no robust homolog present in Genebank/EMBL as of 11/7/98

\A^EIIFWRVPMXKTCCOWRSIC\A^SWL^ 

SCURVVVCYEIKrUFICYVrJCaCFXlJCLVPR5EAL\T^ 

KWVNSAX.VDLCI FSECMOSFAESVSSCCICriVCCLYSrXTETNFAVKWDETCMVVLFKLP 
EXWHSCAWDASZTJCSVrVGOAMGSEEIAKAVYVOCDCEOHLl^NIPCAXRSSAKAVSIQJCS 
FXVGETISErNEVMATVY»CVIKOIGTUXBYSVATCVSRDCICVI\^ 
KYVDCRMIDLCTLCCSASFAFXr/SDDCKTIVCKFETELCECKAFIYLDO 

CPn_0799 905001 903940 

No robust homolog present in Genebank/EMBL as of 11/7/98
KREEmAAIKOILRSMLSOSJUGf/LFSLYSLSCr^ 

TTLSRLSNKKASAKAVSCTCATTVCF I KDT^PTYAVRWVWGTKELPTSSWVIUtSKATC 

XSSDCSI lAGIVENXL50^FAVTVflQ>MEKYLLPSTWAVCSKA\^ISSIC5VXVCSAKDAW 

SRTFAV*WTCHEACVIPVCWAVKSVANSVSANGS X rVCS^DASCILYAVKWECNTITHL 

CTLOCTSAIAKAVSNNOTIVCR3ETrrcEVHAFCHKNG'/MSDUrn^ 

CKV I VCMSTTANCKLHAFKYVCCRH I DLCEYSWKEACAJ IAVS IDC E I IVCVOSE 

CPn.OUOO 906550 90524'i 

eno-Enolase 

RK EI K I MFEAV I ADXQARE Z LOS RC/ PTUA/ir/TT3TG."'/^CARVPSf*ASTCKKEAl*CFR 
0TD2PRY0CKCVUJAVKNVKEI LF PLVKCC5V/EOSLI WLWlDSDCirNKETLCANAXL 
'TVJLATAIIAAAATLRRPLYRYL0CCFACSLPCPM>4NLtfl7^HADNGLEF0EFMIRPICA 
I KEAVNKVUTVFHTLKKLLHERCLTrCWDEWAWILASNEEAtiU^ 

pnKDi:;LALa:AAi;::nwKTxrrYonpiiYEEC XAiL.'srL'.wr i d.*; i exxilaeeoyxjcw 

ALLTEVU IEKVO f Vi HMX .FVPNPEL I LEG I St KAMI: .VI . I K l*Nv,' [ ' .Tl .TETVYA t K LAQM 

AirrrT t r :*i ir: va Tnrr i adlavafhago iKnSLJf'" fj^vakynmi xf. i fjieloseax 
FTlx»r/K.:Ym:Kf: 

*:pii_'ihih -uxnvt 'ti)i;7i7 

•IV ill Kkmiui- !<•.»:..' Alt* !*.Ut>irt ( I II 

1 1 vTMTKVUiAirAi\i:ivPtv\iAPU^VR>K//KTOvr.^;ri\iy:n''-Ti wiwanvtil 

(•n.VI^IINKTLAjVI.YOmtKfTr-ffflAVEYFl SV70YYO**EAY I AIMlfrY I l»::i J.INDE 
I UKUMJyVTK:: I IMiW rr\ . t V; :::V.*V: I VO I a-^ENYT-*MALVI XV IK KVIHN I t.TAOLVK 

hi iy*ja::p t r\jH; mkhki« t *a- u i i kI'AYF^:^x/»LJi LEFU iltlt:: li!Y:ffiii.TMi licewv 
::ATi.YPr:::iiYVi ft,MHfVAnmoFJ3J<K»*AFFDDP.l' iektk i nurrriiM miKKtr; 

Ki Tt( J I KNY :*.HI I IA I ■ 1 1 »n T« .'t .U *Y KPKUF t .t. 1 1 OK H r/T LI V I MAM Vt* ;| ;nK*j::i. 
vr^FRU'-AFWWPLTYEEAOKYFRr/tYVSAT. ,-evocsscH rvwx irptc r POP 

MF-ErRFAT lUVDDLLEEXR LRLJOXH EX t LVT5 XTXKLACCHAGFL3ELEX PAAYLHSCI 

ETA ERTC I -TOLR.'TTV I DVL rGVNLUlECLOLPEVSLVA I LflADXEC F LR3TSS U OFCC 

RAAiWtfTKVIFYAOjrrRJIEITUlITEJlRKOrCLOYWEHNrVPKPnKAirANPIU; 

TSKX^PKESCRPUKEDt^EOimEAIiCRAAKEFRrNEA^ 

F 



■:\iTwrf!c:- s ; : r::*Lni.-a w*::r:T.i £L\4z:ri7fZ< :' " : :.\rjr 

r RKEEVLDVDNH IY EVUDWL3VCIOPTKSI XYLOSAXPEIYXLHLLTSMUSINRVMCX 
PSUCIMAANA£XE£C$l£Ya.XGmtASADXUJ^^ 

KRLYOCVTPE PEVIOCELTSLVC IOCQCKKSKSAJWAI YLSOSOAT ITrXVRKMYTDPNR 
IRATTPCRVEOfPLTI YHPX PNPWEDEVttJFKAJlYROGCIIOMEVKARX^EELIHFlJtPX 
KERRSEFI^KPUU/3hn/LEOCTHXKREVAXVTHCEVT4DW^ 

CPn.O8O0 910306 909752 

CT584 hypothetical protein

FMAAimcaSIJDMVrLtXECMJttXFATPlCYTT F REJQNWTTCANSQQEXANrFFEH 
LXNCKLTOELAPC0X0AAH5LXA£PfO<PIRVAJCDIKERCEFXNrXTSIKLT^^ 
RIJkRVtXMEFU-KTDVOMrCHL X RHUAWJXAQKNPVC EXNLOEI OTEJTSLKWnJEI. 
TKAW 

CPn_0804 911074 910310

gp6D-CHLTR Plasmid Paralog

EIFSSHOJlJCTLUJRFKXMTTlCMEAlJWOra 
LWYNFREQIEEPtXTOLCT^AEVKOIKKQSVtAKOT 

LTlfGNRQTPY^LVYYElJTLLPEPUCIEMEKMPROAVYTXASROGPOCKKEII XROTRG 
niKSraLDRIRKEFPLVETTXnWSPVKOAl-^TXC XLEKLXK 
KLEXVKSNLFPCTKV 

CPn_0805 911646 911067

minD-chromosome partitioning ATPase-CHLTR plasmid protein GP5D

YDSIJWVLOGOCEIOEVIRPIODTOIXLIPAiyTVl^ 
V0DXYBYVIIDrPPSl/3fcTE£ALLAADYAUCATPEr* 

T I IGVAI^FWNCRGXJWSAFAZL X HKTFPCIQi^^X IRRD rTVSEAAX KCKPVFATSPSA 
RASEDYTKLTXELLXLLRDI 

CPn_0806 913016 911867

thrS-Threonyl tRNA Synthetase
HANNESPPNMEAU»CMXOVrrCX«XNYE\^CTTAA£LM 

THX/IEGXTILVTLTSEDPESREX FTJn , SAHIJAOAVI*RLWPDAI PTXGPVXDHCFYYtJTAN 

X^XSESDFPLIEimnCOIVDEXlWUSJiriYSDXOOAW 

AYS0CEFFDLCRCPKLPSTAH\nCAP1CVXJCrSAAYHRC0 

EILTPOtJe«aLWEVSGHtfDNYKA*i4YTLOIDD^ 

PLRVAEVGHVOTQEASGWJGtJOTVRAFK^ 

GLEyKLELSTRPERXnXCDDSIMEXJ^TDAUaUU^SCTPFXV^ 

DAioRrruoarrxouMxpERrafyTTAacncsvpv^^ 

RFPUfLSPEOVRX XTVAQRH X PRAKSLEEAWXJILGLVVIUX&SESVSlOJCXKliAQMtTVN 
YMrriX3WEXNE}'lVIAVRTRI91RVTllDVSVERnirrX LE8QtS££LTAXX 

CPn_0807 913950 914879

CT580 hypothetical protein

TLt7TGLHMSLFLVTLTA7IWSSSPAI^VI«4ASA^XrATGAI^ 

FVCISlOCXFLYIVljyunSFYLTNXFmGWSISSSKrCFir 

KVTIJWVLGI^LGLVSYICYLTFGa^DS^ 

lEXOSTl^VTAINAYA^AaC^XHHSAWEPWRPLPVODISOrLYATUkLWISHLIC 

m.YAKUJUCYSSTFl^FCNLVWLYSGr^ 

EEFRQCYXVS 

CPn_0808 916396 914956

CT579 hypothetical protein

LKIOJ«WALKSLXRMP0SAEP5LWXKPtXFlCCACXAMrS0V5CSSS^ 
CTCAGNAQSCHDTIOWTXQGAOAEVAAOCynSLXQiaASAQSTCiqCEATS b 1 1 RSSM EXSE 
KSQCSKSSTSVASASETATAGAVQCPKOJIGSJ^ 

llCLVMTLMANAAGESWKASFOSONOAXRSOVESAFAjXGEAI KROANKQASATEAOAKOS 
LX SC IWI^rrVSVCAGXPSAAKGATSAXJCSASFAKETCASAACGAASKALTSASSS^ 
OTHASTAKAATTAASSACSAATWUUWLTDt»lAA^ 
WS DCVSROOIVNnCTOCARVASFAGNAI^S 

RI^COAI^AEVUCOHSSVYCOOACOACOLOEOAHOSFrn-AXXTUNIAOSOTOT 
N 

CPn.0809 917794 916307 

CT578 hypothetical protein

DTNMS I3SSSC PDNOKNI MS0VLTSTP0CVP0CDXLSCNETKO XOQTROCXKTEHESOAT 
IACA^KDKToSTTKTETAF^OGVAAGKESSESCKACAITIWSCAAATTASin'ATKIAW 
TS I EEASK5ME5Tl^WSl*AA0KKEVEAV\a7VALSGIW 

EV I E IGLALAKAXCTLGEATKSALSMY ASTQAQAtXTTNKLCLEXOA IKI OKEREEYOEMK 
AAEOXSKDUXmiOrVOTVMIAVSVAXTV^ 

ACAAAATTVATC ITVO AWQAVKQAV XTA VRQAI ?AA XKAAVKSC Z KAF I K7LVKA X AKA 
ISmXSKVFAKGTCMXAJWFPKLSmSSLTSKWVTVr^ 

UONVAOFUKEVCXLOAAADH XSMFTOFWQOASKIASKOTGESNEKTOKATKLCAOI UCAY 
AAISCAIACAHKTNNF 

CPn.Om \U8193 *17125 

CT577 hypothetical protein

OElttIKKIttKTKKA\X)RKAAFVKRVPE£30EAAICQLEl^ 

Ny ttCt I AALJ*ITLEr;Ult.EELTOCLFr:7A0ErANFAK E ICLKNLTTVVNKQWVIC 
CAE 

<:PTi_nuii ** i^Mj us20i» 

lurll*Li<w •".( Ut*r.prHii;»» fT(ir.»Mit H 

IOWIFI U : 1 U: MKIttHfNANOPvK MAi; W^KKTI^!;p.LA^aJWOKKAXADly^.^7JVllt*VPT 
KF£I Y.r ,\U IN I Y\\ a S.UI iL'LOy I UU!DYUJXI YTVAYTFY: V ■.KYNEAV'iLFOUUA 

wwrwi* a i i.*krvi tyi jilvneaafoffuvkoaoponp i ppyv i AP-.itxKLwr'CE.'JN 

HFI ^ »JNI'KFKf1.KISRi.VrMKO:: tEKOHV JETK K Al TKKl'iV IK; JFTTTNKK^JRK 



c:LXCWLCNL?KAPK7T1WPX0tXOn.TXNCIAACEV^ 

E I ETtSCGOGAX rXROFCOUFRAEOI 9 ^52?jy , ' , ' , ^2r^2^^SSE^K5»«iIi 

AsrjncxErossxecoecintTvrHOGW 

OSDRl£IRKLIE>WIl^ANr(>^ISECKHEICr^ 

OKEANGVR I '^/L=iP3FHRPTRCC0K I F IN3R P I ^LF££K>aT^YA^PLHRYPVT 
Vl^YLPSSU3>FWHPOKIEARIUEIXVTjr^ 

.TVPLPMFPML ETpir^ EE ESVEFtCN L F A YS3 EPV-TLEXC rYT^^ PfCSWOrfl YSSOVP 

v_ i* -jui. : ■ * ■ : "'.***" **".*■ ■ ••*" *"' : -« ;mue**c r 

CPn.0813 9WB43 «l«4 

pepP-Aminopeptidase P

TLILWCXWKSHDRILRAORAl^EJtNLIMILVEKiE^ 

FVYRKDI03LYSHX0IWL.Tri.TODWAX«^YVCK0llVCKICrDSAST^^ 

(XWEPI^CrrEKIRSIKSEEEIRRHDEAAAU^ACY^XTlXR^ 

A£MSA£SPSFPPX XAFCEHSAFPHS IPTPRPIJQCCCXV > .IDIC\XLNL1UUJH1RM1ALC 

TPHPKXXCSYTrnA/CAOKRAMALClUXMJUCDXO^ 

HXHEYPCSPRGSQVXtXSCHTXTVETGVYrrciOGIRXEITTl^I 

LVCL 

CPH.0B14 931996 923357 

CT8U.1 hypothetical protein 

FTLFFXX^YNriF!^LTMYOU^ICYSFVSFIAII>#4IXYSPKr^^ 

CYIAAIJCEJn/YUATNSSRGPVYSFSPKGVPT^ 

XSXPRDCETX^U^AtnCXXO^ACFinmASFPVKCyiRRXCn^^ 

TlCCnVDFVSSOC£NYSirnAVCDVLI>D(aCWQ^^^ 

DUMVaOTORQTXSLVXX^5PXUNEVXRZXErimR5WSXPX^ 

LRTAJ05WEXURAinXODYVTGX\ , TCTLLVFK£]t^^ 

PIJtOCF EPAVASOEVSSKTRSAAAH PCATNROCS 

CPn_0815 923361 925622

gspD/pilQ-Gen. Secretion Protein D

MVrntNSIXKLVAI-SGKLCCSSCVALT IAEIQ^ASX^EHSCRCAIX^ECMASFliaiMXJnrSL 

WW(PETTTYNLVTIJYCTEDSXYTJPC£IIlAJKXA7^^JCF^ 

XNirrTHvi^XACRvwtrcsAGEvccxxKrwFvo^ 

f UU^TRXI^TTa7VSE*3LCLJtVVPU^fCCR^ 
TVFVYIWKHSDPOEXJkALL^OVHDVTSCT^^ 
GSVKYCNFXADSICTCTUMVVEXCVLPRIQMJJaaXVPiao^ 

CLNiiAx^svciaoacspsvsHAaaTCXLEFiJiacsTcss 

WkgPWVTMhOTPAJtIAV\ma <5IAVS SPKXXA0YmikQYC INXMPVZNVCElXXaCSY 
ITUTUITF LTI^UIQIHODIIPIVTWWITNKVRXAXICETVX ICCLR C KQKjaSHDOI PfUO 
DIPCICXIJOCSTSDSLTBgVFXTPKXLPJPVFQQFRirrfATT SCTPCEMPfTQAlA 
ASEAAARAAHlOCLIKFPASGVSL50VER0EYtX3C 

CPn_0816 925600 927102

gspE-Gen. Secretion Protein E

RCOnKAASILSOEIiDILPYTFlJOaCL^ 

KPVRFVUCEES£XL01UAlQLYSl«fiC3IVStML£J^^ 

UMWXtJCEAXEEIU^XHFElCISSKRXRYIlXOCVLHORMSPPSHlJt^^ 

DIAIKIUJCXX^XKXKXGCQEVIMRVSTVTVXYCEKVVIJI 

EXLTKPriTAPESIlXV l^^bGAri 1 L YSVI^EIJCCPLT^X>friEPPPCYiaJOIAQI 

AVXPKXCLTFAJ«nJlHXXROOPDXLMVCEIRIX}ETAEXAX0AALTCHIAn^^ 

AXPAIJXK3XE5YII^ATLVCWAORLVRTICinrClCVAYTPCl«E» 

YROOCCVHCFRSCnrKCROGXYEFtJlPNTUllSWA^P^ 

AlAVSCrrrtAEVLRVTICKCD 

CPn_0817 927106 928287

gspF-Gen. Secretion Protein F

GGR)4PRYRYTY1^PKERJUCRGYI£ALH XOEARDaJWENIOVLDIREVALRRKSIKSTEL 
FOTFYCSCVAACESVGNIXOCLONXXVW^ERAOXTXK^^ 

LUT/IPStJCETFEWEVXCLTXIVFCVSDCl^AYRYI^LCFASAL PUKK 
. llXKIXFAX,PCT)CKIVVKVAVNRFCSVASMlJCOOCTLXECLDt^ 
XVOAVXOOCULSO£LAORSWVPKLAXCMXAX£EESSDUkXW^ 
TSWCOPVJLIFLCCilCVIMWILIPLTSmOTL 

CPn_0818 928158 928682 

predicted OMP [leader (16) peptide]
CYTXNVCFDWVW5TTIDSDFSWPDRCOHVCNIDPTHK0 

S ITL IEMKW ITL XC X IOGAJLAFW4RC3 X K XCXVFQS EONCAKVYO Z LMKEYAT0SS5LK 
E X lAHKETWEEASWC K ECRKXXXDAWCEOL IVQUJCKS DOLV I FSKRVQSSNKX 

CPn_0819 929117 928956

CT568 hypothetical protein

ASLYT/CLFLIWEKFMNNIGKANFHLKI ITTCFLTDIYIVTIRDPIAYPLTCIC 

CPn.0820 929042 929659 

CT567 hypothetical protein

DESLPCRCOtXT FPR.-S ETC? IRTEHPMCNS IAHKKOXRGFVUlEXXilSFTLXAXXlJQTLO 
FW\'RK I YTVOK0KERI YNP/ 1 EESRAYKQljn'LFSKSLSSSYEEPCCLFSLIFDRCVYRO 
PKLACAVRASU I HCTKDQRLELR XCN IKOOSYFETORIXCHVTHWLSFORHPOPCKXPE 
T I ALT ITnEPKAYPFRTLTYCFAVGK 

CPn.O 8 2 I * r.!*.«i; "i 7 V 1 0 h *.H 

tTSr.», hyiv»rri.»t i,Mi iiiur.nu) 

i rrNiitU>iK ft* j »*k t FTrajTLT;;! ,v:: t .vafdaamar krcacact r etvtkhfk.t i krsaca 

E IEYOKK;milA;^\l t*R t:;KI)Xr;RWrF.OIAKVATKKKORYftLUjVPFf;RritMSRYNLYA 

t.u-Err-Ei^::rrrA:nift-A i v i nLUtHAr/tmsf/i'prwrM ianal oikoeilergaou; 

I'l V t KTt.TLI mWE t FYKKliti SXttXttLUIKLI r/EFX*:LniU rKLf Jt, I ^740r*lXX>EAVL 
IHH*I»AYniTr::U .UU: I WKAVK^HIa 1 '^K?jA/JVt,El.FrrRTDFt(l.D.IU4CM^LLL^ftY 

in j.i'M JiKKMHrtTf *j;wjiyi j*i .vi 4 -t/iK a l : : p« .Tw ' r.*: k: : r kl 

iT«t<'» tty|\<i Ik'I i>mI «i 

kki* I ( vl i:.T t kn ( :: n :i*TMAi*in*i r.vi t :::Ki:';r^WFfv:i j'rkvxoui: wiw wmkfi, 
: niHAf .*kai*> * -i .vi w *. 1 1 Aiif* i.MAi y :i,t-*iAi ** .-/•/! < u-iive I rkmij;ni ak :y: : l a»«i» i k 






NA ILO 7L I llTVt-N XPJIFAVT.F MjCV ILSFtTT. -STC3KDHC0KH0DTSKKPS 

f?^n_0«l 'H24J4 DlSOt 

v8CT/3p«jf - Y^nT Tran location T 

fyatjovrfsict:; r nt^kelmc r clpeitsnlgsayloy i foh ppayvwsvfluxarllp 

CFAVAPFL&kKLFP3PlKXCX21.3fcfUkXXFPlCYlJu7TO 

viciwvrprYAAOSACsrrrNooctocucATSL r s ieqts phcxlyhyfvtx x fwlvc 

"»HR rvt.ULUrrLFVTPIHSPPPAOVtSLSAP rrflTHtXHCQLCLVKTlOLSAPAALAKL 

; -on. m \f w-.v ' ' . *■■«: /Ama ■•: ?:-v.«wr r :so r i v'Tt rf wrv;r.T 
» 3.1 *.-■■*■• 

CPn_0824 932677 932378 

yscS/fliQ-YopS/fliQ Translocation Protein
IRTRAVtJtfFATSntSVlJTrcYQSLLLX 
PAFAVKLWXFCTLMISOGULSMfXUtFACQ X FONFYKWK 

CPn_0825 933618 932677

yscR-Yop Translocation R

VAASSVPSYKPPLNADDl/XPIUWLSDCSFSXnTrro^ 
KIIITLVUJWAU^OOTPPSOVUCIAi,ILSrYW^ 
rrAJEAETWAIinCSXEPIJlSFLIRWrPIC^ 
IPAFr^IKNAFDGVLrrLPmrDLVTANVXV^^ 
•GWTLLLOCLKISFK 

CPn.0826 934382 933612 

yscL-Yop Translocation L 
KDNXRSGvTSSrVNOPOKYYMVW^^ 

KTKADS EAYVAETEQXCAQ IRQSAKIXJGF K£ CSESWSKQ lATLEEmULRI HVREALVP 
LAJASVIUmCKELEIJlPETIVSIISOALlCELTQrTO^ 

VtYADSLILTAX PDVTP GCCI XEX£ACZXIUOUM3LQAU3U^5TIlJCAXNPVDEPSCr 
S55TDS5SLSNDQ0KKE 

CPn.0827 933273 934434 

CT560 hypothetical protein 

OCLVTA MFO r L DILKKHSXEDDLSRn.PKNLLVESPHPEEI PLKSISFTMSWLPTI HPS 

WITIAMKEFPPEI(XaLLJU^EPLVCEIU>XXPGISIAPm^ 

?OGITEXIFI,PASSANMLYYTCPWIAI£NCI^YSIJU^^ 

TEKLFLTYCQSHPIQGfLEXTNFLSSWTTDAQJtQFVHIW 

RiUiM^YXVSQTXXTWrI}HPYVDmCSRXX0QOCVLVK 

CPn.0828 936292 93S267 

yscJ-Yop Translocation J
DcinrAimfVRiursFCurijcrLLCCT^CNSi^ 

LPOAAAATASAATBQHHDXAVPSAO XTE&IAI UOACLJ > RMXGTSLLDLFAKOCSt<V?SEL 
OEXmOEO£EC**ASTXJroCXWDASVOISFTTENE^ 

IKVSWnU^IA^VFGLVFENTOVVSIIRAAYOT^ tXJUCSS 
LTXTTtLXFTO 

CPn_0829 936729 937298

No robust homolog present in Genebank/EMBL as of 11/7/98

KYXCFVFTLAlC$FYXNXIU>SRFYSWl£FIHKETYY1WFt^^ 

FTUWAKWPLVPAGYTWVRGKDFVI^PLV^ 

CIEWSTLreXCRFCAVtCVro FSCS 'ff OIC IYro 

PVWrtFKCYS 

CPn.0830 937339 937959 

No robust homolog present in Genebank/EMBL as of 11/7/98

DSCSFUJCFEVEACrFPOVFSKVAAnfKYK^ 

GRVtUIYQNEEEFFRATERFVSXCMCYtJtVroO 

KVTTKLrYWSVLESIPVVGAFTFSIGRIJAHWCrEDFra 

tKFILRXXFTLLTLPFWLXSCLKSSAA 

CPn.0831 938249 938434 

No robust homolog present in Genebank/EMBL as of 11/7/98
NKRJQWVLIRKSESECA7FEAT0NYPTI0QCYQLV? IREHNLSVRAHFDLSLSLDASVHP 

AA 

CPn_0832 939750 938827

lipA-Lipoate Synthetase

VHXCRPTUJTOOPRVTUCKLPniFPKWLORPLPQCSAFKATOATIKRSCMPTVCEEAl^ 
RAETORCTATYLALCaTVCTTlSCOrTW 

MVARDDL EDCGAOCLVD I I0KLREELPOATTE%TASDF0GNVSALHTLLDSC ITIYNHNV 
ETVARLSPLVRHKATYARSMFMLEOAAMYLPDLK I X5CIMVGLCEMECEVK0TL0OLASI 
CVRIOTIQOYLRPSRKKUJVKSYVTPErrTT/YRKV^EAi^^ 
SVQDKASA 

CPn_0833 941171 939747

lpdA-Lipoamide Dehydrogenase

RGVLFE I LI TVSEJtfrOEFCCWIGACPSC YVAAITAAQSKLRTALI EEO0AOC7CLNR0 
" * PSKALXACANWSM I KHAEQFC t KVOCYTI DYPAMAKRKNTWQC I ROGLEGLX RSNK 
tTVUCOTCSU^STEVWIGODTm 

EVLPKKLAI IC«WrCCEFA5LFHTU^ITVIEAIXHriAWNICEVS0TV^ 

R 1 H** 3 ISAI CCSCfCVR rTVNOOVE EFDYV LVA I TP.QFNTAS ICLONACV I RDDRCV I 

PVDETMRTWPNIYAICDiroCVftXJUfVAJ 

EIAMVCL^EAEO}NU»AirXTKFPFlUUCIW^ 

PI IA33L ICEMTLA I RNELTLTC IYETVHAH PTLJEVWAECALLATNHPLHFPPK3 

• ThJlHM '41544 *MJQU 

• TV,t. I >vi ■ ir tttf t_ifr. il mi or -rin 

B 1 1 ttPKAKCTEMORTCWKi :Kn.-"V:X»Vn.t.*PYC::AFLODPPVA:7X TPS/XTH t SFPEGASK 
KFJU-XrLFAV^EMAVU il**J»NPTQCTNXOVT PEWTMLO-WLAALFU: ICLLAFAFL I L 
I >;rru> UArLTWIKIlRAYKY* U It iAAVAY RCTi'RK -I- i. 

'■lnJiHfi M'ju'tK **4^04^ 

itf>i i i. unity hGlti.jtfu 

IINtMVftKAIJl(PP«WVVH^ItXJ(KRKEIWrjF r 'Er;M(TIH[ l1^FJVm:YWL£TUCLGDIO 
I'l .TPAJ? S J .fir tfj tHJUXTAY FAVYDAI/ KilrUlbPITf ll;'JW/AVF!»IFFU3S t PLQAO 

• :uivm.£:ri! itlt i w ikevfoowi jit tiiA^eerrvr tnkti*i.k:^lyrtaxkfffl 

NK» lAKt.T l* '4»: ViKrtaiFJILOWJLVFKAElLrFITI.FJl r FI*Kl.ELAHTSLENVJMO I 
■:[TWTV^AEEAKWrTl^ /KDRENHPICTnif^VEYVAJrniEMrXPKACAU'IYA 
t PLLAPKriCOOLtaLlCYOSLEYRLR YO I RLLREVtfFSFP AXXVTPTCIBNGSL , l YPWYC 

YSFTKciHOvvcsc u g i tty B iv i^tgr i Eai L wtucauwu^tfg 8 HF tMrca t grVT 

CQCrVIXFHYDVGOPSJTEIRTTr W! T V M ?1 ULF r^LiyW^PICrCLrVtPODIPMTVPi 
OAALIOUPNFFSSPPKtJCDXilEVHRCSRCICCE^LKPtLV^ 

TS LIPTPLOCUCFtPRVI PPENVPOFLTOYAOMER tLF PNPCTRPPESYavICSIKRPH 
PAS PLHL0LHJCTKLC3VP tC : ALOCUCSKHTrLFTCACFUUCONUrjrUCOfXSTOKC 
VTAENT\'lANITt7VrncU)ALAPL3VTrrTrANPEI)U?FFSOUC^ 

u., -^-'^*?wf: .^aii- ' ■:: v- a^t;v:\:.i:-;vr ; .;^rvr^rr.; 
t^ac:-^;: ;'7.v*r:" ,; :— •n.;-!'.v*:* :r.r T^TJ^'-.-^^Fr^•x:A^ , "-* 

VTFOEIHHAKNK^ IKKiLCK IUAl^UILTwTPIEIM^^ 

KKUTKRCS5CELESI XPSQCLLLKLTRP F X LJUCTKKLVLPELKJKVES X XACSLSVO0E 
KLYKATLOReXSHXOXLETPEEPATNFLH X FALtUHLKO rCWPAVFTKWOrjYsXYtSG 
KWMAPVKLUCESLNACYlCVr^SOYXHMXRX:TLYLX£XCXl^ 
TTOPeCOVFVCSX^AACTCXNLTAS^ 

PVDAPVES0TCVLPPEDS 

CPn.0836 946960 945722 

brnQ-Amino Acid (Branched) Transport

KMXXNASKKTNDKXSLS WS IOCS I FAKTFGACNIWPLALCryWNANPKSAYPGMNLTA 

VCWIiCLVSKIJTSCDYQIfJTFSICKrPGMIFITAIItilGPrCCIPR^ . 

[^OOCSArXPSLPXFSAXCCVLXYXFSCXURLXOWLCSVFFPXhXVn^ 

THPKVOEf IPNARQAWlJSCriECmmO^JUUTFCSIVLISL^ - 

SFOGISTOnCRSlJkLCFfXAWU^HTYLC^VLSAAJOW 

PMSILACVSmACLTTEIALVCrVWJFlJUlWSFXICrw^ 

TXSKLU.PUCXOTALXVLACCN:AYKXMNFinrSPVLrYL^^ 

CPn_0837 947777 947145

nth-Endonuclease III

LTMCOFXLRTUiAXSPNnCF$tJX»KSPFOL^^ 

CSXl^PPGKLYO&IAPCCLCBUCSAYtYOLSOXLVItDFHGE 

ASVflfir A l UWy PWIH IlJUAOR>n(ISEKKSPSAAXJCTUu^FTCHK>irPlPJtLOI*rY 
YAI^PALHHKICNCPICSYIJUCIANSTRT 

CPn_0838 949196 947781

thdF-Thiophene/Furan Oxidation Protein
ISUOYPKSFHIJTJIJCLCIL^ESSFNFSIFTCJOTDT 
VXA^XF^C5VA5FA5HTXK£CCVXFEErLXDOAXXUl<R5PRSFTC^^ 
ACSQItT^IA^WtfAlJCtTSORArUJSXIDLVWEM 

GN TSJOtl QEIHTLI ItMASXC'A^TfZEDSPI^ V^O^ QliALHr/tl^tSSTttOO 

AgaWTIlPrEXISXgjUU^AMEEAPCXIJt^ 
P PPnJ T S LPqFAlSAICrC^LTCViqALIOWMQKOEJkCmiC^ 
CLXEAQOMLYXjOPVEI I Af< FT RFAf HSICMLSCKEVTCSILCEIFSXPCICX 

CPn.0839 949230 950159 

psdD-Phosphatidylserine Decarboxylase

FVZVSRCa.VQXPOYtDMTXXICVXEPXFYEICTnlJXYNSXL^^ 

GWLORCSWTfTOXRFrwmYKXSZKXLTXPVM^ 

TPVCaJm.VYTW^EFT3STrVKSWkFSIJ>iaXC^E^ 

FfCXOJOXTRCVNCAl^SVHPlAVKXKrXl^CEN^^ 

CSI VOTr^PN DTYJ u^EJ^ r FA FOC^WIUJLPNAXRFWD^ ^ 

SLUMQUIBT 

CPn_0840 950141 951544

CT700 hypothetical protein 
XStlUUCJCTlJCrFTCXAWKSOKWRXM^ 

QvxiuTV!ttSMn,.irFwr^roQLLPXUoqi5Titi^i^^ 

KZiSUtQPYFFLEX AYKAYRFQAFKZCAOAF ASVPQDI^XECXIAAXYASALVRLCOCJIAAC 

SLXIWISFUHOCTPVTTCHXYrTSlGtYXDAXDFYTO 

SSYAKACiaFt«U^WPVYXgDUJNICLCEOKI^ 

AAltAAKDOROYVLAZPOfrXJORCSTFAXinTOnj^ 

FPECLTACKAXJUAjOS\CYATUJCSEECXJflrAJOMV^^ 

AYTXQSFUStU7rSLCTORiU^XLRZLIUCICLI>UaiHHXVE^ 

CPn.0841 951719 954640 

secA-Translocase SecA
rMUWCFUCKFTCSSOERXUCXFO^ 
E5L0SK^EAYCV\^QWOlftlJ^PVEVSCYH0RW 

CECKTLTAVMPLYl^lALTCXPVHLVTVKDYIAO RDC WUIWU^TTCVLVSCTLLX 
KRKKXYCCDWYCTA5EFCFDYLRQNSIATRLEEQVCRCYYFAX IDEVDSXLIOEAJtTPL 
X XSCPGEKKNPVYrELKEXVASLVYlAKCLCSRXAXXMlRCLDSFL^ 
ISEFCRSLWLVSKCMPLrmVUlRVREHPDLRAHXDKWDmKAE^ X 
VDEHNNDF ELTDKCMOOWVEYAOCST EE FVWDMCH EYAL I ENDETLS PACK IHKKIAIS - 
EEOTLR KARAHC LROIXRAOLLKERDVDY XVRDCQ XVI IOEHTCRPOPCRRFSECUOAI 
EAXEKVTXRXE5CTUATVTL0NFT1U.YEXLACKrCTAXTE5P.EFKEXYNLW 

VLNAKMMAQEAE 1 1 ACACKLCA VTV ATMHAC RCTO I KLONCAVX VCCUIVXCTTIU^SRR 
I OTOLRCRCARLCOPCAAKFrLSFEDRLMRLFASPKLNTLI RKFRPPECCAMSDPMFNRL 
I ETAQKRVICTlNm RJQTrLEYDDVMNKOro^ 

VASLVMSOROFKCWTLPNLEEWXTSSFP t ALN t EE1*R0U(DTDS IAEK X AA£L IQEFQVR 
FDHMVE^LSKAOCEELDASAICRDWFSVKVMH I DEOWR I HLVDMDLLRSEVC3LR7VOOK 
DPLLEFKHESFIXFESLI RUIRIT IARHLTRLELTVEPNPRVNNVI PTVAT5FHNNVNYC 
PLELTr/TDSEDOD 

CPn_0«l42 *»^S01S 'iS47IO 

CT702 hypothetical protein (frame-shift with 0843)

KYYTPPT I S RJ PWSN I ALKT 1 "JEPEY DCftOLLKTOn LLTTTIVOTlXNAPlCFPttSICICICH 

I LFC XAf ff ITLJUYAOFL t Al UffiRKFW [ hYYNOOVWTi EWTPF X 

*X*\JiH4 : </*7 "J '1'iSJtn •»',4'»'t4 

mu.'. r.ypiirheU»:.il rir>t*ih ( r r imr- nhi tr. wir.h OH4H 

riKMKL: ; :*vnncvwYrjuYr>TrrrrDi ilo i kfjji.ixn::epjwc;lokyoeti v/veewtboc 

LLMVLriEL'WFKHTIIUfXi 

' Hi_'i»?44 »'»t.7M .V..V/U 

ypl*.'*'iTP-i::w/*iTP-lMliJiW| |if'*t •mm 

KNIf » CT IMLK I A I LGRI tM JK::;:i J'NH .< ;ri»: I VHr^f? .TTRUNI ,Yt IRIJIAFIWPW 

t rrr y ,-/uin: :eo vfok \ I i ymmi .Trwr VA\ m .1 .1 .V I U t f" ? : t TEtawvm •AKLLXPLKXPL 
tLVAtirAOfiHVCEUJ [ItETYKI * i 1 1* ly I '/* rV: rr Al IOKI 1 1 l/HJ/jn I KLVANLPEPRF.CCEE 
• X£EUM>ni'K£SEAALPr,*!nTPOr;:EVrT!rF:. VCTtPeSPOOAPrTLKI alicrp 

mvtk:::; uw jllneerc i r»rrr»7rTRO]rDt-vnHKDR0YLFtDTAGLiua>acsvKNSic 
wrssunTocA[:;RAoiru.v tOATOKLSSYExtuiJLx jkrkkphi ilinkwdlletvrm 

EHY^KDUlATDPYUTOAfMLC XSATTltRNlJCX I F5A I DELHHWSNKVPTP rVNKTLASA 
LHRNHPQVIOCR RLR I VYA I QKT7TP LQTLLF iriMCSU.TXKYCYYLiarrLKSSm.yCI 
PFDLEFKEKPKPHN 



•"Pn_'«4S 



w :r;v .>•;.* ;.* v. :c: .y;-. :. r -rr* ■ -v ; v.wrrr. : ":w»riv r,njA7Fv i n".7»nmie; 

RPLDtO t ATNASPTZ'/ST r rPOVlJ [GVATC X r/VKOOCALrCVATFRSDCEYXDCItH P 
DRI XFSSKREXVUJUtOFTVNQfYYDPFEXUCVTtlFVEGTW^ 

LRILRAX RrSSSLCm-OPTTERAI IK£APALVNSVSPDlI>«IIJC»flJaWPVGM,SU. 

ucuCTa.iFzrpaju3iPYSuitTTierAiucFNPW 

LRXSMCEUaZESWYEALPKFgNOSCMn^AHFtJ^PTAPXJIJI^ 

ISRVQELESRLEOFILRIKTSSPVVSAPDLIAICISPQUXCDLIJl^ 

EKILLLLOOOCFWIC 

CPn_0846 959393 958112 

ClpX-CLP Protease ATPase 

RE3eCffiKNLT:CSFOCItS£m^EICUASPSVYICt^Cin£SCZLOOT 
TPSOPSDLRVLTPKEXKKH ZOEYVZCQCRAKKT ZAVAVYMKYKRZ RALLMNXOVSYCXSN 
VLLLC PTCSCKTLIAKTLAX ZLI^rrZJUSATTLTEAGYVCEIJVENZVXJUXOAWYDVA 
RAOtCZ XYIDEX DKIOTTTAWSn^VSCECW^LLXrVtXTT 
YZRWTINILFXVCGAFVIII^XZAKRLCKTTXGF^ 

AFCHIPEFVCRfTCIVNCrEI^LOKLVAILTIPTWIVKQn(Kn^A£ENVKLVF 
AIAWCAJCOAKTCARAI«LENLXJU)LK^ 

exiA 

CPn.0847 960019 959387 

clpP-CLP Protease Subunit

KLFI1CE10^^X•VPYVVCZ7^CRGEMKDIYS1U^JCDR XVM XG0EXTEPXAN1VXA0X«LFLM 
SiBPKKDIQ IPXNSPQCY XTACIAIY7TZ RFLGCDWTYC ZCOAASICALLLSAOTKCXR 
KALPK5RMHZHQP50GItCnSADXOIOAA£XLTUQO{XJ^NIt«SGCTSQPVQCZ XESSERD 
FFK»E£AlSYCUDKVVTSAK£TNia>rSST 

CPn_0848 961556 960177

ctg/raurl-Trigger Factor-peptidyl-prolyl isomerase
voass paf7flcsnxxgclvfr£x£neofsvixeesfccxvsax^^ 
kikxeztlpgfjuccjcajtovzasrttthvr^^ 

S^rTOFTlUEEMCVEFSYCAFPAISZXJVQlLSLP^ 

ATKWVERPSOEGZJttSISIJWSXSMXJEMASSAAIF^^ 

TGHRWETXTSPEIGSFI^DTLTTrVNAVIEVSIPEII^ 

OLSCOAKr^LQlOlFSEA&ULAMLVDFZOT 

LmKSELZKIvA£EMTI»IJCLLFLTKXIFSD 

DZ SNETTZjQCLVHSAiUSIU^TYSKAZSfVXJUCAXl^ 

CPn_0849 961752 965285 

tnotl/inf-SWI/SNF family helicase
AltfllHST^RfiEHLHFRKZJWDPSJWIXXCXa^^ 

GLYIUI YEC EI EVPRS ESDTVPSHCPCSYNYDOQH ZVALLFYLEQYrNPff/VAYARSADL 



E103SA£LAVLFV5VNEX7rFAPA|J0PZ ETOLVIjIlLPCItSXPFYZ SNIRTPLBGVLYOEPTV 

UCRXFPTTMOSFNASDIUaZIWRYVmrFI^^ 

OLAIWJQGSLCHBCESFSGX-FCCMLEEP^ 

OOSVOPGQnaiXSCAPSZ ZHHFVYHRTSPOZ1CRAHIJISFSRLRDIAZ PEAUCSTRENA 
LPVFOEYASZANVHLUtsrm^YVDCVRAZCZMSYtiXaZ^ 

LOYOMArZSDECXX«Al^VeEIUCMI£WFSCFZYZ)ERZlQm P 
ANQHRJTFNCPEXiIX^SGOFIYDETIFEZ^FIl^^ X 
SAKIQXFLEI^KAJraSIOTUBXVNSCXLPCT^ 

KCPl>fSLTCZSLD0FEALPVNF5MSERUEZ0X0ZnCCZ EF D FOSVP QQIQATLRSYJQTE 
CVMfl^UUQ^UCZLAIJOMCZXaCTL^ 

FRKFNPEFRTLV I DCVPSQRJU^LTAIJkDRI7VArrSY»aiOXW^YKSrRFT7rmj^ 
KHI104RTTRilAlCSV(04IQSXWRWLTGTPIENS^ 

RTCNYMCJncAianwAiJtKKVSpnL^^ 

ASAKOEX£RLVKCEIffERIHXHVIjm.TRXJCOIOT 

SSLVDSCHKTWFSQryTKMLCX ZiCKCIXSRCI PFVYIJCSTXNRLDLVNQrKEDPSLLVF 
-IS UCACGTGIJJL VCADTV ZHYOMWWNP A VENQATD R VKR ICQS RSVSS YXLVTUfT ZEE 
KI LTLO^KKSLVKXVZNSTOEWSKLTWEEVUXLO I 

CPn_0850 965254 966390 

mreB-Rod Shape Protein-Sugar Kinase

LCKJCYWNCCRYDFMS PKRNLFKlJCNFSNRLYKRAlCRFBKVTNFrSCNVC XDLCTANTLV 
YVRGRGZVLSEPSWAVOAGTHAVLAVG«KAJCAj<LG^ 

CMLKAL I KRVTP SRSVFR PR I L I AVPSG ITCVEKRA VEDSALHACAOEVI L I EEPMAAA I 
CVDLPVHEPAASIIZ ZDZCGCTTEZAZ ISI^IVESRSLRIAGDETDECXXNYHRRTYNLM 
ICPRTAEEX KIT IC3AYPt*WEl^EWRI«VACLPZTKRZNSVEXRECLAEPZQ0Z I 
ECVRLTLEXCPPELSAZJLVERCMVLAOQCAL X KCLOKALSKHTCLSVXTAPH PLLAVCLC 
TCKALEH LCO FKKR KGNLV 

CPn_0851 966378 968195

pckA-Phosphoenolpyruvate Carboxykinase

RETGIVMVWSTNI KKECUCSV^XJEVAKLTTPKDZRt^OCStTTEY^El^TlJlESTCTM I RL 

NPEFHPfCFLVRSSAI^VARVEOrTFICTSTEAEACPTN>WRI3POEKRRELW 

CRTLY I VFFCHGPLDSPFS IVCVELTIBPYWCSMX IK^R>CDC^^J^SL5T3CKFUCCLM 

:-^Pl^PCEADVSWPCNPKSKRIVHFOOOSSVMSFr^CYCCKA^ 

KJOCWLAOIKL I ZG ITNPECKKKYFSASFPSACCXTtJLAHLHPKLPCWK I EC XCOOZAW I 

ft H mr.P LY AVI I PEYC PFUVAPGT5ERTN PtlALATCRSNS I FTNVALTADCDVWWECLTE 

yPPEPLTOWLOCPWKPrt^PAAHPtORFTAPLROCPSIJJPIl^POCVPl^ ZFCCRRS 

CT I rLWFAU.VKHf ,VT I0ACMS.7TTTAA tVHCLJKLPHDPFANLPFCCYNHAYYFOHWL 

::KAOIR.*:tJtLrK rFlTVWrfFRKNNOCEFLWPTFCENLP'/LEWI PQRTDCLED tAERTPZCY 

M1II OKFTILNT aifLOUnVCaFnvnAECWLAEVEN mEYLK IFrVltX'PW ITDELU1 1 K 

::KIJ(Kr 

''Ili.MHM: i«H274 •>/.).. 13 

tv [ i tt/fMif tntr u;,ii protein 

1 KIJ< I IliYYYI.lrrrVTUVSlY tNFTPNVTTALJiJCK IC/T:!At El J2TJALFFOELODXA0C 
I J< I IAI / Jl.WKU ! AKALK PAvJVQTU I 'JYLPTEEJilR tSAttl I DBTMPTPTDDEVKAILO 
NI'fB'PTTlK 1 WO :ti)KVFR:n*Lft,VTPPEfi IOP.<NPEC:A 1 1 LNY ITLLMMLKPKFAACST 
I Tl jAUYflALYAf.l* IPPVKK I EALKAAOAPWS KV«A KWJK IMTl YNNMOVLSYPVTOYLN 
( AM; :i il rTAAOFVWYUCNPY.'; t LKUIUirX VTf.rClATHYPAOAEYHARnAlTVIOSL 



LfrtJCMYROLTEWLPfm'. JEZIAOIRSFCMIVNCT! lAJKTLLrTWRLCTLUr.' 
I YTYOCC AT I rGXSYCTST ? AXQW/ IDA : NQE ICS YWQAR ANC FTlT.TO C»TOOFA TN : 

cTSYRcica^qwrmgrnFTr rp w cET RYPY^MCT^CTSBta ^ g rrTwTTr: : 

SCWSTOIATTCTOKraCIiPSlXKYreTWC^ 

I ASUZOKTySMUUUCYUICXJK£ ZTTFPSAD X YYSLJ Z YUtCKNLQAVADP IGKAVCVI. 
NDEinTWMADITRC^IKAAJDKWLVTIItArUEl^ 

SClXCFLSCLTLKAVNOFflATYEAFTAE I FTCPFN»MXRCLATFC3FVZ0CCC>C ZTPCC 
C>X3LLOAMC55^F7TT?O^LAlXLES3AHQQEVrLV3A 

CT712 hypothetical protein

NZNKPK Z DCRNSLPLTAVAPVTEESYMPSV ATTVDYVDATTURKLTVIJCZ7V7KXAWLZ) 
LOCATLTSKJCOCFINTCTELAI ICASLADOSSRESIIICXEOZraOKLCKAAPOAATXrSC 
VQPTADPVAI3KMPLOSAFAVVLLDKY X PAQEEALYA f .~R n WI CTAQNLFSPLLCHZKS 
FNSAPrWYNLCSYZ SC7TSCTANFAYCYDCZ LSRYWEVSCyRXJZASTVKAJCAALAJtttA 
SVKANVSLTSAOXJCOZESZ XASYTKSl^IKTOLTTJVKn^LAJXTTVPCUiriW 
OBSLSZ XALOKDEXVLVDCKVD VKfCGLUWyrrvr Tmnryvrm^ x^ry-|Tjfr J¥ r 
UUWOOC«SLVSASLXiafafYTTVISGro 

CPn_0854 972849 971806

ompB-Outer Membrane Protein B

GPFIltiSlCMUOtLJUATl^rSMPTCrV-SSPAVYALCACNPA^ 

CNSYDLFAAiAZS mUF VTSESAKlTNVPVZTSVn r SO l Ul ' lP 1 ! ITSTTKWOFD 
LfOtSSZSSSCVPATIALOmPAAZPLXDZArTAXVCCLJCOYYIU^W 

!SEVTrx^ZEvosim:ivuci^LOKvijocix7/sr 

XYmTaarc£YJCOSA5ZSZSTYLNX?YVLPYA£^^ 
FKZWCZTNFOlVNFCFCTTCCXSNhfrYYSVra 

CPn_0855 974001 972994

gpdA-Glycerol-3-P Dehydrogenase
CLKKOTZCYlX3CZWrCtVkSU>NXCYPWA^ 
LSPTTZMXZAjlOAFtCTVECmACZIUVAZQIJCOZTOLSW 
ZMLCVLCSSVTPYXCrl^PSIAXraiCSPC^^ 

CaJ CLtmL CVZCFStSSWgJTOfLLAQ ^ 
XiMVinUiyRVLYIZIXiJZJCKZAXIIjOI^KUfl. 

CPn_0856 975410 973995

AgX-1 Homolog-UDP-Glucose Pyrophosphorylase

CSBSmrvU.TVMnSVYSPSAM4VNSlJUJI^^ 

VDXDrrW0X»LLSSPTWUCDrHPITSFASSGI3>PD^^ 

ossLsrixspiccxi'pvsPZjaacpWLVAjacvRAASKi^ 

BSWrTKLDPNQVZ>FFCQPLWLI*TZ^CDLna^ 

WKMIIIffrSArXPIIMPLAZJTTWU£CPliAMS»^^ 

QCtSVXCYSaPO^WALiaSClCLKyCLANXCLYC^ 

D*BQLFKRfTG3a^mTTTE*ZADFTfVSTSTSlJ(i1QiXM f UV tt IA S 

CPn_0857 975808 975392

CT716 hypothetical protein
LZXLROYICTARGISRI>ODWX3I^ 
ZAT3UO^ERYEHFISRI»IIQfYDXLLEYlJaXOSSLYK^ 
KIXnCZlQMXYSKOOEZOT 

CPn_0858 977115 975757

fliI-Flagellum-specific ATP Synthase

RNStnO*3RRTRPSTFCTX»MHLNKEXLH^ 

CCI£iaSST)OPNUA£VZ5FKNHTTLXJCSLSPUt5VAZ^EVU 

RVX4)ltfOTZmiCEZUCriOUCPli^l^SP»e^ 

<?^SrPCS3ICSSU^IAl£SlCSTINVIALICEI^EVRrriDCHS 

AHeTAPT1CVIAaRAAifnArrTREC0(CVm 

ASVFHmEFTEJlAaHDJBSXTAX,YAXLYYP1CMroZF^ 

SPPZZJXLSSX^RSAOAXJU^PHHYAAAElUJlSZXl^mEAU 

ICI4^IKAKJkQPLSSYaXt»mJ0JlJALADS 

CPn_0859 977597 977055

CT718 hypothetical protein 

VrLVTTPOSPCSLSOSHLPHPHDPWDTEPTSL PEOPNDXASOELHS LVKL/RXLSZKLLS 
EVEXTVXUCPDIJZLALLZCEXFLYlCXl^POZaAL^ 

PEtajCTLTtJWISTHELPMI KHAEFF PDTSCRRSCFK X ET PNC Z LROQSEZLOKUXVLT 
A 

CPn_0860 978639 977608

fliF-Flagellar M-Ring protein
FTLYTFQKlAXKLTllJZISTlJXl^IGrjVVSCAlLTGRS 

LTQNWPICLZESLTKKECLEKOLTSFHPXASAKVAZAZ^TCOCVKSPIJfljSVXLTLIUOEE 
SLTPSUJSITDYLCSSLFCUCREHISLSDNLCflLYI PE2XTVNSt^XHTLENYLCXIFP 
KEHFALAYHAKAEXPTtCLTLNENYZ AHLTKEESEK Z VAHTKHYLY0NYZX3SYDZV XETL 
PFARI^OCKSFPAKVLIC^HILVISLMIVAIASFYLARHAYERVSPEPRKZKRCXNZSKL 
LZIZOKESPEKZALII^'I^PKKAEAIO-NRLPEDLXHOVLKYKL 

CPn_0861 a-*752 'i78«S 

nifU-NifU-related protein

ASYPFTWKFLMTLPLEFXI FWS2UAK*/MIOCFLTPHCACTFSEEZ)AEAXEAJfL^OIC0QK 
P.LrCNCVTFYWLVnCXNC\' I lAAKFQYFCH PYL Z PtAEA^LVCClCSYSEAYKKrUOZ 
DKSLRVHAMOPALPEOri I > LYH FVI DALDT AVEOCLE I PIXOCSLPLONS PMNLOFEOAN 
PYSOSDWEALTHEOKLYALRAT I A£X I^PY I AKCOGEVTVESLENF ZVT XAYSGNC9QCP 
5SLC5TLNS XCQLLRAY tYPELOVKVCESSLNLCHP 

rVnJWf,2 >7*J7.":; 
ythotlif S*rftl.ir«\l (•: k ir.«tin 

WGT I FRITDCKTA: [:>!EM<*«KAP[* I FWUff IQVA I I'KiERVKEiTYAUlSO I KSLPPC 
."AUtl AEKTEES I RULVi V-KIK a)IFRF*/MIFM r/VI 1 1 Vt -AAL VEN Li'<MF\t WNI 1 1 1 LP AH 
0O0LLZtiSLCRHOnunT\'^VTW#lF/il' I VtifJUL tETL.';pR:XLr:il^yVAHt:LTtIVtOr 
L&PLLCLCKDRR (U JHX 4 I^ll I LnRAI-LTPK H J IAD I ITr.UAAUS >r::; KXiirrRKSL 
ERVFCSWFPPI(T5^;:UT: , .AVAAMC r rA';KER I :;ALPI^TPirT:WI A'KKLIOBUje WLT:; t 
0LAFr:CyOMRLPN I WAA I PDr I'AE; U *\l'\\\JVJf, t Y niUTf ER FOrLAUVLUNt i; t:;pF 
l^HSALIirr:LTER::KWXr::Kl^MAHIII*A I KIII.TPLLU'^ 

CPfl.OHOl $rWQO"i<|\;!'l .H'Mtl AVOf l> I 
f/p«A I*ho*prvy|lyrj*i.tt Mnt .if... ^ ' 1 
niHALUlU.PHO0SVWNeKNLF.7CWVOtPLS0OC .FSACJU»0NLPXDCirr3TLVR
.;LKTALUKTTIHHSKI! IPY rVHn3PKAKEMSHIY3AEEEM#<IPLY0S3AI2iEWflrCEUJ 
-KNXKOTATC^EERVKLWRRTnn'APPOGtSLY^^ 
J^MC^LRSLIMDLOCLSEEZVl^LEUTCXPVVYOWKNKKIOCifPE 

CPn_0»M MMM 9? 2 374 

yjbC-predicted pseudouridine synthase

Yr:^WPKVPfJIKFLA.->ArrVA3PBKCDCI t FSCSVTVNCRVXECPTVI.^m3KVCVrX7TS 

vi "„T""*.* , t*h>r."t* , .s : ;v * . \ \ *kk"ki - rr:*LV : : ■: ~ AjiLfYPVrTV^fr L.*.f'L* r:; . ; l.*.tt; 
: < :s? ui?' • f hi-.'. • <'T v ~Tt: srz'w v. :akpi ; :r: .vS-Trr re* :n rv? yrrr : p p'TT/k 
{WSCXPCKHEtRlJACAAGFPI LEUCRIRIGSLVLGGLRYCEY RELTCAELCTYMKLSD 

CPn_0865 982412 982942 

CT865 hypothetical protein 

SPKTifVrryiACSiri^ISLGXYCQLYYSVKSVLrSWVTXWY Uf. TOT .VSE 

MiriCtOTMSXm^rEXAAYXEAVEXyXJOJQlUPUUASXL^ 

CPn_0866 983494 9829X6

birA-Biotin Synthetase 

NKKVTCTYHEEIPSTTnKAXSYlffltWPYALTiaSTKOT^ 
FCFFITDLHXDVSRLFRIjGTEAWAIjCXBIXSITc^ IKWP 
BaJXaA^IGLNGWTTKQALJCDM»PATSLCEIL^ 
POSLATXSNRGNI 

CPn_0867 983403 984667

rodA-Rod Shape Protein

CIRIPOHHICTCHCVWXaiTFYTVIJJNFTII^ 

L7^J<LLSWVISSKDFTAMLVTSSRSU.TNK5I*^^ 

AW\fcYTFMXCALVCtJTv?SVONVHlWmPTIH^^ 

DITSKTTATLACLWAt.PFFLJIJCEPOLCrrALVLCPVTLTIFYLSNVKSl^ 

ICIIGSIAIFSCIVSH0KVKPYAIJCVIKEYOYERISPSJW 

TCETACROWUnfCYTOSVFSALCXEFCU^CIIJ^^ 

A^rnnruioivLZNXsoioGx^rrGvpuLiSYCcssvxs^ 



CPn_0868 986733 984670

zntA/cadA-Metal Transport P-type ATPase

NFRNCLCVRDLKHFREYYLIINEII ITCRYWSKlJTTSFSA£VWTTT^SCKStDTSPL 
XJW2NRRWWCPtJCSAYI£IXn^IAI*LSFW^^ 

N rCQKWMIDimrSAATCS IFIGGAr JTWT iriT.VIXAISEALCXHVSCKAXSTLVSIXOL 

KSCHPCS tVPAGAHNMEGSFDLRVLRTQSDSTX AHX XNLVXOAONSKPRLQORLOKySSV 
YAI^IFAIACCXAU.WLrTSIPI^PQSATlfRAIAFI«IAASPCALI 
GAXHGVlIJGGOTLDRlArSCKSVVICDXTCT^ 
SSSHPIAEAXVSYtMEOKVSSl^AQRYLTNn^^ 

HXVSASfrA£XX£XS£WFt&TFEDXlAKXR£XATQS£IMMV^ XA 
MCEAGSATAlEAAOrVLLHOSLSSLPWI IOKAXC3TKKWSONLAI-ALAI HiVSWPASLC 
IXPIiOAVXLHSCSTVIVGUtALRLUCS 

CPn_0869 987479 986658

CT728 hypothetical protein 



TPKGFTYHIJVWALSTTOTIFFIRTLFFLI 

HAYMEXfSHRSKLEEXKEI EEKFEOSKZEtJtXLFQJQCFKDPI^DfVCYVCSOSTUXOT 
MlRmYlRTOLPHPLIOOCSRIl^CLCCLAIFLPLVLCI SYTLACVTSALKVLVLSFL 
KAXIUWDJCISE24VWVLCIFITSASIISSLMKLL 

CPn_0870 988881 987448

serS-Seryl tRNA Synthetase -2

TTTHPTQGFGSAVXLPFSPISIAIUtXXXSCCSEKSSrYSHFCniliMETSMU 

TPEECETUJlKXDPKXSlfraADKEVTO^ 

NLIQIVCTLAADLZKIEOHLWKNAOW 

PI FSFPPKHKIXUtOELCX LDFOAAAKTICSCWPAYIoaU^XEUALLTYl^^ 

OLMUPUATCKEIlJCSGOXPKFTXaYYin/EDGEm 

IJLYYAACTPCTiaiEACAAGAOB^VRVmOFHICVW 

LTOJCUYRt^ixjKa>tsrrAsicTinArvwLP<^ 

OSOOXWFVHTLNOSGXATPRUA/AILENN^^ 

CPn_0871 988766 989899

ribD-Riboflavin Deaminase 

EYMEDFSEOOI^FMRRAIEtCERCRXTAPPNPWVCTVVVOENRXXCICFHAYAOCPHAEE 

LAIOJUSKPISCSDVYVSLEPCSHFCSCPPCANLLI KKKVSRVFVALVDPDPKVACQCXA 

MLAQAGIQVYVGIGESEAOASWPYLYORTH^ 

PE*IWDVCKLJIAESOAILV(XRTVLSOCPWLTATOPOGMLYPTO^ 

KVFOTSPTL\*VTTERCPENYIKVLDSLIWPVIATOT 

VEG£TTLHTSLLXERFVNSLVLYSGPMXbGDQKRPLV5^ 

5LXWWE XSPQVFEPIRN 

CPn_0872 989903 991216

ribA/ribB-GTP Cyclohydratase & DHBP Synthase
KEP. X FRVAC LAS ESVNARESHI ETREEVCSANFVSLEPA X EDLRACKFVI WDEASREDE 
COLX IACDCITVEJOfTFLIjOHTTCWVCAALSOOUi^LDLPPMVKDNRCRFKTPFTVS^ 
AA>f3VTTCVS AADRTKVV0LLADPK3KPEOFX SPGH FFP LASS PGCVLKRACHTESTVDL 
NELACXjQ PCCVLAELVNESYSMMRLPO X LEFARKHN I AV I PVTS I X AHRMLSDRLVSK t S 
3API.PTIYC0FT IHWESLUI3*0HLALVKCWACK3?r/LVRVH5ErVTCTlLCSia«:tX: 
CEQL33AMSY t A EKCTCVLVYUtCOEGRG ICXCHKWJWAIXJDNGYCTVDANIWSHGFPVD 
SPr^ICAOtLVDLXLTTrKLITWPOKYFCUWrGl^IT^RVPLPVRISGWE^ 
'>EPNCHWLOL IVCNNRVQ 

ribE-Ribityllumazine Synthase

t HAVT V JYNNFREYMKTt JOC 'M LOAKNLR I A r VCSC Ft JQAMAOALV^rTQETFLXFTCliE 
1/ lUfT I RV(< tAFEt IX.T t KXLL;a ttkKFOA I VAfCVL t VSbTUrYNO rVWjVAAt: 1« 
LE^U* tTU". I VAAr.'WEI AWORa* I KCRH L/IVdCWTTAI EHATLFTO I 

»:i*i_l)H7* -I'Mlhl -»«JI74't 

'T7'; i tr/pr*r|H.'t pcnri-in 

I. r V.UllXl LTKORf HKFAMLK I LK IRVt,VFPU\LUry.*ri:: tf^ACIVX^UjTN: vtkvk 
f tlZKA* t hWWJtV^PKt JJrfUTEflf X^nXTTTTP IDMAYnEKLFHKKVrALL I A t CtUM tf IL 



EAE^VNSTYTVOCEH I pLTFRCLP3LVACL3VATlCTT/?P!>«UCI'YSTKLSliVKS 
UlSHI«J<l^KOL^CTVUySrPr^SCCL£VFAtJ:A^ rHUCAVSINl 



CPn_0875



1t1)«3 ■ M 4r,22 



!>*:wi'*;:v*'~* r * • ■ 

RLJRRSRRLFAP.kt-LT^KUfUrVv'ANFk-r fAEK: JLvCEarU*tV/*SAA4jUJii 
OCEIKCALYR IRCVHPLALI EALADJPALICGHKKMQGREW rWNLrLTOLSEVFSOAWSC 
CVXSBIDIAAFASTLCLOSCTVASIVQCERWPEi,VDXVIT 

CPn_0876 994123 995517

dagA-D-Alanine/Glycine Permease

S IATCr37C,YriE0LMCLSTSFXr\TP«ILU^FLTVXLR^ 

OSSSfUUICVSSYCAVACILACNFCnWXACHAVAlJkCCSPCALV^^ 

SYliCSICYIUCPIXSKtCErXCCPXACLAFCMIUCKXXJ^FFALrrXMTAFC^ 

LCA£CTTCXXXVSXLLAX.WXPVLACCMNRXUtrSAfa^ 

XLPAXiaXCSSArCXIUCt^CXCCYTUGVXSTCXNRAVMATDCSSCNVS ILOAtmCSKN 
PVVCCI.VTLVPPVIVHWCSrrKLVLIVSG^^ 
AHALFCYTTILTWrACAEKSUJY>fIPCRJlANLWUCAIYVLI 
F5C^LICXAtXAl^JCXM^TtnU7VAUJCERECSVAX)PVR^a^ 

CPn_0877 995521 995982

ybcL family

IUUlZN01XSPAFAYGAPXPiaCYTC0CACXSPPLTFVD^<»A^ 

OaLHXHKTVna^TTXTNXABCAEXFAVOStXrSCKPV^ 

VLPEEZNmX^YEAHEFHXXEXJAEtX^YCKS 

CPn_0878 996660 995992

SET Domain protein

KKSXXIUUJSPIJUQILCICLKXOOIJXPPAPPVSVCW 

TCIIiRHR0Ari4KDENDYCFTrrTHPUTtJnrrri 

EQLniVIXim^rVAO0EICYHYCll.Y«KHRiaWEEFIPEE£ 

CPn_0879 997463 996645

yycJ-Metal dependent hydrolase
YIUXMeVStOCTFPtASGSlGGNSAYL^ 
IFVTKEHSDKISCXXSr^YVrPIVCKXrAIUUrKlXOSNPEF^ 
VOTFMVPKOAVDPVAFXFHYItEXgaCFCTmCSiA^^ 

OSORPCVYioatvLSia^SNOirxo^ 

SIASXTSIJtfErJUJWCrrSPIxTSRLIVACPR 

CPn_0880 999164 997444

ftsK-Cell Division Protein FtsK
milllRKlCSRHPRIJTUlJUkJUkSLYIJTACFW 
FUYTFCUUU^FXPLYn^FLYriWrPRPLFinrXAAAFLSLP 
PAUiJTWJWlUaOTPVSYVTOIPFYI^TfEGOSFXXia^ 
YLCOCIAlJ JUUC T >UUSW «AfC SF FqTCF1WIJaCLJWRR^ 
SOPSVAXremilASXSPLPOEEXPOSXKCSFFLTPHPaCR^^ 
TIAI^STPTVVMSWaCERAALPKl^SLAVPTO 

LILKQTtTCFCXDAIg^ICSGPTlAArCVLPHSCVKVOXIKSLI«DI*fJfTW 
APXPCXAAVCX EIITPFPQAVNFItDLXXDYO)CTNRXLOI PlXLGKXAHS TtA W gt kJ W 
HLIXAGTTOSCRSVCirrri\MSMIMrTLPSEIKLVX IDPI0C\^TCY50LPWt^SPVrrE 
SRXVYNALVWUViaMESRYEIUtYliCIJWXQAFNSRTR^ 

XDELSX3£IJ^SOOXETVIXRX«A0NARAVQXHLILATCRPSREVrnntXXAIIFPIXXSn 
VSKKVNS0IIIDE3CAENU43Nra><LVlXPSVFCTIRA0CAYIC^ 
TQmPSFHAFOOSOSDNSCIXDPLFAaAlCX^ILQTGNASn D 
0LCEARI ICPSECAKPROIUONPLEO 

CPn_0881 1005646 1006209

No robust homolog present in GenBank/EMBL as of 11/7/98
NXXTAVKMPVPI DNSSRNXX}EVPE5LEDlZQHAEE5PTH0SAE5SSU}t3lWlSSAXStRV 
EOLSSLVLGMENSOFSSLROVPI FS AIYXSSTHTIVPTPLVCVCYINCSOSOyYDTORES 
U(LS0l4^RRVEVVYl)0CTlFKEA5Ul^PRitPRJtOPSPXSLALL£L 
TFNPIFFW 

CPn_0882 1006169 1007404

No robust homolog present in GenBank/EMBL as of 11/7/98

OTPQVAIXXOYTFCTCAFYVREALJU.TPHAONXVLVCICPSLYPEHPRSCYYRVSCDIGS 

RFOIRCFVNSCWETI^YSSCSFCXFWISFTOPTFWFAIVNTTMRTACINEV^ 

TSLXEHRCUEOOEANNTDSLCQEESLMC XVCHTVCCVSHTVTSSPNX FYRIQTLLCLPE 

TtAEAEENPTFPNSTX0SLA£X»t4NLVRISDAVSIFWXFPZVDTTYK^^^ 

NCXCSTFl^LTNPRSRRDItWRinAIH\aCYRSLCSC>C4LFOU^ 

LYAHWLFCWTVA X QOAijOVCFPSVROAFYR YCUWRYCLTQRNniS LCTTCTRFOVTRT 

HLEOOOMyASXLNL^FCLrFCFVCUfTTFGCLEXSPSCRWDAANNRTVCXF 

CPn_0883 1008904 1007)73

dmpP/nqr6-Phenol hydrolase/NADH ubiquinone oxidoreductase
LYELFXKSC I F IWTWLSCLYF EC I ASL C FCA XGV I LACV I L.LCRKLF (KVH PCXUCXNt) 
NEELTKTVESOCTLLVSLLjSC I P I PS PCCCKATXKOCKVRVVKNADEPLETDRSTFSKR 
OLEECWRLSCCCKVOtlOMSLEX EERYUIA3SWEC7VI SNONVATF t KELWAVDPNKPX P 
FT(POOYLOITVP^KTNSoDWKOTMAPEYYSOWEHFHLJTOVtONSOLrAOSANK^ 
2YPAELPTTKFNfRtATPPFtNGKPNSEIPWTVC33WF.^LKPCDK [tVSCPYCESFMCO 
DORPLI FLIJSAvU.'TFnRim I LDLLUIW4SKR EI OLWYTykR.*;LK EN I YQEEYOILEJI0FP 
NnrfflL'/l^ETLrEOlAAlWOTDDPTrTtlFLPR^ 

.vj lucLLCDw^rR:'^: 1 1 iroFi;.': 

CMi_«HM4 I ti.K 1 0l)'M>m 

CT741 hypothetical protein

i :DCML'ai f VTCFI J'LU^li.i*LPAEEJIAAy::KNTI'Vy I *AVM1 1 A I l.l->*YFn.WM«EUICRR 
KAMKKft KNOLAKvWJKVTAH i It f iTVm, | p F3 TtV 1 1 JI I A /J UTV Wl Jti i\ I : TK 1 1 rmnatr. 

Ofri_OKH*ii liHtl».M) ti|i|*i«|| 

yjtrA rkNA KHltytr i .uku -r ri- 

a:: n.m , rnw:nt^:v'^*.n:Krv;tr^ij::ijrrm:iJ4M>*Aii.vi-::miAi'tiHr::»' 

;U ,W »MKNRn;PFerYH''«K:3 1 \Y I : :: TK I KK': t r VTT :LL I IlfVTKI) 1 1 Jtl.Tt«KWWUCIt 
PKtitAYKri'KNKi b.'l. Tli.TVim «:i v,,||t-MV I I.Tfiy n-t'RYKVNFJM' IINHKKt u^:i. 
HI AT, I YWfcEKVAARC t .TTYYETKLLYCAP" 130* .-SOGKSASFSLRPRSFFOPOITO
AAK I IBTAKEF tNPECSETLLSLYCGACT IC tMLCPYVXNVXGVEI I POAVASAQENI HA 
imXEOCVr/'aCOAKAFCKItNDCKAPOVI 1 1 DP PRCGM0SKVLK7 I LR I C3 PK I VY t SC 
NPKTOFCECAJ)Lt30GYRIKKHOP lOOTPYSTHLENI rLLEREtDP 

CPn_0886 1011299 1013709

hctA-Histone-Like Developmental Protein

PTLPMALKOrTAItKHKOfXSS I ^KDLAKAEKmKAAAORVRTDS 1 KL EKVAKLYRKESI KA 

*:.,.r*:-r . vr.;"— \kf^.\; * • * ■ v-,v • •: r.v.VA.;j*; .vrrrAAKW>rr.: 



CPn_0887 101169: 1014157

CHLTR possible phosphoprotein

MKXLYHPTlJtJlPLlRl^LrFAI^TI^SCWPOOICSrCHCCAOWSALISCKICtnJA 

DF IERVIADfUTLTAMwrr^/VtVRCVt^ia: IRXGOCPYCVKTT QK7 ,IA f .RLPKDMUCD 

LOIUmiUJfPGOAPtJU7WOOlTriCCKESLODHUJXLrrVTL^^ 

OGOMOUUEIJUCELVAAUDCKCSPHPTIVOIE^^ 

7PYCLSEIAYTEAKI»AX.VlAIARGEVSRTOEVtS^^ 

HGAYX£STt£YYAYFSIAEI.Y^iC*IKDFA£IjERliIXG^ 

AKGXYESAEXVFLOX IDPAVKLQATFAAAYCyLGC LAYVQNKYEXAEEYFLRAYXSWGRE 

esciGuwAVTOacTAcewLVHpitrsrrv^ 

AVPEI£OTSRCIYI*aKYPJIVTTrHPIIEXAYlCVM^ 

AFWGALOSGASVPRSL XE55DVDCARXTIRCYCALYTKMPDAIAMLP0AFSCCCMSMQTA 
PKSSIASLVYYLCGYSESSAIJUCVWrVK^ 

TYISLQirSOAVHILeSVKm^ASKPKLHFtJCCaiCyXAMEIJ^^ ' 
AHLSKKLLEHVEKNltX SPRSYROYYCESLORTLCLCORFLGV 

CPn_0888 1015441 1014119

hemG-protoporphyrinogen Oxidase

AERRFCVKRAII IGASISClJUaMWLHKICrPOAEILVlJMCEAYAOCrV Rl EMWSF L L 
GFXGnJRGOGEYTIJUJKELSLQNSLXFSDSUU^^ 
SLXJCDFRAPCrTODSSVQOFIJCIUISSONFTSYXLOF^mXIUtf^SXI^TK^ 
RZASSCSLL*StX»«5P1«SKTDRYLASI^^ 

SPK£ACVn**>LTF F AXHVIYTGPLQQLPVLLPNYG IIULSKHVLPWLSS ISLSWHHAN 

F5LFKC»ta(LFAXlEXJ>ZXCXVUNSOXFPOATPCXm^LLX 

YXJ«N0KPDAFAIJS$O0GMPOH*VCFI£RXERI« 

AYHAICDLHTEETLAOFOSSI. 

CPn_0889 1016841 1015462

hemN-Coproporphyrinogen III Oxidase
FlJ<FlWNrKrU22JfOPAPTOrrSYP^ 

CQSJCLYCCCSVVUmRmiVEAYimXIOEMaVVCTIGFRP^ 
£XJT^LFWIHKLFIX.SHAEZIAIEVDPRSUttir^^ 

QEAVRRRQSKEESLKAYEKFKEXAFQS XHXDLXYGLPXOTXESFSKTXODXLAKyPDRLA 
t^SFASVWIiaWKAMKASCKPSMEEXFAZYSOSIWtLTXA^ 
LAF1HKTLIRNFQGYS L P PEnJIiCLOfTSTS F IRC IYLGNAKI13EYHNTVIJU7rFATV 
KSKILTnfflRIRKWAIHmCTrTINKZErTNLrC Y IF LTI VTISSRgRLISHErTGLIHW 
SFGSXJCVTFI^FVRVUTAFSHYTLMCVSKXSCFSASX 

CPn_0890 1017829 1016819

hemE-Uroporphyrinogen Decarboxylase
STUOJWDSKSAJTDIiKSOTASHPPrWU-ROVCRY^ 

ATLLCPStlJtVDAMLFADIl^XLZXSFA^rrYDFAPCPRIOFS PEOPFTFTSIJPOTIFSYL 
UXXJrTXJCDICXJVPLIWAASPrriJkCTXIK^^ X 
EX^A^YLICtQMD*CAAAVOXJtSSSXJtLPSALFTRYWE PNRALIAXLKSQAIPVSLFCR 
CFEINFYTWTOAimitl»OrWDLKRI0IWtJC^ 
VPUffmffllVSGHGILPETPLENVQLVVSYVOROl. 

CPn_0891 1021079 1017819

mfd-Transcription-Repair Coupling
NFHA*fl)FNPVNWSISKEf7Ui.TLPlX^IHPCATAFUgU^ 
UJDXJENLRTFUJQAPOTPSSEII^PKI/WinAVC^^ 
AUXXTRSPQATCQOTUIIAVCDVLOPEATC 

FFt£SFEPFlUEFfaGDCIISXRSYNFSlttLSTCKVSKI5X5FAYT^^ 
FSTPPLYLFttllXXlXDDFADISCm^Sl^WFSICTLYDRISTSNOVYrSETPrPra 
NUCDmvI I EAFHKNMEASRJAIPILYPEOI XONDEHPLLAFLOHLQEYMPPHCKPUCLA 
lYrrKTKSLKEARAIAETVAJraVEIYEICTCNLTSS^ 

OKORTOFSvrrEEvrvpiPcrrvvHiHNcicxri^iEKKpNHi^rrr^ 

WPSN0AYLISAYVCTSOKAADLKHLNSSKWKRSROLTCX2L XVYAEKLLOLEAORSTTP 
AFVYPPHCESVIKrA£TFPYErrPWIJCTrDOIYND»#iSPKlJCRLICCO^FCKTCVIM 
RAAVKAVCOGHROVIVMVPTrilATOHYETFKERMA^ 

VASCQIOII ^CTHKLI^^^XFKNPG^iI IDEBCRF^JVXVKDJCJCOTfPMIDCLTVSATP 
t PRTLHMSL3CAJULSV X AMP PLOftLPVSTTVMEHNTETt>TAALfUiEIX^ IHNR 
tE3XYTXArTXRNLXPEARXCVAKG0MSAEDt^NIFT1CFKIQKTDILVATAI.XENSIOIP 
NANT I L IDHADKrCMADLYOMCCR VC RV^nOUkYCHT LVPH L^RLSG PAAKRLAAI/iXOEY 
!mQOC t ALHDLEXRGAGM E LCT30SCH tCTtCFNLYCKLLKKAVSALJCXKTSPLLFNODV 
KI EFPYNSR I PDTY IETCTMR I EFYOK XCNAESSEELTAXOEEMRDRFG PLPOEICWLFA 
l^IRl^ALCHCISSIKCTANALYVOKCLSItSECTKICrLPYALSPTPElXVKE'/IES XER 
GFUNAS 

CPn_0892 1023673 102104*

alaS-Alanyl tRNA Synthetase

SFFFKLSWTIRSNFLKFYANRHHTILPSSPVFPHNOPSILFTNACMNOFKDIFLNKEICVS 
YSRATTSOKC X RACCKHNDLIXf\OrrSRHLTFTEMljCNFSFC0YFKA£AIAFAWEVSLiSV 
FNFKPEO t YATVH EXODEAFALWEAYLFTDR I FRLTt>KDNFWSMANTCPCGYC3ELLFDR 
OPSFCNASSPLDDTOCER FLEYWNLVFMEFNRTS EGSUJUJIiCHVDTOACLERLVSLIA 
OTirr/FEADVLRELIAKTEQLSCKVYHPODOUAFRVTAi^ 

'IYVLP.K I LRR: WrrCRRU;FRNPFl>EXVPJL^PAMCEAYPEUtNSLCOIOK*/I.TLEEES 

vrimjimiMLuywijtz:i3r.ssc lxedafkucdtycmp xdei^llakdydysvdmctf 

IIKLE0EAXCR;{KKWV0'*QCT^E.T lYNEUILTHEF tOYOHUTTDTF [EAI I£KDHIVSSL 
'jetCQEHA I V*LKV':PPY A£Kf X tOVCCOCE EFCnEtTrF IVTtrTT.'tPKACL I VW FIR tSOCSL 
TVRAAVT AOVNR YRRKR I ANNTrTACH LU IKALE I TLT.Dl \ I ROAvU; VVDDTK t RLDFTHPQ 
A WPE&U*: I ETLVKKi IRENEfVtdREALlfSI^W.'l.lE I KOFWDir/^DV/P'/VSACHS 

hki / tx m lAtvn * j h ;ffr itkfjiavam : c kr i EAvrru^A^vrv) njor. evixe r atllq 

Vl-HUj t V::RI .TATt J>ERXQiJDKRU(ELEN: X t v*rK LOK L El «Mi 1 KJMCI ITI'LTKHLAEHE 

wihuajyao uMtn^xLi^Lwrrciw^Yrvu^n'/r^Pi.nxx^'iiAOOLLrAVLTPCC 

< "l H Tt-trirtkitlol.t::!' 



ETUkFCLC t .T/5CCTY TEC L .M »«KIX tLCK I ACA I IC IE3 ; JKAJSCHPCL. 
PLCCAEl>AYLYCr^a*ONPPO* , «W t NRPRPyLJAOICSA^^YJCLKL ACFC V^I . gPLC E 
F1lOUiSRTPO*Pr«MTVC7E^TrUrWCril^^ 

vr^fcC r ftA » M iiM fetl»ury wigLMMWfcWT YCVWWWUCYaHE WLP1 RKEFtRYC 
WDVYEIDCYOFTH IHETFSSI KROOERPVLV r AHT I ICHCSPXESTNlCAiCSPLCVeSTH 
ST1COFWHLPEEXFFVP PAVKNFFAHK :0ET»IWOEf>a^EVRVW^IQFrELKEtrVALTS 
HRLPIOa.CSL'.'CSVEMPSS IACRAAJNKm COVLVOH t PYi, CCSSACLJSjOCTWIANIX.' 
IHTYDFSCRNirYCVREFCMATIMfrLAYSC^^PTOCTFLVFStTYMIW 

. .. »p-ji : , — ■ . " . ' * •••*.- v ? \ • * •nrAisv;—.-- v- * > j*!ct t:t.* 
' ■ : :; ' '•' '"vr:.!". :**«-■*: \:. vAKE^y 

LDKCVRV%*^T PCWELFlw>w t VwV Kw J ;VwU-J I RVJ ; ttW*A^>~i KY i JJ COLA! AMD 
RFCYSCASDDVSEECCrtTEw XLCRILSC 

CPn_0894 1026823 1025888

amn-AMP Nucleosidase

PRNOKNAJOnJUUCKYlCCERVSKHnES^ 

FAUJICVPVrCCSMFSAAHAPHLXniLiOFKI£5PCAAlTlDLCSFLPDLJCAA 
CLRSHYOVCOYFVFVAS XRCECTSOAYFP PCVPAlANFVVQXATrEVLEDKKAffYNlC XT 
FrnWXRFVEFNKKFIUCKLYETKAOSAEKECATLFAACYIUlNL 

KTXSScaeirtnrTTDHiLTCornENizrv>cJuuu^^ 

HASGStrSDSDY 

CPn_0895 1026973 1027557

efp-Elongation Factor P

nDCTHWSTSETRVCUlIEIDCOPY^IU^JDrVTCPaCXAF^ 
KVDTRTCSYE5RVSK 

CPn_0896 1027574 1027822

CT753 hypothetical protein
EXYTFrTVRJMEAKXXKEl£ir£AOl£IQCLR£X^^ 
KEERVrrKJLFOAIAEKIlZEUV 

CPn_0897 1028794 1027853

(phosphohydrolase)

KFSUSNTVXXJKNKSNPilFIQEXFRHVMRI IHISIAWFHVLPVNFVHCnaCPXJCCLtJOCV 

FCLVHFOATTXCgRFPIOArRSLCADSVCIT<SF5LTAKDGEF^^ 

LPOMHTVYTLKSlJ UA jrFYT HF P fTOLOONKVSTHKITPHWWLILIJXSCIJOff »A>CV 

VHLAOISMETT^IiSLSPEENVIIANHYPU^SONPSKDLITiNTHU)^^ 

HCKEHOAAVY»^u7rSPSYIXJtSSSX5LPTNSRFK\a0LYPEKY0^^ 

LXUkNIATWDCOKL 

CPn_0898 1030511 1028904

Mitochondrial HSP60 Chaperonin Homolog

KIRBFYAXSCTEl^NSirSKtXnrDFAJCAMVNX XUXEHSIX^TTCLXIXHAXLOESYAALEIC 

ciSTwg.iASijrrorj3n^EAx^ooswxicruiJcvRNi xfsslkmftxashfynafsvvg 

PECLISIT1CEHE MA \ I S MD V FO CFK I PACYASTYFVSXff ASRI«TRXAHPI^tJTO>KlSII 
IHSlXPlXQEI^EOWKLIIFCEPIDPDVl^TLVVrnaiXW IMUtL A 
EDXAXrtCTHXCFCOEASHVXAPEMVmfiSa^XEXSESCTTUCSLn 
AJCEXfCXTSCXXrB2Qu^KS1NR10S5VAXLFTDEXMEPLYTlAl 
VALFYASLTLCTPKMAIJEKSIAISUOWU^^ 

I£X5VrSSEXEDLXACGXtOSLATTSTILAOAUTAIL^^XULOIQ^ 

CPn_0899 1030848 1032215

murF-Muramoyl-DAP Ligase

NKICCRONYKWSMUilWSUfl^OVSCPra 

C»OFUCKAATACAVAAWSHDVQCOSFXaAXRVODmALOEAC^ 

C5VGIcnTKEFSmUSrr1Cna5I»XSYKSQLTV^ 

WQDUAIVOPEI AVITKINrXJHAMHFPOCIOEILXIK^ 

SCSPTA8XFSFSFKDPlJU»CYXAISCDSV\a0TPEEWaj»IAFSYltPA 

SWXL£VPEZr^R5t^EXJCLPP>OlFEHSMItlOff3^fffiAY 

CKXILXLCHMA£LSRYSEEC>(ALVA£XAASRC£WXFFXCEKWI PVQSVXJCSYSCEVSrFS 
SAODVXSXLKOVARYGOVXLUOCSRAXJaJSUACF 

CPn_0900 1032208 1033281

mraY-Muramoyl-Pentapeptide Transferase

LVFNFLGASM IPLIPN FLKOS LFFSIALTCMT^LVLTVALCVPVMKWlJtRX^^^U3YIHXE 
YCEXLEMLHlUJKAEVPTaCGVlWSLXA^VWUMlCFSTWFFI ILLTCYACLCKYCD 
RIXXKIUCOCHCLJCAJ01KFMVA3IAXAAFTt,XAXPYXYC3TXPUffUCrPF 
CKVFCLCLALVA I ICTSNAVNLTDCUOCLAACTMSFAALC F I FVAJJISST I P 1AOOVAYV 

LAA^vcAcxcFtwYwsFPAoutcryrcsTT.rfinr r asca vhlraec i lwiocvtvaxac 

WIWVWCRLRKKRLF1jCSPU4HHYE*^9CLPETKIVMRFWIFSFVCACIjGXAAVI>W 

CPn_0901 1033239 1034537

murD-Muramoylalanine-Glutamate Ligase

FO<R1URY5CCl>t£X0MC0RILILCTCITQ(SVARFLY00GHYLrCA£]NSLESLXSVX]HL 
HDRIXMCA5EFFENI0LVXR^PGIKPYHPW^AVSLKXPVNn , 0IOVAiJCTPEF0RYPSF 
CITCSNCM I I ILFLTHLLNTLCIPA I AMCN ILOHMOOPCVRVVCI SSFOLATQEE 
H r PALSCSVFLNFSRNHLOYHRNLXMYFOAKLR lOKCLRODKTFWVWEECSLQNSYOXYS 
EEIEEXU)KG0AlJCPrYLHDRDNYCAAYAi>NtVC>^ 

KKOCVHY I NOSK ATTVTAVEXALHAVCKDV t V I LQCKOKGCDFPALASVLSOTntHVXAM 
CECRQTIADAl^EXXPLTUKDLCEA^IAQTIAOECOTVT^POCA^FOOFQSFICEltaA 
YFKLLtREMOAVR 

CPn,0'»OJ li).WSfl7 1035241 

nlpD-Muramidase (invasin repeat family)

AVTOWWkl'TEVNMNR ROMV ITAV/VNA I LLVALFVT.^KR [CVKOYOECFPNFASSKVTOA 
WSEEKV t EKPWAEVPTR F I AKETLAAOFI ECK PV r VTT PP*/I , W:TETPEVPTVAVPPO 
PVRETVKESCW'VATWVVKKLXPLERrAPAJfl^ 

yOVJIMWTrVR.TANf'FIAfY JVVEnO.'riVri ALRNir IRLOnLLrKNtXXEYKARRUtpnD 
yfJUP *^ 



ftsW-Cell Division Protein FtsW

:3(0!:u*:mkwfv i : a u/ :i p:h* :l tKVFf/r::^A WLOf^urrrniKALt f rf vVTYMUxia/ 

AJ( J.YMM Wn I3FI Jt I : : I *Vf .1 ;a/J I ' .VF I T*'.U J If.W* :A«kW( * Jj\ ;r I gil : W-VK 

Yhvi*i vaj.yci .tki^i .yvkoucmm j;i.ta i i,k triLLlAl ef-u* u:aa7 i i i-w im 

T.Wkl J«YWl^|-t Vf.l A< k)Al AYPMIY'/l'YHLNVYUIPEtU IK/nOK/K/QAK I AAJ^) 
K|.f/5)t!IX ^KLTIi*l.r tVkt il<Y f AA i Y AKKFCFUMLVL I LLYmrr/Y» JflYAIAI KA;.* 
:;lboaau\hv tTt i v&QAnQti/7ArsrzLL?~rrr. . rrconzzzi, iakmccvtuxkv
wzbgks~i> ;crrfrrpmcp53u;k(z:ff3 

'Cpnjrcni inn)20 mrm*. 

murG-Peptidoglycan Transferase

rywxx i ricvalavocsoch ivpalsvkeafsr£ci dvo-llciclkwpslooc: 3YRE . 
p^PTvt^piKiMTRTi^LcscrrtJCAWc^ 

LFurEONr*vmrvti7r.rrR yapo rcvNFrrpvTKHFwc p aeevfi^kr.^^sphwkrct 

*nrT r T' ; -.v; : ■ • i-v.-ii -*•*!'■ *."vai;/k ": 'MWHiKv ::*k.:l\^>"' J iv"-** <**"'••■ 

DVLEDGTMI L£KO»TXKliVDCVrrALDSKNROC0RNSLAAYS0QR3TKTFHAriCCCL 
CPn_0905 1037400 1039835

murC/ddlA-Muramate-Ala Ligase & D-Ala-D-Ala Ligase

VHYKKGTPWFIGICGIGHSAIWUttLXJRarEVSG^ 

3SHV?*IWvVYSSSIAPDNVEYLTAIORSSRIJ^^ 

TSSLIRAIFO£AOKDPSYAIGCIAAieaJ«YSGSSKIFVAXAO 

NIDNEHLNNYAG2flI*fcVOVX0DFSRXvTD^^ 

HXVSYNOKAHOSHFSrrrOTEYODIELNLFGO*^^ 

EEClCrn>KArQCAI^LTCVYSACeSPBCSIIl£DOaOIWCSSrW 
YUU^XRIHD^V5LCACNmiCEAlja)FN?nO^ICLm 
Y I SPEFYIJVSYFIIHRQGI^RTCOTFWUErT^ 
FGEOCTIOGFFtlLOCPYAGPSLSIWUkTAJC^LTlUl^^ 

PELCIONLIETTSTPMIVKTAHLCSS ICX FLVRDKE£LQEKXSEAfT»YOTDVrVEESIttC 
SRKEVSCXQiSSSWYOIAaPNBtCaASGrcEYOEKVGITCIDCM 

\^ZLABRww«OGKCSAjiiorftjaanwi^EVNPipaff 

rVDHFIICALMXTDKOOTIEOATTKCODLVKR 
CPn_0906 1040514 1039915

CT763 hypothetical protein

Kl<CSEVX£LvNDSQLSREA5AFR£i3X DFFILMIVPFFWIFroilELCFFl.SXSOrNlJ'WE 
EFVAyXVMCVTNPEAVEIRSIECEEKES IKLEIRVAACDIGKI IGRRGNTIHALRTILR 

mresMJoqevoxDLVQPppTDWDODYXCOWDSSH S i irn fge setccsghchydepl 

N0EE0ESCNHHH5CEC5KHH 

CPn_0907 1040816 1040445
cutA-Periplasmic Divalent Cation Tolerance Protein CutA (C-
Type Cytochrome Biogenesis protein)

FAFSKTLI IKSSMTAvXItTSFTSeZSARSLARHUTEJuV^SCvW^^ 

CESE£HKIOIXSIDIRFSEICLAIOEFS(arEVPEVTAFPI£*C^^ 

PL5D 

CPn_0908 1041607 1040780

CT764 hypothetical protein 

IUVIUMII XIMNEIMXlWFrm^RaPOySl£YASXLXVLSSLVC^^ 
IJJ07NPSFIRNI^VSSTI^XVPPTAIA«JCLRX^AD^^ 
SLVl£KSPDNKCXTXFYTXX7rTXAYVGNRSKriXNX<ES5CTLCOPY 
. OLKMQKLPKZXKLFTK ILUCELAMES PKX IDLSLSDAYPGEJ IYTLSSGSUJILPIKTLD 
RAljDLYKKMKlCSPVIESHXOT^rrDUlFPNrLLLXAL 

CPn_0909 1041592 1041966

rsbV-Sigma Factor Regulator 
llSLXTTBPtJJSXJjanSSmCOlim 
NrTOVm5SACXIUXI^NFia.VQSLaCKMClXCVK£SvT^^ 
CLSKL 

CPn_0910 1041970 1043004

miaA-tRNA Pyrophosphate Transferase
FLYKLPFErEFNTTSSFrcDVeLOFOKLrrari^^ 

Wia«WILVOGSl3YFHAFLSGPPKGPAA^ 
YACTXTiraniKXXllGLEXXOLTGXICVSDHCro 
QHRCEAMLQEGXI£EVRGLIJWIRENPSArKAIGYROTEF^ 
SNSWHYTKXOKTWnCRY S I FRELPTLCLSS DA I AQK IAKDYU.YS 

CPn_0911 1044079 1042985

Fe-S cluster oxidoreductase

SUJAIFWKYFW^jCKRlSFtECLEUVSSPrnU^ 

PNYTNICKiarTTCAPYRKPKSPDAYU^FTJEVRSU^^ 

DYLEELVRITVOEFPSIHPHFFSAVtlEHACWSGISIEQGLO^^ 

LSERVRKXISPK10*3PGGWINUUCLAHUCFinTATKMFG^^ 

SCPGr/SFIPWSYKPGKTAUtRJfl/PQQASIETYYRIlJ^ 

LCAKAIiiYCADOFTSA'ILDCSVTIIUkTGWSIOSSCCEIQ^IIRSECFIPVERXrPfOHI^ 
TVSSL 

CPn_0912 1044120 1045760

CT768 hypothetical protein 

W IMDNSDNSFHTLCTBOCSFLNDCIAVCCVASTSST EX SDATLCFAEKKVAF X LNKKRE 

ALTCSSaJSDLRLFWDUUtOCXJaJWEIEXTTAK^^ 

FVVQOXDLAITCLEIOXLKFOECTEDKIFKDREZMFUSOAX^XHOAF^ 

SFSSKI IDUKELINVrtSTOtRLKSKFFORLSNUaOVrPKRKELIE^ 

AKYFlGSDKETUCKTVTTlJUCEXIQttLQHAAlOXLFVSS^ 

r R0E0CRLR WS AE3<SK£VRCKLAEVSSU.I ECNDL2KVR ICO IXC I SKK I RALDLTHODV 
triLKKEMQOLFDOLREJCODAAEHSYDBOLAKDICCVKKEAARSLAEn XTTFSKTCSECNXT 
SEME£W0TLKEU-CIO4SFIJ>PPEKISU)WLNl^^ 

UjtOOU 

i.'PnJMll I04570'» KMS945 

No robust homolog present in GenBank/EMBL as of 11/7/98
ttliJTYFR I EATD:;AXAMRRrClYAFDU)fn , U*IU»i^w;FYC/rU*Wi;U- .TfKTLPPiM 
YKKFVFKFFR: IFllPSt IR 

»lHJ)*»M lll4V>0'i KM*!"' 

No robust homolog present in GenBank/EMBL as of 11/7/98
vFPWDtiP!lKYY:T r VTti 1 4 J :.WPCOOLYEVAl«F\'.-rn.Tn: TOFYAWLEKLEEAFAOTTTO 

v ii >*r^ :t rip r vi ir i acqi * : I : ::;wy AixrYftouiiAEc/r t ykkclt* swkac i l; rt t k k i ho 
An;yrrr:;iif( i i jJi.rKLMUJEEXTVvRrw»LKKMArrfYWN i v 



yt*t§- io«.id uiic-rt.»n. t* nolo j ^ 

FMLKItSCTL^t tF/PKACrMQJP^F=UJCVAAlUrroKKJWr^ 1 .>JVI 
FVECSVNvWIWtttfttr^ 

RLEEtHXKFIVTCKUtAJ " ' — ~" " 
CPnJ)9t« t04»«l) *. 04 8084 

fabF-Acyl Carrier Protein Synthase

LU*7VWYM3KKRVr.T?^^^CLCNE\TTrYl»n^J^ tTSF PCXSYATRFAw 

'\. J!:! t 1 .^ ^. r ' . ■ ■: - • ; rrrvAr.v * " ?* :r*r-- * - "ATTrfi— r 

daaychLvscracm: iusotsaavnr : jL£^rxAWAi^EiwDAi-aoAj>Rw o R O > K Fv- 

LJBCACI LVLCnXSAUWDAP IFAIMLCSTVTCXJATHITAVRXXJCEGITACVLQAiJttA 
C IPKDWIYVNAHCTTrPLS2I^EVl>W 

WAXOAILTCXtJtPTXNUlNPXAEXEQFDVVAMCAOMXOVAMS^F^ 
RYVP 

CPn_0917 1048054 1048539

hydrolase/phosphatase homolog

FNDIItXVCTWe^mYSFGVIPIKFFCT 

KECTOCAACu^vrrrCXSvVNFFPR^XtcrsrrQJEXOv^^ 

PKKCX^OWSIOFnT.RIJ.SrPEWtTVEADKFXMJYLrSS 

CPn_0918 1049232 1048579

ppa-Inorganic Pyrophosphatase

EI4J4SKKI*.YVA*P*MSPTLTCDNYESLCCYIEnPYB 

NFCPCLYClXPQTYCCTASC^SCECTRillSXOCOKOPUnr^ 

rOCLRI I OSCEADOK 1 1 AVLEDDLVFAI I EDI SDCPCTVI-DKIOKYFLTYXATPKKLIICC 

SPAXIEIVGXYGiaCEAOKvTOLWtEim^YIGOTAtVN 

CPn_0919 1049375 1050430

ldh-Leucine Dehydrogenase

FMKYSU<FKE I K I DDYERVI EVTCSKVRLKAI X AX HQTAVGP ALQCVRASLYSS FEOACT 
DALftLARQtTYXAX ISNTCTOOCKSVX XLFOWSLTZX»UUFGOAWALECTYICACX) 

icysiwDXSivAJnTTVCciAxy^DPsmAiceririicCTAic^ 

OC XCSVCRRIJfiSIJTECAELYVADVIilRAVODAAilLYGAT XVPTEEXKALICDXFSFCA 
RG^IUg»gJuOLNCKAIVCVAI*CIJDSSACMHLH ILYCPOYLVKACCLLMVAAAI 
ECKVYAPXEVXiJCvZELPXVLSKLYW5lcrTCm.VA^ 

CPn_0920 1051423 1050431 

cysQ-Sulfite Synthesis/Diphosphate phosphatase

XLESISHKSEUWYONlVHvVTXrrTOLLNrltt^ 

UCQOLAJCAFPNXmCE£rLYPDQONEKXPCXlJCrnuU.TSS^^ 

LfVLVDPX DGTACFIRHRAFAVAX SLXYEYRPXt^vKACPAY>QTF1XYSAAJ(SHBLCXV 

HSCKlJRRFvYACR K tfrKQ FCEA SX^AAIJ^OHKATl^^ 

AEOAVDFFIRYPFI DSPARAHDHVPGAFLVXIAOGRvTBAlCA 

UVSQDQETKETTtMUJNQllfVVPTDXLXAL 

CPn_0921 1051526 1052293

sn-glycerol-3-P Acyltransferase
CEXi(LXKLWRATYEQ(rTFLvCAXXICXJCYTOe^^ 

ASPJUJTOESXXLYPSCRURTGXEZXVNOYSAYVX^LKRVKEOfm 

t MJ t b l PAL SPATKCArRAUJmGX FFMPIQXFVXXTLCQVDKl^lJCOFVTItOiaJKrriAS 

WPMQCDOKLPXEVPYA 

CPn_0922 1052266 1053927

aas-Acylglycerophosphoethanolamine Acyltransferase

OFAHRSSLRITRKLRRKHDORWUSHNNWnJUJIPGSTXXEAFX. X LCSEHEXCXACFCOfL 

CSLSYRELRNAI IAVAIKVSKFSEX»vCV>WASI(UriAYrCXLLAUIll I'WHeWBOCr 

RCLIUCTKTVCVRRvLTSC»FXIOXLTEVQCFVTYTFOLM^ 

KCSVPWUJlIFCVSCVlSDOTAVIt^SCTEXLPlCAVPLT«KNU^ 

DVMLJ^PPWYGrKSCCIJ-PLlJtSvWVF ASNPLWPKKLVrrXD DKJ tVT f f OS I PV FF 

CrflUCTAJCXQNSClXSLRLVvTGCDAlJCiriXYErr^^ 

TKESPRK5ECVGK7IECKDVLX X5KCTHX PVSSCEOCLrVVRCHSVTSCYIjCNKEHOSFV 
SUXIOWYLTCOLOTICPSCDLFLECRI^RFVXIGCEMVSLEAl^ 
CSLVVCGircDXVRiriJTTlJ\TTXHE^^IL»CSACTSSIVKI^ 
DYVSLNALAV5LFG 

CPn_0923 1053966 1055093

bioF_1-Oxononanoate Synthase_1

VCKESFLTODVIOFVTNDFLCFAWPTIYCEVSKRFOIHCOOFPHEXI^IRGSRX^ 
SSVIDOUESKIASYKCAPNAFIVNSCYKW^LCHHVSRSTDVlXWDEEVWt^^ 
ISCOHHTFHHNNLEHLESLLOCYRISSKCR IF I FVSSVYSFRCTLAFLBOI XAXSKXYHA 
KLXVDEAHAHCIFCCXKCIXMAlJCYENnrAVLvTrXaCA^ 

SPPLRYSTSl^PKTLISXCTAYOFlASISEXARXOvriCiJCEHFHECFDSHAPOCVOPXFL 

PHTCLEXAISVLETPCIMWVVAFAKHPFLRv>rtJlAYMIVDEVN 

VHXNHEFHLWRXLCCH 

CPn_0924 1057301 1055028

priA-Primosomal Protein N'

KRFTAKTKSHGYIESSTFRLYAEVtvGSNIKKVX^CVPENI^HXTKCTAVTXStJtQClQC 
'/WIYOI'HTTOCKKILPXLCLSOSEIvX.PCDliOLLFWXSOYYFAPLCICTIJtl^l.PAIS 
CTVttOPKOHYRWUCCSlUMCTKEXLAKLEV^ 

SOSPlH3l£KLCILDtVDAAOLEL0EDLLTFF PPAPKDLHPEO0SA XOK I F5SLRTSQFM 
THLLFCITCSCKTEIYUUTSEAXJtOCKSTXLLvTEIALr/OTVSLFKARro 
KL5D5DK3RmRQA>'EGSLR I LICPRSALFCPMXNLCLI IVDEEHD PAY KOTOS PPCYHA 
h OVAVM RCK LAKAT\Mjt AT Pr; LCJYTN AL5HKYVLS R LS3 RAAAAM PAX I S L IWeiLE 
P.EK5KTF. X LFCOPVLK K tAERLCVTIEOVL t FFNRRCYHTtlVSCTVCKKrLKCPHCtJMVLT 
FHKYAf r/LLCHLOt? J PKDLry'CPKCLCTMTLOYRC 3CTEK I EK I LW I FFO IRT tRI D 
Z DTTK FfWIIETLLROFA'Ti IKAOVL I CTQM I AKHMtFS AVTLAV I UKWCLYI POWAS 
EOVFOL tTWVniAiRja 11,11 ;r.(LlC>:.*FI.PPIirT IHSAMP^DYSAFYTOEtTCREtCEYP 
f-FIRL IK I Ftra'rev/mKKAIIRVHHI tKa.>lXiTNPLMr' , /rPCtTHFKIKDTFRYOFtI 
J^SAYV I PVNK KUtHALKLAKL; 1 1 F VK KM t WOt tfTTFF 

'.'Pfi.H'/iS /'»('. I't*»:.!2'. 

• ,T77't fiyrwtr. Itt «f i , •. » I | m ■ •• i m 

KUWLFMOL'^iNFl (FT! i *OU /* YHEFJ.Y Ml -Mil ANVTLrf IT A I :*J\.*."/:::: I TEKAVEVPN 
MSPUI'ITPPPf TriUVKRTKkMjWn Vt'lJU'DU^NAI LKEKYPALFt/MUAPK IPCS I 
KVYKR 0IKEVI .FFNR I AK 1 1 ;tw,| .KITKt.Tl. I UAFTN t TA frlPMFFLAJ AI'I W I NYK I P 
TTOYl^LTV rv: I Kl.fUV: :. :i .K*/RKr/;oi Ji KHLWA ILHRLPFAYTlf::.: 
>:Vr} O'tJ- IWM'i 1W5S7

Thioredoxin Disulfide Isomerase

POKNHOPEEQROKWEUCAOYKVTCFFEimC^ 
KLR 

::;r-.-«T::»r:.; K;.::.;Hr^:n^::^»MWi r "? tf " .. ; *Jirw 

FnSIIUXPLUXWVLWTTOFFaP^ 

SAADHWIOYDDLWOSIAIKX PHAI^WLYSOGWSGUTOILFCTO 
SNIAVFWPGIHSSKGEAJCRWLVOT 

reinvNSNHDOEUsasLTBio^xrpnzLPEVicrsoKiPxra^^ 

VDRLAAVlSir/LOSEMUCSOQPO 

CPn_0928 1061035 1059884

CHLPS 43 kDa protein homolog_3

IWKDTAFTlJJMaSDILSCXFSNPHPVOT 
FXLUCIJXmfilYWLCOLVCSJUXWSSHLY^ 

sFXDPA\^iomiooraTirmjUHFsrARPi^ 

SWKEIJUai£ANIIJYWPGVKSSTGKI2nJ2^^ 
YSIZ*W0SAAI*104PFTNSETSWVAVKI»APHSIJ>A^ 
EKNSR£LPCPCILVYSADRP1lPSEVCODTALWErTtAHAXKirrW 
SSPUCHPTIQKlAIAILCStSRKN 

CPn_0929 1062301 1061186 

♦CHLPS 43 kDa protein homology 

DCPMAPIHCSNAFVEDXI*KSHPSPOATYFSSTWOT 

LICLIILPLGIYWl£OTLCTHSIU>Sia«^ 

SMWOTXWr*OTJDTl£ICUOA$Tl«W 

ANXLVYKYPGVMSSTGS5SUCDLASAHNXCI1tyXJQ3KIOCPGMCEX XTYCTSLTCUOAE 

LEXPLYPTDSLRitSTVTOmjtfa.TUUUIKNSPY^ 
VALATPtUUCLS 

CPn_0930 1062851 1063330

No robust homolog present in GenBank/EMBL as of 11/7/98
WXXSEIJ^ STGLQMVPHTOVHHALCTRJtVI LT2 AACLSLIASIVLVnT fiRMTT PSLTO 
VIGCMILILFSSI ALXYtYXJCTREVIXJXALEPt-PEM ISKDQSX IDFVKTRDYASLEKXAT 
FAYTHTHYYIXXSMVFYXriFltFMXUCTlJas^^ 

CPn_0931 1064078 1065718

lysS-Lysyl tRNA Synthetase

IDIWIJ3WICSDIYTNILEEIOTAI*£YIJ>^^ 
CEDXtaeTFASOELCraEMKSRCTPRWAGro 

EFTSVHGLSSDAEXTFXKf XEKKIJIXgXlCIIXnfLFTTHSCELTVl«VtIVT T trKSTJ . fi 

LFD*XAGX£DKEVRYlUCFJrfULXSSR£VSGTfW(R5YXX& 

tmmAEXKPfTTTMEALHSBtnJtTSLETMJTTT^^ 

POTmEAYAAYKDYXEVK/TVTXLVEHLVRAVNH^ 

MTHXESttTYAGIXJVraraSDOXLmiJOOTTITETAW 

APKHITWPVETTPirmJlSCOWVERFESFa^^ 

TIO0aJJDStCHPXOE£n^M£QGHPPAa^tn:GVDU 

FOAEXm 

CPn_0932 1067160 1065721

cysS-Cysteinyl tRNA Synthetase

VXSETVMAFSH I EGLYFl^ASOXXEIZFPNHTPVRLYTCGPTVYDYAH IGNFRTYVFED 
IUOT7LVFFGYSVTK\10JITDVEDKTIACASKJWIPUJt.^ rOPXTEAFTEDLOTUJIAilA 
DrfPHATHYIPQMIQArnOXCQCIAYrCglUSVYrSUqJTWY^ 
ISADinriXENPSDryLWXAYNPERO^tYWCSPTCICCRP^^ DOHCLLSD5LOIH 
AGGVWIFPHKCNEXAQStALSGKPFAKYULKSEHIXIOGKX^^ 
FTtnCVHYMLLOSfffRTOIi^FTECAUACIQiUiUUXDFV^ 

SSOF IEATSRAIJWDLMVSTGFASLFCPVKEIWn. IOOOHFSKADSLYILOTUOCVmVL 
GVXJLTTSVCXPETVHOLVAEREEARJCT^^ 

CPn_0933 1067532 1068578

predicted disulfide bond isomerase
PVIII^IKRCSLKQIJC^TIXLSLSLJTLEAAD^^ 

pu.vxrsc»n^pCKxiRXEVixspeFiKRvoGxrvc/Evm 

KSKFKINH.PCMILXSHEEREIYRIGSFaaTON^ 
ftUJACDPfWEKOTHrTVAXJEFOEIJUaiSJU^^ 

CPn_0934 1068948 1068526

rnpA-Ribonuclease P Protein Component

YFVHPLTt^KOSRVUWKOFLYITWOFCCRCSOATT-r/VPSRHP^ 

AH ER1CFKR WREVFRHVRHQLPNCOXWFFKGHKQP.PVFSKLLQDF t NO tPCCLHRLCK 

TKATTOOECTP KSEKCVTAPR 

CPn_0935 1069100 1068957

rl34-L34 Ribosomal Protein

E^TWm0PSKRICRRN3VCFRTRMATRNCRKLLNRPPRHC»HSLVDL 

CHUMJ* 10*9)10 1069470 

iTn.-Lj'. RiUwcaul Protein 

flXXVCZl IVKA&P jKJDKLVRRKGRLYVT KK KDPNFKORQAGPARKX 

rs14-S14 Ribosomal Protein

'/KRMAKK:: : :VAI<KAKRltRLVEANPKKR;;D^ 

TI'UMW '1.1.71 WIHLTYIJtKFAUR H:PHgMAJKlErr-T/IKA;*f 

•»•»!_»•• i« nrrov.'i u>b'iH4' 

'T7«n lryi*«r !«••! i.-.i) piiT.:in •tdftutrr ntfn iid- (v-f ipl.inalc 1 

■..t I ni rri jfmi.i ■ ! .•: uj.kyv i u»:u:ay t awkfpi r/ if wffa» :akf.;p fXWLLLL 
it:Ki<riA( j >: w*c;iu.Foot.KK.*;LAr wdf. t r-j.^x-Lyi; i v i utfjcwfylnkorcnv 



CPn.09Ji to*. ' 107 U-JS 

CT770 hypothetical protein

lUl^tM^TWWITOtTSm&llXB 

DFUXNSIC 

rpn.»»4n n7ifn* 107120* 

" .vr ■ Kxmum*: * • "'ifi:::- ** .... 

v— ' : i.vmp : rr " ■* •*: . : ' *: ■ r* " yr ' ^' • "•' • • * • • ff « - ^ tE^=-"* 
KipruoticTAStcrxv/sKnwj^Ji^xK^^ 

RTtAIimAWQOVCXFH 

SS2Sfiloyy!w 

WYRNAKAYAATTLPSSTLPYODFON IIJWSOYPYR X ITYPWO eyAMATCVYjVr OW 
C*l WHKR1I b XDS8XTQ WOl > t T rf^ /LLWFHSLTTAI.PO«XVVlX23rrifYiacnC»X 10 

lOTrwaJcKALrEOTOPGiCEVxiuau^ 

UJmOKDFHICSD 

CPn_0941 1075504 1073018

mutS-DNA Mismatch Repair

SCXFVSTVDTTM^CXCnCVAVA^FCCPAXCXCSXKXCPHAMXOR 

Stotiocru^iuspfyhpiceilv^^ 

TTVTTOlJWPRDICrrU^FSAGAOrYlDtASATLPlTFID 

icn^iJtvsimpvomrmuatLPJff«DtsoEH 
mriBvsscrAfoxmFXiutosiujiAEiLrm 

r-eirrtrff J pgrLALflOSLADLDYI XSI^kPLAKACXTfCRPHVUHSCffLCIYRGCKTVAKTl. 
VDTGKFIPfOTOCRCSOTI^lXTOPrW^^ 

XUianttlGMi m CK ' * Mtf P*^^AMILHMATDRSLVILnEV Um S 1 I P CL AIAQ 

AVVE7Wni0aAXTtJATHY1CILTTX£X«CPn^^ 

QsmXHVMU>CrTbCWSRWXIJtQX^PE5mPA0fN04Q0L^ 

CPn_0942 X075M5 1077754

dnaG/priM-DNA Primase

MCSiriiJtrAKrrKESiXNLimsiBivDiasiMXKij^^ 

PAGAKYlCFGC»UCDAXCPlHOHX^rsrm 
OlJMIIBEAErFTTrrCLYHLPtARHAlim-YKRCrePtrriDR 
CRXXSOmWAGrrGMKWFVAiaiXXFPVKIMXXXHrXCrSA^ 
PXn0CS3tXLn027S1UUUAJCEiaCVXLVEC0AIXtOfXDSCFNC^^ 

XAtZmSCCYXTFLXSDSISSYPmFltlXALLVEKA^ 

le^lIKVl^UWRJVTAEWIPIKOIO^KIHPH^ 

PTTVPlXFWPECRXX^ArMISYYlXYWaWTOEACOVt^DS^ 

TirVOSt/JWMArBP^fgC*^^ 5 ^^^ 0 "^^ 

CPn_0943 1077972 1078238

CT794.1 hypothetical protein 

pn«Srm^Fl^Il£CQiIX£SPRSRAISVTO 

cvcass ixjfflwqrroowLiCTmi jtfWEVM 

CPn_0944 1078503 1078997 

No robust homolog present in Genebank/EMBL as of 11/7/98
IKX>ae4RYTIPIXAtXtFSPElAllA£WPSENJUCOGWrro 
IEEOCTCtLVrFEnUT'PEFADLTfCSFSLSTPIAXCFIArvvi.Cre 
ILYWSFU?ffPr^VSCPPXCYILIDC0OCyWOAVU>LCTTO 

CPn_0945 1079001 1079660

CT795 hypothetical protein

S IP»IKILPSYFCHNPI301JUDm!RIAI^U5LLMI FPI PCEESRPCSCOaiSWTOEIVC 
SQCrovCLYHSYEOGLQASR I ECKPLV IWLCKSC DDCOACT ICLS £TCtIVLSVL8CS I 
FSnJWVVLVFSCVNPLIYPPIEDPItA£IVKFKELFirOESrPTCLSI IWCVTPECPC 
. DXXEVSPVSLTVEE2ITtJracnrrrCV£STSCL0SEDPAXA 

CPn_0946 1082816 1079745 

glyQ-Glycyl tRNA Synthetase

CSOOlOCKCYTLESrVSDIPtTX^SMXATXtJlFVSEOCCVXHOCY^ 
ALSPEPYKAAWEPSRAPO0CRYCVKPNRL0NYKQLOVX UCFVPENrLSLYTCSLRAXCL 
DLIIimiRFXHDOMIHITICAIiCUWCVWLN» 
ERIAKYljOltKXSIYWLHNOTlTYOOITOASEKAWSEYNFOT 

RTUQCLSVPAYDFVXKASHAFN X LOARCT X 5VTERTRY X AR 1R0LTRLVADSYVEURAS 
LNYPLL5LSSTSEPXETSESVVPMrSSTEDU.LEICSEELPATrVP ICIOQLESLARCVL 
TWfIVYEGl£Vrt£SPPJUJU£VKNVAPEW0KAFOW^ 

ASOCVDISKYODI^RHASLAIRT^SEYLFLU^PEIRIJn-ADILMOELPLLIORIIKFPK 

rMVWQNSCVEYARPIRMLVALYCEH X LPITLCTX I ASRNSFCHROLDPRKISISSPQDYV 

rrU(0ACVNVS0KCRJWXXEI»LIUU155OTXSAXPLPRLXE£ATF^ 

FCAU>XElXIAD<VNK)KYFPTHCTSSCAISNFFIV\^0«PfCTIIE^^ 

OEFI^OTLOTPLTrFXDCUCSVTYFEAliaLYDKVCRUCAKORVFS^ 

AlOYCKADLVSAWNEFPELOC IHCCYYUtHANLPTASAVAVnEKLRH tTMOQXLSTlCT 

UJ^XDRIJNLLACFXlAUmSHOr/AI^OSLeVLTL^ 

FPCTIEEXVWDXSKTIHEILEFIWRLirrrHQSLErRXDElAAVLtDSATKNPtEtLDTA 

EAWlAXEEHTXKLAVITrTHNRLKKtt^SSLKUJ^ 

PKET^HAFLEYFU1LADUWtOOFUmnitANOlX>lR>ILRW 

3VAV 

TPTU , »* , 47 IOMI4M l'>H40V> 

TJIRVCLPNY I TF UR1 ^ ITP t VH I LYL* ' IFWPT IT PWf .1 'T/t XA1 J -\ [ HES .TDA I DLT\VA 
RKFW/TOL^KU^I-HAIV t YH r.'I lYLTPTJI'l*VTH.r , LX*LVF t Fl AnU*Vl;?rUm.VAF 
t*:RV»/AARA: it A . r USM I Hi: ilt il.U JtK I FA: VTVrt I IAVY;: ( A^* 

1 1 EYFW«Nfl.*WHAirTKl>::mfl)Kr;FD 

'.'HtJCMH lim\4ht Hii«4i>4'/ 

' ;ei*ei I vuvAvicPTi*! vkw i tuuMK n »:ki u^kowi ww/u-niv i! . i :*h*:;nwutE 
p: rrr/EFLcacuOA: ;a i ; :y;:y n :l .tut l i ti it ^ i f.lf:,tt: ?rt ::knnwhk: waaaaaa
YU^AOrAUtVltLKDWHVnUACLLKNPLIIPVK: ? IMfFGYRGYCSTOLLAASOXO
rMLJIIYOU'ROrOTSVLMKCALYCSDV'tTTVSU * *"OEI INDT3DYILK0AIIARKSVT 

..;;:w;tDEDVWNP»nT)pALAVOYDAjLL2Syt^rriO(£n^ 

TVIjR [VEEKGPEFWTEt tLHAMOtSYAF ILCCTSONCVUNEPRNL00CUS5PKXRLZ 
LOFKDPUPLTY AAADMXC I PSHREAffiLTCLIAWYCTVPLVRITOa-CTViroVNGF 
TrronwFNCTPAMI^VTTYRqEPDWLWUtSCJa^ 



-•i<n_n>M>i 



(•fit '.'Hi 



-v, :,v. y irrn Mi::.r.-r: > rr^K:!!*: >v * r- r y; - : t r.vr/v. \\ r. ■: :t.7VuY.*? 
KKPL5NU5GAUSTVFSLSYECRI Z KALVXDZQYO irTYDVIHLDF EELVEDRPIKLNI 
PIRCIfttVDCIC^KLGGStJU3VIRAVRV\ra^ 
ACirTITPLMVAITVSRP 

CPn_0950 1086470 1087027

pth-Peptidyl tRNA Hydrolase

PStXMtAttlVAIGNFRKCYAlfllQaaGrLL^^ 

SSCPLVriKtTTFVNt^CKAVVLAlCXyrNVU^HZL^^ 

NOiXSZTASLGSNEYiejlAFWraPt^^ 

EMCSRF 

CPn_0951 1087113 1087457

rs6-S6 Ribosomal Protein
£FL«iaCENOLYB»YVFSVTLSEEAfUUCALDm 

rgaregyyyfzttsvspgajtezjoeyhlnezj^^ 

CPn_0952 1087469 1087723

rs18-S18 Ribosomal Protein
GEte*OCPVH»JE«UUCRFireKCPFVS^^ 
RFOCVl^OAIKRARKLSLLPrVCED 

CPn_0953 1087727 1088248

rl9-L9 Ribosomal Protein

R£XOAAAD*ADSZJUAOJUJ03IVlJrW^^ 

FPHAKyAixNuaonfiPXJczjc£evTATtxvE\rr^^ 

CPn_0954 1088259 1088708

ychB-Predicted Kinase

GnCVCYKDIWPSPAJOJafXJCXNCaatFZM^ 

NVNmSPSNLZWCSI«F1WErOX*J0PVSWHZ^ 

FQTHlPlTTL01>JAR£rGSDVPFTTL0BOT 

CPn_0955 1088612 1089175 

(frame-shift with 0954)

RATIWJfSYmiATLCSRNRKRCSrFI^SCTAIjaKGRGEHrJ 
GrFTtXAYQSXZJCOTSTGMIKACrVCI^ 

uiscaGjm jvcYixgOTSicvswntsi^KCTO 

LECFOPQI 

CPn_0956 1089545 1090909

CT805 hypothetical protein 
LWraziJPOTSZ«GAAVLFTCSXZJ!rFLT1WYTLOT 

XVYTAELVl^IAXLClCnPlCSIMmmAPPlXSCIXKEI^^ 
SUtfAYATNOIJSNISIGGLTSYVSSWtfAIFPAZ^ 
TTYyriPRKETKKmZPSOlOPSVDeVPIMZICVHZir^lZ 
ISFWFTIFYQDPZNLSKWYTOJTrAGL^ 

rriJtALvwyLVHm^ATtCTnftvvAfa^^ 

»CHUlJCAU75XZ5ZjC7TWJ0WPm^Cm 

CPn_0957 1093812 1090963

ide/ptr-Insulinase family/Protease III
Kxrnaxn«rwKU£FxucTSt£mcEQomm 

GLPIXIISDPNLPTSCAALLVrn^ADPEgYTOaHrrEHCVr^ Kif L 

SBMBVHKAFTrpMCrVFVrSVEHSArSD^ 
AHPX^IXaUtVHRZOOLVAPCCHPCAim^GNXSTLTPVTTEl^^ 
YTSAPI^KAKKOFSKirSOIPRSKNyD^CPrLPSCDTSSUCNLYrNOAIOP^ 

ELTSCOKHYSGV ZDSTTt}YIJlYIQEHGZP^nrTLCCXSTZNAZ/fyCYS^XSPLFDIJJCXO 
IVSLCaffiDLSTYPYHSLVYPKYSSrnrBAT T MLVSDPEOARFVLSSKNSPWECATQLKD 
P I rDKTYYVKALOCVODVCKVOSLKP I ALITCPNLTI PKtVTLPCVHLLKXOEFPFAPALS 
YOlXtaTLYKCEpHTTTAPKLS SO Z RIRSPQ I SPSS POFLVATELYCLAVNDGLLRETf P 
ATOACLSFTSALOCOC I DLRVSCVTTTVPALLNS ILTSLPNLCZSYETFLVYKXOLLELY 
QGAUJCFVR5SLDa^S0VHKtrrSNTTKXJUU£KU^ 

^^SOOWCDttJJCflVFTASRSSHATKPFYYELCJSOEZSE ZHKCnfPLTANQtLZXLODK 
SSPSICGKVCAEMLTEWLHHZWElXinTOLCW^^ 
ECLLAKTXFUlKVSASPEXPCZSQEKFANlRlCAyZNKZl^D(5Lm^^ 
PFVEr3TPDLKIAIAETLTYEEPXXYC0CFl^ELCTQTSVYIRCT0)CrS 

CPn_0958 1094803 1093793

plsB-Glycerol-3-P Acyltransferase

t YRA rVMOFSRYLRY APDNOYLPEPLYOKFSVFHONYI DAATKKAAWAEVLCLOWVKV 
I lEDLHWPFIFPPYHKKIRAPZDLFRLS IDFFSLVIDOKNSRtLNUfflUtEIEEYIARCD 
fnAOXANH0T£rOPOUfnfALCICTOPELMEI«IFVACDRV^ 

KRH IATPPnjl£D<LUC^KSKOItJCTIiNBOCKFI'r/APACCRORKNABCRLYPSEFSP 

EfltEVFPaa-AKASNOTTHrYPFAUrrYWLPPPP^^ 

n>ALC^K£ELIIICDKKAORTLRAEKVPSTVKNLYtEL 

':iTi_»i'.i n»*j*ii7»i io*.»479*» 

cafE-Axial Filament Protein

AU*ft^r';TRKVMENEXLliMIE^EXRYAHLKIGQLPC4VrtERK^^ 
U<N I0TAF I N r DEREM IF t H ICT r IJ^KKFEOMFOKDVQALPEEASEAPU^SEEAPtE 
KFI JtLU;rvi,VyWKErili:iCCAJILTCN IS I KWYLVUPNCPtlRCWmiCZEOPlCMREOL 
ryL I k." KRN I \)f #f X f I rRTALTTAl?TEAL (NEAHOLLLTWKTILEKFYSTEOPCLLYSET 
M LKPA a /r TC t UKNYKH U* ( DUYATYQXCW MLKKYZPDAj* I K I EYYROS t PMFERFM tE 
KB I UKATKHK IWL'^ilYLFTOKTGAMlIT IDVR'X3»J7TQL£3CVEETLVQ IMLEAAEE IA 
KJU'LWWaH.vt [DP [ 0KK:7R KNQRRVLER LK DH HJCYQAARCT [ L2MS EFO LVQ1TRQR 
rmEra IK/rm , U:PVi?:XllA t tKTPESW tErEROLriCVIMIKOGHLCLVVHPEIASYM 
KUWIU^IhJIIMLAKOr^Kl^rm^WlIUniYOPPCLrTCESIDL 



c"T107 hypochetic.il pi~.r*tn 

fff^or gvpg^/Tn f^t^rr JtLKlJtireLGLrf WgN>ffTCfl3^mtflIVl»<Dr 
VCSCVfTCTPLrRQEU^OTFEECSCO^PEWCNtW^ 

epnj)9-ii io»7i<w io-»7:it 

rl32-L32 Ribosomal Protein



CPn_0962 1097301 1099275

plsX-FA/Phospholipid Synthesis Protein

t(^0PNEV0ZCXDUO3HSPLV>A«VLVZMJCSQSSTZ PT APTtrAStZXIUOOtOCXrZ 

St>UOEKFWCXXSAI*r/AKtDSttJUtflU^ 

TZARAKZPLrPAVSRPAU.VCVP^iaiCHAVZUVGAMX5V^ 

DSKmZGLlJIZGStXlUCBTEAiOOTrRltZJUrrFCEAF^ 

TGNXFUCTAICVTEnOftIt£QXX£ADXORJUJ7rTF^ 

LTICILCSXKUOARtnfRILSMLX 

CPn_0963 1098374 1103224

pmp_21-Putative Outer Membrane Protein
TpuiFKVAHVAKrTVRSYtssrsHsvrvAii^iAFEAH^ j<tirn .ncvnacpmts 

AHVtEAOTSVUCCSOPVNPSOXESEKVLYTOVPLTOCS SGESLDtADANFLEKFOHLTEI 

TTVFGXOOXLVHSXJLOTRKFSOPTOJVlTrSIU^IXZSSD 

EVSSXLPKSPRAVAAISCZ^I5IinSMtt?PLOCLArrY10m 

SGSaKSCbCrZ)aJCAFKSGAAVYS?U 

OSZTVTLTDCATCIJttJAlJtf^VKZjW^ 

LZALCXIlZCLTOrvacaUAAO^TZJQ^VV^^ 

VAFKQHTAAI>TQC3Al<SAMDICVI ZAtWFCEZlJXONEVRlMX^rrCOCIUMPIClJXJXZ^ 

iviKroiscArrFiJoaASVLcvKroAECT 
vuxiANHsvtrrcNGsacraaAvcAUfxsv^^ 

TICCNOOfXAFKZNFVrCSEMORSOQSAI ZANSSAWZXKMmZLrVSNSTOaYOBUrV 
CSLVASBSSNPTm.TrPOiSCDZIJA)OiSTOTAA£LSOaSFOO^^ 

vsnfpi Ws ojfcCVDZJtfajrv cLCAr ccori^ir^ ^ 

VQDp3niTODATfYglfriRflI»PBPVSPL3APSIJr^ 

KZPOZAAIQIOTIAL SQWAH WlJtfff JfQrnaSIVI^ACfiZLRrrosOVDCTAPtfTlltt 
EETLVSACVOrjMgSFTPtnaeJkViyrf^^ 

CTKLHSWtt^ZZDPWWIlWALLSSHIQ)rW.ISW PfFTTD 
VSXWITPATY<afrCVW5 « IO tEOW VVO*3PTG1^^ 
IJC0EIFAHHrZA0JWEZ4)F8TNVIQSC3jCVVEZC0NZCE^ 
tgLB2QL'r5QrTGKTE5QSTfHUWiynS 

YOTLBXCTcuwxGKOTiAonzimnrzviipiutrzsAi^^ 

UlA»KllUAlRftW AlTroAIJHAYS1ICS1UW4SVQ 
AYSMCSYCVDXPCXAllKAlUAlNmmSYLST^^ 

CPn_0964 1104612 1103301

No robust homolog present in Genebank/EMBL as of 11/7/98

OSILESllTYTYUt«gtt«HS*glSZJ^ACLlAKYt^ 

IVAtaJUJIfSlJPKXIVEI^CITlWNCVIOPFXFA 
KAPU>Nmj^VM8PQES0IJ«RICZI«CSQl^ 
EHDOTmaCTTPItZlOilEALTCKSSIJTrPSTXDCHQAA^ 
RSIllRl.YSIAIffiC0N0IXZWVDtrTOIXKSXOKOOUEYR^ 
W«ESAAKL(>IWVXICTtRTrHaoaiCL VUIHJOITfTmXl AAI 



FUamil5(»WMSVF1CftKPCADPCTIU^ 
DPrnmXACXAYLLYVLKXXtX. 

CPn_0965 1106769 1104925 

lpxB-Lipid A Disaccharide Synthase

K0FSi r sinffai«ZP5cn < wu.YPLcvLASifras^ 
iGATi^rvKCTrosoFPVTvucviNLrrYijwi^rrssRPisF^ 

rLYVNHD#tA£PNXFHI^PFAOL5WHXJGCX£LAI FSCRFLZOHFYZESNKTItDFtUf 

WKXGZiaSLtALVYTIRZCOPZNZZOGCaLJ'PSZANUUrrXIXJRJT^ 

EASCOZU3C10JQ5IKSLYPNZRFUCVGCPAMROB;LOPXLKMEZrCVS(7A 

LYRJOTIUCZUCrZLKKXPATLZFZDFPD^KUZiaaJUCHQrRGK^ 

KRZI^OHt^KLlXll^FTECLFIOtfrSIXrVYl^HPLVE^ Z 

VAAmSRRCOZSR^Za^^AFUSSLSQTKOFWSSSSAKmZZEtTr^^ 

I Z PWtFRYELHRSCOCALAXCCT IVLETAUCTPT ZVMCRLR PFCTFLAKY I FKXLZ«PAY 

8LPNZ ZWSVIFPEFZaiXlC0Fl(PE£ZATAU3UiOKGSXtX0KIXCR^ 

SEEfXJCR I FDTLPAV 

CPn_0966 1108055 1106749

pcnB_2-PolyA Polymerase

ULXTZ ZNVTENNZLiSGRUIXLIJCXKSNZTLTPTZYSVSKHNZKUCDFSPHAUVZICn 
AGYZAY ZVCCCZROUAfnTPKOFOZCTSAKPEEIKAZFIOC ZLVCKRFRlJUlZRrSltQI 
IEVSTFRSCSTOEC^tTXONUJCTPEEin/UlROrTtHCLFYOPEHEEZ IOYTQCVKDLA 
NRYLRT ICOPFTRFK0OFVRMLAUJC X LSRSP FTVETQTOEALZ ACR0EZ*Z KS50ARVFE 
ELZKMLNSCAAKNFFOLL t ENHLtEZ LFT>YI«ICAFRLNRALEK7rATTZJUUXOX ZUCKE 
AEYORMOLMAlFlJPLVNmmYKHOKHPYL^LT^VTirrrKNFlXOF^ 
ILTALIU>WYRLT^LIPTKKAI^FTWCULMWT11FIXKLGLLEIRSIVY 
RKKQTUCCKKO^HSVK 

CPn.0^67 MOH4 31 110-*8H', 

mrnA/pga-Ptiottprvwiiitmnut.ir.rt 

FTAYKPAF ICACRSEK IRR tC :OFRRI#W:WftKLPrnv,VIU:iWIFErfm , ETrVU£K 

AVARVUREnRiaimmw^KDiULJxrmFoiAi .r mzuzm ; t etlvuip t ftpcvafztr 

AYRADpV ;i M I A\;*4INni'RDNi: I K I FJTUJ^K I ETW:;r.ADFi iTLTEDHAVOK 

NKRVtDAMf«WEPVKArprKf^lJCCUJ(I^U:AIRyC , :'nCVAhWFErJiUEV[CW 
PTC IN t NEJIL1 iALKI \n' U'KAVf H«MI !U; I Al X/ I i MVDOCcai CVTkllWI LfftCA 

GDU(Kft:;ALn WKWAT JTfl JT4*^ 

:«»H CFL0YMTt\ : 1 V: \M Jl LH t F.'IKj *4( .';DLTAI- 1 »/K: IITTTI. INVAVRFJt I PLET 
I PLtERTIALMXlAI i !| :h 1 1 J J#Y: » nTN n *H'/MVBrairmiovt> •i-\kau\wi r»AEu; 
TCtSRK t „ 

CTfi.t> ,M 'H 1 1 criHwt 1 1 u /;j ( 

mLk: HMw.nM.mui.' Ki «.f v. ■ Aii*if«*pi i lor.m.- 
omen [ FOYuawncvsi vlegiakleyrcvds*. > . vcoelf irktvcrvoclsw-''

OERTirTA^V rCHTWATKCVPTE IKAH PMVOTCJliL: AWKNC 1 1 ENrTCELRRXLTAOC I 
3FA3DTD3E ( tV0LF3LYY0eS0DLVrSPCgTlJ«UlGWACAl.XHiaWPHTILCAfi0eS 

puifiLacEOTiAsosRAFFirtTirjisg^^ 

VACG33YHACYLAXYX ItSLVSTPVHICVASCPTmWPY IGKOTLC I LLSOSGETAXTTLA 
ALKElJlR*mAYU>CICWPESAIALCVDHCLFL£AG^ 

KuwwrALTHAcorsFnocu)5a,poLcoKi^iE3uewww 

\: •;vr , ;rA!Arri*^. f i/i;«v;.:rv. .v«"\^t:\^iiayw: .vc-METy 

PRNLAJtSVTVE 

CPn_0969 1111803 1112999 

tyrP_1-Tyrosine Transport_1
VYVMSNKVti»SIAXACS*IGAGVLA^ 
m»MOKIWPVNKLSMA£SILGHVW 

LCI5WIRHLGPLGFAIIJCPI XMAflTRVXlftfL NH f FMf UU VAfCirCALSTLIgOPSrt 
VltSSWLTTrWPVrTlJlFGFQSXIPTLY^^ 
VLUVStfXUOAXIGCrrAVCAUIOAKItSMArrZACII^^ 
ADCLKUNXXSHPFSXFFLTFX XPLAiaVCmiVLTCLJCirASGFCAAVX XGVFPTLXVWX 
CRYQQKHRnCQLVFGQKFAJLrU(PXXtVXMVVStYH£L 

CPn_0970 1113452 1114649

tyrP_2-Tyrosine Transport_2

VYVHSNKVLGCSLLIAnSAIGAGVlAV^ 

HT*iOOMIWrVNMLSItt£SILGHUGJa^^ 

LSX5HXRHXJSPICPAXLK7IXMAGrieVXDyCN1tm(FGW 

vRss»TrxHAFPvrFLA7GTOixraYYY»rau^ 

VLaVSLPIl^AKXGGYTAVEALKOAKPA^ 

ADGLKUMQCSKPrSirrLTTX IPLAMAVCYPEIVLTCIJCyACGrCAAVI XGWPTLXVWX 
CRYtSKQHHRZ3C0LVPGSITAl^UfFlI,IVXNV^IYHEL 

CPn_0971 1114(93 1115415

yccA-Transport Permease
ECSHSLYtmDnCSlSRVOCTTASICVVCTWACLm 
FATIfiVSFFIlSiaOrLSVSAVaa^XAYST^^ 

ALV7XSLAAVYGAPTXSDLTKIS KXHTFAL ICLLLVTLVTAWSKrVSHPLIYTXICYLCL 

VXFTOTAADAOUIKXSSTXCaMn^YXX^U^^ 

O 

CPn_0972 1116377 1115430

ftsY-Cell Division Protein FtsY
RCITOISUJTSYXVSFLUOLTUIJMFKPPIU^ 

SGICmAAJO^AKYYXERSESVKLVATSTnUUUaa^ 
ArCCXQSAXARQYSRVX XDTSGIlLHVWKLHULSKXVSVa^ 

DLOLPIMKLfPCVSCX 

CPn_0973 1116346 1117527

"sucC-Succinyl-CoA Synthetase, Beta"

ECKBOnJrlMiieYaAICrJLLASYW 

GROOCCVTVAXSSACILOAVAJCEiCMHFTSNQTAfiGn. 

IKDWTOCPVIJMCAGGMIEWAHSSFOT 

VNKX^LXICKLAJCmCNOVSUfXNPLVLTI^ 

YDPSOCKVR£MJUCOIGLSYXAI^Q>tXt£XVNQACUUCSW 

ASOraXOEAVSLVLSDESVKVLPIHXFGCXMtCSWASCXVAVNmrOV^^ 

NVCZ^CEZVOOSGXPCQTOSKUCMWAVELSM 

CPn_0974 1117523 1118422

"sucD-Succinyl-CoA Synthetase, Alpha"

YDSVLEAKOATCCRATKXFVPPPYAACAltXAESAGXCXtlTC 
NSTSQ^XGPNCICIIICPGECiaGIHPGYIHLPG^ 
ICVSI0C0PtiCT5ri0WOAtJEPPYTn,XLHXCEICGSArm 
XAGVrAPKaaWGHAGMISCNSGtVUCSKIO^^ 

CPn_0975 1119038 1119637

No robust homolog present in Genebank/EMBL as of 11/7/98
3XEE0VALSIAIKIUCI ILALXtJK.VUJU^XRYOUUkNrHCSW^PGPSVN0AYKCS 
EAKIEEMLDLLDlJTU>KSRCLRQOKITAjm^ 

UXTOFTNPPKRSPVESVCHD/VrPVTDRIJCREEEI ICDOPXTRSNZZLMALU3KCTARC 
EKKTLWFS IPFKYLTQIELF 

CPn_0976 1120079 1121185

No robust homolog present in Genebank/EMBL as of 11/7/98

I Utt/TYCFOPSVraPEHRUiAALDJOT rXG^ ISTVEKI 

LKLISYLLIPIVLIAlAXRCFUtSRFlQCNWXCTtfl^^ 

IWOVRRWSCXOVUWrVDYLRSOFPCFKE^ 

DrVCYILSU>rrYWTCftrtLKIRAMCITTO 

U3II>MVWlL^EALIflTEKPCKECECITTC0FUCOYCKKKLEVMSCPDFItSLVDEKIREr 
RCPSILNSAVCOVIDRKCOEHLLKAX rNEAMRRLPCMKHSSPTKRCNQVLFYTXPSPPKL 
PPAASSVYF 

CPn_0977 1121329 1122402

No robust homolog present in Genebank/EMBL as of 11/7/98
LY IH0FANI LKSSFUIEVYSFSPSVRTSFOHRVr^^ 

ACEEYVPtSTTEKVLKXLSYLLIPIVr :Ai^RYLLH5NPTAJCV30XPWUCTL0LCXDtK 
:IF r LPGGI IVWTKDSATLFXA I RLECKRVDVEYWRLHSSDKVVFY I PAOKLPOOLRLTHWL 
PEKETRKTEYVRHKt^lVrCYLTSOaO^^ 
CEFGRLLNEN ITTKCSETMEVVQCDl^MAFQOM 

LDC r/PEOOEFEOKFLhTTLLKAVLMHOFEC ISVAdMOVT Ft tCMttLAWIPFUNOK 
cMii_U'i7H U2265* U21**i 

No robust homolog present in Genebank/EMBL as of 11/7/98
KY FFMEVYwH I PAVRT"F^I(RVMAAU)AWFFU k3 IRLKVV; XOfXTNAjWAYQELVS tSTT 
KKVJJCLLliYJ.LVPtVt lALLCRCLLK^MFREDVEXERWrjCtRfiLClDIESCKLPSSYVNO 
V: tWPKKniC::K«PR t WWITlJCWnWVVFP tVFQK I PKT.IRFnYWSQKETRKRDYV 

kNM .uiv it iyltm r;twujY isK-r.nra-?AT::LprERVi WTLTONQEUJCEVORUNCE 

::ATK:.fX»KLVW^:»V5DI COXWI^FLEVIO^'FAK fFXLVEEVTirJKLNLDFLCLDlAW 

n j/j»ci j»m::u j+awiiik s;naioi kkvcaulx t\TEA cjt -cm pfsr:: 



PCT/US99/26923 




r;Pn_W> 1 12 J'." I 1 12044 J 

htrA-OO Serirtt^RfWliJWIe/' I 5C C ... .... 

o i mrnBOuisHtAvLwsLuuJWCoav jmisuvi tucovrMxsSsRKOKtic 

ATPAWY X E5PPKSQAVTH FSPCRRC PYENPFTJYFlCEFFN*FrcLPSO*EXPQSXIAV* 
CT5FXVSPDCY XVTIWHWEDrrCIt XHVTlHOCOKYPATVICLDPICrCUlVIK iksomlpy 
LSFOISDKLXVCDKAIA ICNPFCLOAT".TVtT/ISAJCCRNCU* lADFEDF ICTOAA IWPCN 
ffV>P LXN I DW ICVWTA IVSnSTTTf IC fr.pA I P5LMANB IIDOtI RDT^VT RCFtCVT^ 

:./.si-vvr^*:^-.v -/*• :.-alp/. : v: : w*:; ::--mfw^v.; 
: :-rn i vrr*.w " ! r.— , * \ w.: — •T■^^o:^M r 

TXG ai X SVEFGSVAAS:& XAFGQU LAWRQJC^ 
VXRrXAUCPCE 

CPn_0980 1126988 1125504

"similarity to Saccharomyces cerevisiae hypothetical 52.9KD
protein

PVttLIOIAICICKAJCin^IFFffriC^^ 
FISrPSXSADSOHLQOCOCAHFLVDHVNKXTOVTXJ^ 

HXYMCrDVOPAOUDCMXCOPF X»IX»O^YARI^IWKCOCFYTLXA£4HYYISOaiF 

LPKSIHTJtlCEPiLCFRPOCYEA^SPCESAlJtPT^r^IS Uw t I UPUI K I VTPYHATA 
Yl^CItt.VIWX)POKAAi{OVIHHIJCOOVPSSLXrS^ 
YSDLYNtIElJU.VMPATIPICPU^AAOT^PIICCTSYLSODIHAAXl^^ 
LSXC0LLDKX*PKX1C£ 

CPn_0981 1127019 1129952

Zinc Metalloprotease (insulinase family)
vnSI0XACX7rn^XXKSCnLPXXE5KIW 

PCWNSVAHVLDWVLCCSOOfFVRO W INAFTCPOPTCYTAASQIPCD 
PYTSXSVYI DAVFHPLLTlCOSFWlVWRYtFKSlMttXrYTCV^ 
AI>UUarPS\TYGVHSaSIPRCXVTL5HEXMUFICS0YSXN«^^ 
LEimJUAmTOAVSVPLOKItFI^^ 

tf.fjfmir.rr ILM3TnASPUCSRLL)tiL,l LKU1UCIENDIRXI PHTLVCICCCSPAaAOK 
LEAUFASLSX XllECISIinVTOAVHOT.n.SRirrTTCTSL^ 
AnSLRXJI5I^SiXI0»SrJTODYLAiaiRJCYTLP 

LL^na«£iaEiaooNvm.nsoEDma^iuwj^^ 

OSEVUCHECrrKDXVrXtlVV^XPpr^EEX^ULR^ 

imgviTvsYOFSPHANyisrigpsvsxRac ^ 

Uit^HHCALntSVSNSPrtSYAVaMCSGMSXTOAItSYL^ 

UVVXIdWYTICCFSGTOXVXSCSJUlN¥0OLIU^ 

TSRSXifX PAJUU^FKAIJVFPXGDXAYIJHPDAAALTVAAEXIJMV^ 

AA*a^RCSFYCY^YTOrciATTYKTTlJCVSEIAfiC^ 

C5fW0nvrriiLk3C»XPVLROAriuts\n^m^ 

XJWNVlTliaCDFPXVPAX 

CPn_0982 1131215 11299«2

yiON £aaily 

LErreriDiAEi»CRXJuoxsox4AVEiarx^^ 

AOCJCYrxm)SQTrSAaGMlUU)XXXPXrX)atCLXIItA^ 
XKraXXTUCSKSYWZXFWSPEYVXUlJCr^LFr^^ 
LLALLICnAYteTOWLQKOXQEVSIJjgJtCJ^^ 
S5FQVRVLrTIJUTECm55SK0XEEPTPXC51ATSFPm 

CPn_0983 1132045 1131206

pssA-Glycerol-Serine Phosphatidyltransferase

ramCYEQKKlWraAmiTTTARraRJtVW 

FHRL0CX^LX^XSANXAX>PSrAlAXAJtXHKA££ArX]JOro^ 

GXYVGKFFS5LLLXTS IIYSLCGVUU.VRYm^SOKnaJVSKPTCFICXFIPAAAA« IVS 
LA£A>^OFFPDI^AOtJrVCL£^FAXXFIGCLraSPWK^ 
AJUXFFSCLVWTVIVFFLVSWLYTLVGFP XFSX XYRXXS 

CPn_0984 1132370 1135510

"nrdA-Ribonucleoside Reductase, Large Chain"

GKVKtrcvtXICHYTXVKRNGMFVPFNODR X POALEAAFRtTTRS LET55PLP10XCES XAO X 
TKKVVXEVIJUCXSCSQV\m^XQDLVE5^ 

XAX XRRXX3GSAXFNPKICISAAXJXAFRATLQ XNGrfTTPATlSEXNDLTXRIVEZIVLSXiC 
EfAINLEZIOOXVEXOLMVAOYYOVAJOVYILYREARAJUkRA 

RAN^EStEPOYAFXAAftXTSSLYEETLCCSSOOPNUEIHKICHFKfTXLNCE 
KCnfOUJAi^EVI^IJRXJOOFSYMCVONLYWW 

EOKNFWArrTYNlJ^FRYTTATPTLrTiSGMRHSOtSSCYLST^ XYKVIfDNAL 
UKUAGGXGNOIfTITVRATOAV t KGTNGKSOGV X PF IKVAflOTA I AVNOGGKRKGANCVYL 
E>(WMUJYEDfX£XRXNTCOESRimffl 

AYCWEKLYEEYERK\A^EXRLYKK>AIAEVl>mKNl^LYETr»m 

00HVCVVTC51A£T£XIXNCSESETA\OdU^INtVEHXPl^ 

DNV IDLNFYPTPEAXQANLTItRAVrjLCVHCPOD^ 

YAlLA5SIXAKEK7TTASYSC5KW0RCaPLDTIEUJCET^ 

OTtQKYGMRNSOVMAXArTATXSNXIGVr^SEErWKHLFVKSN 

KEl^LMCAfJ<LODO(YrT£SIXEIERIFf{HLXKLFLTAFCXEPEWI IGCTSRROICWZDNO 

VSL^YLAEFKKKLSNHYLTAWKKGLJCTTYYL^ 

KSAffTS IWERKTTPVrSHESQCESCO 

CPfuO^«5 U1S432 IU*157| 

"nrdB-Ribonucleoside Reductase, Small Chain"

IIWKYTCRKXNNPRLFWRRLRIU; XTEKRiWKKEAD I \JJ 'tKUCRVKV^KKCLVNCNQV 
DVHJLVPtKYT(WA«EHYLNrXAJO*^PTEVrM>\ft0l ELWK.*XELJEOERRVI LLNU3FFS 
TACULVTNN t VLA I FKII tTNPEAROYLLHUAKEEAVirTirTFLV li .T^UXDECEVPNAYH 
KRA.1 1 PJUtDPfVWri.TVDVl 4M«WFrVQ.r;Fi :i * V (KNL'/T/Y I IKHlII PFY.TaFVWXLS 
niRflNrW: ti;EUn»Y I lil DPT HiLNK: I Dl. I W ; I KEOir-PT/WTTKI VEK [VALIEXAVC 
I.E I EYAKDi.'LPIl' : t UiUllU f DYVPII I Aim M .KH lei X P I Yli:»N PFI WtJPTWTXNK 
RKNFFETPVreYOTAt»llJ>* 

'm_ii-»Ht. iiH^rj 1 1 »v 

yinitl (^"Ou-f"! iKHA Hi*riiy 

n.i>nrPOOi^:pr-Ki mkkhhh ■ .*vi.YViiiir/KKiiur (Kr.-rrrn wtr'Pt'NtiT:: i Acme 

:» M( UjWWAOAUKD) \»VI * I AVW.4«f I ^VfK 1 W: IK* 1 Nl 10 1 VMI J< I VI rTAICrFr'jYYVP 

u,»h jw<i«WNFP^rwit«MnkKiiki4 :ki\«k i ::ic:u;l:^vkai An * Hrrvr j ka 

K/niMPRMFTf/Y IKKPTITY' VEMKHfl *KTK« ?;K I r/TKP I KKAtil
CPn_*J*»M7 11174? J UJHUS

ytgB-like predicted rRNA methylase

LOC t FA ICrFMrAYimrniNWrjVSHE I FrTT^VPGDTVI QATCGNGNBS LfXARLLO 
CECRLWYDIOKiyUJHAIALFETHLSEOEJ^^ 

KcmirnjuTczisixrMjnvnvvcLrTwcrK 

^V3.7WANRCRAPRLPir0R0CSES3VDKG 



■•».*•• r- r: »• -v 1 *;. ;i v -j\~y i ■ t f • . mm.> y.-t-i^ » ■ 

KFFtNLI NLOQC I LKMKEAAPKHFPPWRRSVWIJilRYSTFIlIOGFAMf FKAI HT t EEAjtE 

VIRFUtSINypfLriCaOCSSNCIJ TOi CFD Cr^ YWIYCro 

KATAYNCYSGUtrAAGIPGS\raiFNNACTNES^ 

ELSYKSSTlTHROOETILSATFOLSiroVS^ro 

TSA^IDAACUa^IOCWSPlHANFIIWrCW 

HEIRIIPYQFXIHSPVSEX 

CPn_0989 1139552 1139016

CT832 hypothetical protein

UrrSLAVKCVU-TrFV^VKATLSPeKTSCSPlSISKETPOOWWEI XLQKLYAIXKAPS 
A£DSLVPLLHSOTAVSOKHVLVALN0TXS I LEKSOELDLI ICNALKMCSFDSLOLVDCW 
UU.Tt^QfFYSPPZNKAILZA£AIia*VKXF5YSEACFFIOAXU0ZFTDSSLHQlSU^ 

CPn_0990 1139680 1140440

infC-Initiation Factor 3

SVAUlrXXKROIKAFK\flU.XGSASEQLCX^ X 
MI)VGICyRYOLTKK£KOSKKA0HQVRIItEV1(lJCPN EOSNtJFSTXZJCOAnTFVEXCNICVKXT 
CMFKaEtAYPEHCFKVVQICMSOCIXDICrviAEPKIJW 
KAQDENQ 

CPn_0991 1140394 1140612 

rl35-L35 Ribosomal Protein
KOWWRXSUtPnOCTOKSVSARJlXT^^ 

CPn_0992 1140622 1140996

rl20-L20 Ribosomal Protein
OOAfMVRATCSVASIUUUUCRIIJQAIBF^^ 

LEATV 

CPn_0993 1140975 1142030

"pheS-Phenylalanyl tRNA Synthetase, Alpha"
KSFOSHS lgIRISMEMgKE\VKQqntSELD0VNSS0AXjgrj^ 

HIMIEI)DWDIFVHL(ycVR^ 

RTOTSJWDARnjatOOPPIKWAPa.criWEDISARSHV^^ 

XtiAnfHSrTORKTEtRFlUlSTTFrVE^ 

KFOVUlNGNVDPErrSGYAUrciER^ 

CPn_0994 1142371 1144440

CT837 hypothetical protein

UWHPflGRHKRSRRNrTOi mmjntt 

CALniVINrli£I5CVSKSHAiaCAUC£S0FL 

RAWEViaWXJCEVFTTTJt^rOE^^ 

NDALVOIXYKJNKUinVWCTPLTKriiTO 

l^I»WICVHl»VKAI^SirTrAT»ffW 

ERXQTXLIA£ALSX£X F^KtVFEEASRYI*YFNI0TMX£NANQCKPLSPO0YLTDAYEEL 

HiaJSKYWSPLriCAMtStVLZHESiro 

SSaYANOOOTjCFttttXAHRSr^ 

AFSrPEP Tn . f OW Fff lHSDIETFAPFFS IL0EinOCPUASSFFt,TKELItErVCS71JCE 

io.TAiJCDiFFAicKxiLriwcrT.r.:.T ht r raivrxLiECTNPMsrvvvsimcLOYvsvrx 

ACFJ^FSKAIVDEKSLKLliTOVUFTLVWl^ 
UCSFPICCSXESWEFTCrrLHELTEVSKXHNL 

CPn_0995 1145515 1144415

CT838 hypothetical protein

WtLIWKRMLLTRFWFALTSU.VlALI FYAS XHHSLHTLXGASTAASGASVKLS I LYYLAQ 
I SLXAE FLHPQLVAVATTSTLF AMQNKRE I ILIflASGLSLKSLKHPUiSC^VIMMVLYA 
NTOWLMPICOCI S rTlcpiMDUG Tl URIXJUKIPALYLWIVU-rSSIEPJCri.TLWA^WI 
KDPmYWDOAFTTt^LJIGtWTOFFANDS^ 

^IJiSflS^b^ET^ 1 P'-^ATCX^CI-STOVPQR I LSlJ-AOF-nrVUI SPLACMAA I ILSA 
YAYAKLQ 



CPn_0996 1146593 1145519

CT839 hypothetical protein

^iS^^^IS?^^ IMFPVtWSCAICCLNFYTCSE 
L^ICHYOTCKEIAWAm'SPAUlOn^KKPOOTIFIAVW 

FRKSIALYFIFPIL^IUIVCTC?mCNU>LAJTaJ^ 

CPn_0997 1146699 1147664 

mesJ-PP-loop superfamily ATPase

AUG IFl^VKHAND0A£TVLJCRUJK^HLTNlJCAliA£R3*A'TO^L^ PLLN I PKSSUCEAL 
I>AlWJi:raODP:;N£CCRYUtAIO«|«^ 

^S^^j??, : ? CELPCPIXX 1 WATLCKWVMKKFFTWAG lAVIJRHFLOMVYDHLSRS 
: A ATLRMRNK t V I ( KrUVW I D 

.:Hi_OVfJ8 EH7RU H^OW 

ut:N ATi»-ih.*»mi.»»t sine utot »♦.»□* 

L I 'iltWKKK'aCLlKKHK PCPKKNFPTVFFFLLF' WVF^rrMfVW^ ALJKKARVTTFlHlQ I EH 
^VNU<LtVl 1 OT:ilK(ALNDNLV;iFtX&FRDVQTC^ 

J!*"!'™' acEVTH:; 11 * F:;A ir * wp r PBOcvAisvnar^rs w ,t www;patfou ni. 
»\U»i*rt'i^;R;;iTAurrY»i::DLYeLI 

.'il?^™-* I ^^WTOERFiyjU'rwni.YREfMIKYIfKLW^W)Ui 



l»00PRNLVLOCTFK30CP3P. LFTFLPt XLVLLFV'Y1.VF:;RC**RCK3C3AMSFCKS 
PARMLLKCONKVTFAOVAC I CUuCCCL IC rVDFLKNPWCrTS UJCR I PICGVLLICPPCTC 
lttLIAXAV3GEAi*flFfWIC3DrvlKrraW 
RHI*CACIOOCHDOl£QTtjMCtLVD<DOTOTFr^ 
VWII^lXCRfEILKVHAKRIKtJrTmJtAVARS^^ 

TAVTAVOVACMDKVlYTacBRSLEMMm X 
PROJLQkTHFLPCKNKLJ-rtflaCCLYOOLAVUCERAAn I rLCDrSSGAOCOXSQATKI 
VR5MVCEWXS POL^,TYDCTSDCLTCYOCYHDCSYSE£TArr:grn »W T HAAYQUA 

..: : :\^iK/ t :::Ei>fr;.-a:::.--': />:rvxE:>rirr;c"::;::. \:-Lt-£e ?s.FKKijiri 
:-i-r*TKr: — n-"T 

CPn_0999 1152859 1150766

pnp-Polyribonucleotide Nucleotidyltransferase
OCTPMirOTXSXNLTEBKXLVrETCXXAROA>CAVLVIlSCr 

PUlVDYOCKfSrPCncrLSCrilCJirCRPSDCEI LVSRL IDRSUUOTYP^JCOWLIYV 
WSYDOOVtJWJUCAASAAiAISDlFOSNlV^^ 

wxrrixAXi>tx£CHCDFrrTO\aj»XErciowiv^ 

LFAEVLTAVKECACWrTELFNIimKirVHA^^ 
CrnJBCmilUUROTIIWICllSLTTVR^ 

GSfiAMAORyEXSUCECLSKfYX^TYTf PPF5VCEVOR XCSPQWEXCHCXLWMXMALP 
DSATFPTnRIESNntSNCSSSKASVCCCCl^liCl^^ 
ItSPI9r,rtnnrtriCPnCIAGSGICITAr^ 
WIIALSAPKADI^OYAraiETMOIKPTXIASVIGPGCW 

XSASSASAXNKAKCI IECLVCEVEVCKTVTttRVTSWAFQ^FVEVLPGKIBLCHIJECSR 
QRI£NZS0SnnCEC0XXDVia^INEKC0UCLSKXA7XE 

CPn_1000 1153193 1152091

rs15-S15 Ribosomal Protein

DOKSIUJUUCLVGORXniXYLNST^^ 

CPn_1001 1153369 1153869

yfhC-cytosine deaminase
YYXJLQGSCLIMfEKDXrntXAnCEAf^^ 



A0CSWVNXFTEfflPFH1V5CT0Cn/CSE£AEHLKiaCFFVEXI<R£KSEX 

CPn_1002 1153844 1154089

CT845 hypothetical protein
KSADUCV»nCIVrU-DOI.YTOrSRL^^ 
ECVLSCXCEVRAAIUAALSOGN 

CPn_1003 1154962 1154092

CT846 hypothetical protein

TSXICrXHFLUCPOTOXASIUSMRVirPOXKNNFpm^KLLI^ 
Z ZKKFTCXPCLLEXiJaSVXGIOKKHFWrLTYPtXTAX^^ 
XJ2TWY1CAI0HLimGAFSVLWXSG0A£.XXGAVlMmfAl.lH^ 
XTVOIFIXPECTrTICPTPI^VSIKVa^rVLCTTCCILIP^C^ * ' * »"T * T^T *iTT 
FCXXEXXPNPYTTSIiRF 

CPn_1004 1155418 1154879

CT847 hypothetical protein

KWIEELKSIQPVSmTlttDiaaPtlSTW 

CXSXlAVLEZMriVQOORVXELXNl^UJCVTOLOiaEOCSOOEY 

AWOaOOEUSAOORAOANOKSVIIST^XESMOXlOATSSKl^TLI^ 

CPn_1005 1155957 1155415

CT848 hypothetical protein 

NRJCPVRIMMIOPt^AXKPLOAAINVPCTPXTOCPNTATAOOX XAKFSXDSNPLXVtVY 
YVYOSVLVAODNLS X X AOEXOANSSAQTYIMNOEALYOYVS X PKNKUIDKSSflYLOMZQS 
ONOAIGASROAIONOISSUaUAOVISSNLr/rNNNI XOQSLOVCOALIQTrSOlVSLXAN 
I 

CPn_1006 1156493 1155990

CT849 hypothetical protein 

TKVNFF IKS ITTLCTLPTVNriNSSRPPLEPLMTPK IGAVLFS IYELLLOAIEIROQTVL 

TOS00X^tmNXOCX}LNQE^XKYAXVSACAXESEXTRVOI«NQMYSA^ 

TR0N50 1 ILSHASTNIN I ZQQQSSQDSST I RTTNSICSTVNOLKKPU; 

CPn_1007 1156689 1156907 

CT849.1 hypothetical protein
UrfYKSUACEDCOVSCNECNDYPEV^ 
YU.TKAROSLDE5 

CPn_1008 1156904 1158223

CTB4V50 hypothetical protein 

VLNYSF X0MLXPNYVL5KR LYRWVNQL I KLCDLVKNSR3FSVEWF IS AUXXFOCU3CA 
JTvn/KVSLVPFIiLTSFUFFL I LCFRCKCYALIXdCVFVTLWAIOfVWETLyVSF 
'7VSFLLAFCLFLQCVWLA0 E££MVKCKEOLRLS COL0A0RSAYEOLIXTK50EKEFXDAJI 
AOGLDRECTECOELLKAAYOKOEYLT XDUCXLADOKTISIO^DYAEUMKYIELVSXIBDV 
VFPWAEPSVCESOCaERVtJVSRWVSALOEKEESLERUWEI LVEKQRCSDY EMROOCLC 
UXOHFTALERRCEELONIXWJKETO INEUCLVCICSEEJCVSVEPSAHAETSCVEEJCOYK 
CLYCOWEOFLEKCETLJLVTiKKLFAVOEKYLTLKKKEELTKODISFDDISMIOCtXERI 
EXLEEC/SHLEELVSRjLSL 

CPn.lOD'i 1159085 LlS6L8e 

map-Methionine Aminopeptidase
/RLLHRVILMKKfffiPCVCC^RICVOCOCH^ 

t YNACO ITAR I LCE1ATKAJOKCVTTNELOEL30ELH KKYOA I AAPFHYGSPPFPKTtCTS 
(JIEVICIKllPNntfl.KtWniKNlOVIVTtVDCYYCrcCPM'yMrCO/PEIKKKlCOAAIJCX 

ittti i a i i^rc t r I *:e MRA I RAIWOTYf :fw^ 

I JVt*M I FT I EFMI NWlKKEiaVDPKNLWEAftTCDflOKiAOWFJr tA ITCTtrf EIUTLLMO 

'."lii.i'il'j ti^'H./', 1 1 vim.*/ 

L "t-'™*''™*™ 1 ' 1 *' 1 ■''►VALLKNK::RKr'//»V ILRFi'l J-'AJ^^t. ILFVTTOR 
WU, I ;:i.VAK' I r»UWU .KTV:: I KWl/l ♦*PEFAFCnrr.':KTFP I FFPI^FPVIITiPA 
7 CTAI4.';VMKHTi 1 V::UR 1 1 FT AH 1 1 AWAFHI ^TU-C^FFOR l^WK.XLALEBLFR IAL
'TM'»i hypothetical protein

OmLKNYW WrSFFLPCTCIUiJCDSLTOIUU... -LAW* JVKQWW/UMSFTAr I 
AM r ALYCUA1* JCLKVLNTPVCA I EVWC tAVTLACVRAVLMi*TCCESWtPTf KFtitSPSYS 

pci-pialflmf^psc 

CPn_1012 1162220 1160421

ysirO-AOC transporter permease 

A I F.1 L fT.TFMKF K F" f FYFV r/F.TLLFUrfDfTSRK^ PTf ^ rf^P PP.1.1 1 A^TTLQS LPLLL 

. .wri ; : : :hv:*\:t:- ; v;-\T:r<L.".'K,v.>*. ^ ; u ; ;-.»:.t; m:'-\la; ;,:v:a' 
:Av:-"'r"i-:r«'r-r;r:.rrvv"^-rT!«r.::';r-'*- * -t'fv.: :>"jthialp 

H I F*XKX A XG3ACFAA I ACOWASOSCLCXLKLESRWnf KeLAFAGLATLS ILTLSLF 

OXTLLXDa.IFSl^Vl«MSIJCHXSVA«CAXJV^ 

TLUJimflPKHIPLYAgVAICSYFKOHmJTfqrOirVrOSSSAVPHVIT^ 

iKiroiK»PXorrauDsstocrtYHSODprrKrmicicvic 

LNWQWPSCVWWSSOUSPIOJJ«fflrtVOArrNtB 

TO POLXVTTKXGTKASCFEXVEAFQKALQCS Z IFSKDHPIISAFICLY'AJCETICS X PKHLYQE 
YLQWETrFPLLAOSODR^mvnCLLErXIKRmLM 

CPn_1013 1162209 1163624

fumC-Fumarate Hydratase
R£NSXMIRCKXDKR0eXDSI£tVE^EX)n.YCA(7m^^ 
KKCAAOANQDUirLDSKKCtMXVAAAfiaLEQCFeEHFPU^^ 

OArVEETfWI^IGimaKDAVFHrL^ 

VCTCLMVPeCFVXX: XHYLRKETQEFFX PASWFSAlSCKDALVDAIICSXAtlACALTXX 
ATQLSFLGSGPRGGLG Et^TPENETCSSXHPtSKVKPTOCCAXjQHVCACVlOlNQTVX XGG ' 

5RcwrtLNVMKPvi lYNPtosvoLLStxaouF sErrvitGLiwMCAiUOTm«SL»a.vTA 

LAPVUrtDKCSKAAUCAraESISUfSACUU 

CPn_1014 1165456 1163732

ychM-Sulfate Transporter

AIASTTJCTrCXVICVWAFXNFXPXLyTSXKSnYSFNTPlCXDFOAG 
GW^IQGIiASIICKIIASAJttStfVU 

CVFXJ AXCXLTGX/TTFXXyMPYPWICLTTGLAl X XFSSOXXDFLGLOHOANX PADFIiPXW 
X AYWOHUWIWDSKSTAVGLrnXXM l K P HM RVKit K.VMI AJVTATTLWLUCIOI PTIG 
SRYOTl^AXPLPlCXPOUmtlOXJCPQALTXAVLSSUIU^VV^ 
OLVAOGVANIGTSIJSGIPVICSI^AASIICSEAra 

KIPLTCI^VLILXAW>^CIHHFrHLFTAPKXDr\nAiTVTILTV>nTITAAVO'VOOIL 
AAFLIWKOMSOS^mSTAKnTCSDFLSKAXVPOm 
OrttPPKIPILCKTKVPTIDASAKHAlJEmXCnROCTUIJ^^ 
ELIGVDHI FSNIKSALLT AQALTN m>X T b I KH LV 

CPn_1015 1165550 1166093 

CT857 hypothetical protein (possible XH protein)

KNNYKNFSFrTSVWVRSKVmEIIljynajg^ 

AMXajfMLVCFSHXPMACKMXLVEtXJUMSQVXFTX^ 

XQSRIX^XWUCLSFFI.SAALCWLTSII IIISXLKIttArKARFnRT.T.TflArCVIAVIttOS 

AWTPlfiDVTTTMUrfINNKrTS«irjUaW 

LOSAPPKSLWIIPICUBU>(VPVWKACI^PF»CU^^ 

HXJCV^LWDISSITFFICTIiAVHAX^ANLLT^ 

VX^MVPLVAATWKmtfLDDTXJOCLIAYAAG^ 

(QtXSWXAIASYFCSSLFSYFVLESLNFFX 

CPn_1016 1167027 1168898

CT858 hypothetical protein

KRgVFMQCKLCArVFCLLFTSSVA Ofr SAU. I IU* UY0DUJVXOtX^SLICyAPtJWCELL 

FCWLSWTOOAJu^VI^BCPITN^ 

LSEJSHVPmVOTSCC0rYLCX*ILEVXX»X»£^ 

SAAfGDAVPSSXA»O^CL»VSCURffrPVPWYTPEKXGDF 

VIJKSGVNS05SSSSX^SYMVFmfE£IJt^^ 

DKGPYRSYinrJUPSOGNPHMGftM 

TDALX IXXTTHNPCCSVFYLYSX^^TWPIJTPIW^ 

oEOAVAvi£ETHntfCHi»«AVASLGNrsQ^ 

KOYTKPL/KLX OEOOF 5CCOLAPAILKENCRATL IG KPTAGAOC7VFCVTFPNRS0 IXGL 
SLTGSLAVRKDGCFI CNLCVAPH IDLGFTSRDLOTS RFTDYVCAVKT IVLTSLSENAKXS 
EEQTSPOETPEVIRVSTfPTTTSAS 

CPn_1017 1168997 1169935 

lytB-Metalloprotease

VIIMRKLILCKPRCrcSGWRAIGVVWALDCWGAPIYV^ 

VE&VDVFEtXBlVIYSAHCXPPSVIWEAjCARKU^ 

ILIGHXIW\n^GIVGEVPEHXTVVEICVADVEAL^^ 

ALLKRYPSI rrLPSSSlCtATTH*QI&U&A£MtrrrrMGB^ 

VPA0LXNNPEDIDTNIVNHSGOIAKTAGASTPEI7yVOACIRXLSSl.I PGLOVCNOXFAVC 

DWFOLPKELRCS 

CPn_1018 1169895 117062?

No robust homolog present in Genebank/EMBL as of 11/7/98

RKSYFTOKNSVVUlSiXXLAKFFSRIXYOT 

WSLIUCPYAXLGIUJCASXJJCEONVYCCT 

CFVreHWRCC^IGIDWPHFIRFSSNKHRKWSCFALFDTEr^ 

JFSRRU^lIUCL5E^PCSWISrSrPt^F^K£CrrrOCSCSWP«ICTIAYKW 

CRKC3

CPn_1019 1172146 1170635

t TT460 hyporhfiticjl protein 
lHRRMMTV!roSr:rrpPPeCEFOIFVnX^TEEAWM 

< -FC ILTOr.tCALWLPPKCKPLOECGTLVKNC ILMP3T3LPNLKPO0i'0GT3LASHRNP 

: xiaofi ivn:;Y:^;sT i k rh.^\k EKV^bTKr^^rrcK ktotktjdatlj phs lyctlhk evpo 

AUSraVTA'KraiRtX^DWEOEOEC^ 

rroprvF^Au;a:oi~"/iA^n\mii^^ 

AllQ^A:;Y:;RyAKYAkWI/ U ATATLC I l*mm&VJGBKlU IFVCR [ iKPFXDATAK 
TFFW i V IW'tt.'UVl.TKAA; IKVUKLSttAWAVAr/RKIC/FPHKUnEVTW f EEVKDNW 

k:»«#ifi j m r u/r»ibAAit:u av 

. * w:rm:a wi.- va:*vu **.»ui*y i i-papr: w.^- vrr r::Yr; itvai vkjwkslpxfftok 
pwv raw-i- rtiuitrrp r i atpae* r i jiki s^ff, ?j\ j wr jyAui^:rwNLF.TOKNjrrEA 
::KAi*jw;rMr*::ii*Tr:KKAi.OKN^^^ 



U74270 U73*«« 



OTGWtNRIAKAAOAVPVXPPP- '^^^^^rj^S^SSS^ 
LXKArtKCLEXBKFWWLEItHLP X KKEEEJUUCX TTX XCWAnrtTPVTXXXCttaTVIlLGF 
3PI XCCAXKC XOPAICSTMAALRCAX LFSOAEIYKLKGKLTK X0L0 XOJCSF0R00KTOI 

cpn.inri 

. "^!Miijf ( ': : :r.!.v..:.*KK:*TF'''-!':x^YTXTsr/r-sTa:\Lr 

TYOKI FX X^ttLCK*»"i*KKi*"f H AV LDOYAJCJ X WFRWLVTKKPFV5KFWFSMwA»L)Obf 
EQYSOALHA YC/T AVLJIDJCDPY PHYY AY ICYTLTNCTttA£XWJXAWVlAOHItPLTWL 
KttlLDIRXKK 

CPn_1022 1175709 1174216

CT863 hypothetical protein

FSFfTYAIJCLO I>t*PVPSAVP SANXTUttnSSTVSTASC X LXTATGXVIVSCTALXGSS 

mEXCTOSRSXXWQO^SSKOSAUPfUXJCPnSDSKCOQ^ 
VttQmSLSQASSSSQMSimtSAPMXriJUIQQU&SK 

C^PACZTSKKETTriaCXI^PHSVFSItnPSmLSVCSSXHGP 
KIJWULGOA£A£ANELYMKVKOirn3UVinXT^ 
K£ICVTIDK£KYTWrtZEXIUlJClNVOWUCl>^ 
VUOiJCCXMDTrXYHURP 

CPn_1023 1176008 1176331

No robust homolog present in Genebank/EMBL as of 11/7/98
GtDMXFXIWCWm^IimTTCASCLSA VTWAVPLSgA^ 
GSV FY S fYt WU Y U Ky V Ptn t U H mmJ OtSSZ C rTKFEDCTXrYBJO 

CPn_1024 1177317 1176334

xerD-Integrase/recombinase

IPTFPWFSU^IAPU»ILia^l>S>mtI»STOrHTTIL« 

Y1tXX55FLTXSAX5SPOOXSCtl5VYXFAEn,YIUUCtAZTTXAMUa^^ 

LLPYPPIXtHPKIWO^XJ WTP QEVXiAXJAVPlO^^ 

FL^SSSs^SRlTWAJCOVTSirPVSyHStJWA^ I .nWX^DI JVIQDCLGKA 
RIASrr\nrrHVAADSUI3CrLAHHPRNL 

CPn_1025 1177266 1178879

pgi-Glucose-6-P Isomerase

GAXOFssYwxnoaucmcaOTTULOtxjajn^jxTAPG^ 

SPATTOJIAXlJUU.ISUni<GIinrSMTJu«XGt^^ 

KSFTCgAPIAVItSRVEAORLiairLTlCWC^ 

TOWWISNIDPDtCAEVUTTIDCAJW^^ 

WFlAVTCmS PMOtfrCI C n JVFIILWESI^ 

ASA»ffOIAMPHATOnJ«JAl.ISIWn W rLCYPTtAV^ 

M»SXAODCRRVGrmPVXICEPQTIOOHSFFOCXJiam 

FC<nTSSCWAI€HIAOAIAlAXSDffNPNICNrtX>0^ 

pngVTQGIX>CIKSrtgg^LG»JaANHVr,n T.rnAfWSKTPtAASLLTLnfXKTt 

CPn_1026 1178961 1179137

ltuA 

CSFGrt^CniRMrriAVRSRCrLDIHCILAAJlK^l^^ 

CPn_1027 1179172 1180755

No robust homolog present in Genebank/EMBL as of 11/7/98

tMFOCTSSPPUPVZVllERVFSSSG5QU0PKAVljeX^^ 

AI4>SLVLTVSC2:iAXAVCLIGLGILVTTlLILSTXRKVllAKJYT^ 



liASL^RDrRAYTCOKTPU^LEPWCmiAC 

CFQSXA/WtFAPRSR F F Qt PK YTYl<SWIB<SIXSKVAAVCAiUJClCErF8AVtXlAClYCX^O 

ICDUVAUCTTLPtJ^VYT7n.VOEm^TAXSUOCIWCFYSYP^ 

VQLFXCXXXJCXJTICSPEXXTALVRXJ'SYYIQmXPAV^ 

LLUSQXEVUTitYtJCXTrFVI^DmSFMrSYNQsriC^ 

PP5PUEESZSEE^raCESVSVX^fU>DXJ7VDSIMVim 

CPn_1028 1180995 1181999

mdhC-Malate Dehydrogenase
FTLWSVRMAfTCEVVRVAVTG CTO XAYWFLFALA^ 

0GAALKTAAKROAXX FVVWPVKTTOfXAMKHAPRIJfRKNrHAMLRLXJONIMiSHUHXA 
eVPlXCVSRWXtOmSAKGVPOfTOARXSCXPAACVXGDTtOl^raXLV^^ 
EAM3CSSAASASRAIAEAAB^XFCf1CSOX»ff5SCVCS0HNPYGXPC^ 
DYEXXPCLPWEPF IRNX XOISUDEXAOeKASVSSL 

CPn_1029 1181987 1182844

No robust homolog present in Genebank/EMBL as of 11/7/98
RVFVISTKLMCVSNttSFOCl^ONAriaiiraon^ 

PeiSLSTLCLCAFFCAFSVICASAX XVOFLUfKESOCXTSKLCCAIKNTWSSLWLSUVS 
MpmAMVAV\nVAML^rU^PWWKLFHTVUFXW 
PVlJfNOESXDYRKXxeCFIOtlLROFICWXALVPLAX^SWUu^ 
TWSFLAQMFVLXVP I AX.XLTPAVSFFFNPS FSFYLAKOEZCXALVK 

CPn_1030 1183901 1182843

predicted D-amino acid dehydrogenase

FKVHFMR IAVU2ACYACLSVTVKLLLHSOGTAT X OLFOP I PLCECASGMSJCLLHAFTCK 
KALXPPLADCC tNATHAL [TCASXALNVP I V I .W I LRPA I DEDCAQLFTCXVCCFFKEV 
nMEXAItCF. I > I T^-MV I PPKLCALF t K.7^rrUB CLY JOOLADAOIKUTrw^FYOCIEDL 
ADt VErrat I r VrrCANA.'; t LPEUtDNPVNKVr' CLLG t rXPKOLAMWFU inahkyhva 
WTUWT. 1 1 < wVTFQINUPEETPDPA I AYOE mPP'/UlLFP^LKrMOVUIi^'AvMRSStfXS 
MI-TV [«R IHFKf *m a ULCUKnLLYDn ITTSnMLAOAVttT^rjTAY t AKKFLPT [ 

•rn.l'Jtl t|H'.Mi7 iiH4n<i(i 

.irt:D-At'|HMi«T/itt nit hiiui Aia. iM>tr 'ff 

i Ki?rFttT:»TK:»KNi<n > i ai^i iwnn i fjyjr rzu<ji ikaata. ^v^vilmi wi.Tr; 
mkk (AfiTFR i urr i Rrrjocwi i ymy^pcti :r/ I';ft ir jwr;vwx4 x 1 1 v* aAt^AVf-ntOA 
i jjykfm-yfo > :nti^a tukk.'ii. iwvn ik i vi j«; i koa:; i r hv h m kk m n.n f ii l 

TA>TTKI>VKKTDFW UlAVTKAUPtXT ;:r/:^»>irrmT.'/TIXAK fill W lAWWMAIQU' 
(- !Vi X/AT/U IFt X* :t.T I V I LFu*M.I«*l':i^jtP^I IltlPCTAf JVIJ)ll.V.*4CWJ«VWt<V 
• a. 1 1 Avu;.*.'w^vrt r/AE t pr&wMnrmtt .EKsrevsLYrTsavHOUMji
vyr^,\i*mcjrr3vMvrj»AyL*aAAFLFitL3KwV nrnncssricAPLAMrTcrLCWT 

:*LML I YALJCUT^LFKALVLtALCI PFY IOACKKKXNAKTFFAKKE tVCKTT IGLI^UAI 
FtFLTOtKI 

CPn_1032 1116153 1185566

CTJ7) hypothetical protein

i ^A-rrrpyrr ^nrmn fiKrujonMpPOPriprmf dsalloak t WFTiTvPYTSVLPKEL 

•• «.;vi v - •«■>.>•;; •yvvi^.v'iv ;aa;-*.i».t:;a; at :i» w :>'«>*; r icli vwaa 
::: v~tAJ ,, :ti:r. i '.*:..iiK!X.i^." ^"!/!-:r;rn';i7*:;:". Fr-:r**L7 

ALCFLNPEKAEPAJCVN 

CPn_1033 1187656 1196187 

CT372 hypothetical protein

NNKXKDfSGtFLTRTTVDS lATLPSEENTCyimLKrRVWlllHWmCEnilSFWtL 
t£GLCAX£ISSYAOTKmonrH*Y1CAM 

csOTwtxoisvsNLLTSvnowwro^ 
sTTvrLYPSPTAOQsrviffsrruHYicrNtwwo^ 

QTTP<3)riJIArogYSl.YAIICTLYPWD0YS0ri SYAISOWAS^ 
SEIKVOLCTODSWrOCTWrS IVMLT KS ICYWryEnfASW yKPS O GI jCOySVLltYSTKKVP 
eONSOVTCWSLN AAQH IHEKLYLFGR INCATCTALP rNRSYVLCLVSgJPLMR HSQDLIg 
rGrATWCVNAXAISWWOJ^ESVKWAn 
VYCUUNLSL 

CPn_1034 1188589 1187732 

Predicted OMP (CT371) (leader (18) peptide)

KTSHQICYKKYLSYS ILVQ K I ARYVKKTVLFTTFLTSCSSFYASCRYAEVRS IKEVAGDI L 
YDEO«mXLOU3mrUXXXAI£HSIWKSX^ 

gn/qpiESAirLLigciQmjuM'i r w rpprrAioaTiJtQLiwLNvsLEPr apopoapl 

PKNU.YTSSIX^SCamnCCPCUJILiaXXCTPLPAICt XYXCHOXENVUUCDLCOlCrCIAy 
FC rTYKAQCLHPP rjfnKIAQVQyNySKXLLSWTAAA r In'.F HQMHE 

CPn_1035 1190081 1188570

aroE-Shikimate 5-Dehydrogenase
VVgfJtJWPIVHLOIWRFSHIYYCVSVHLCATVTO 
LIWTasrMElJrXXmACWttTPlWHXIltS^^ 
. LQTIJuTSHPItlKLII^YHroia'mirAlY^^ 
U.FKProi£HGTHftFSftVLSPLXSNAMfr^ 
KS«IY<*WG®PVDRSISu^HNFIASWLr4AW 
VTMPLTTAI FDKVDALDASAOt£ESXHTl*VPTOQXIL(mirtSn^ 
HIAIVGAaGJUUCAIAATIJ^MQWNI^ 
IXIOJPEVTFPWuWIVMDimKmSPYtniA^^ 
FPDTLTPESCOSraNYVlCNrMAICV 

CPn_1036 1191180 1189984

aroB-Dehydroquinate Synthase

gytocscrsci i ptMum«sCT iiTrpKvvia^sNrroKKLrssisTAYPLvi rrws 

VQOKIXCPIUWIKKLCYOVTVtTrPPGEPOTCTWrrriSLQ^ 
. CTVXiHIGFI^TYCRGLPLYLXPTTITAiWOT 
MCPOTLSTLPREEWVHCIAEAI IQ^XACUYlMEri>ISH5XKZJ^SSOXIi<EFIKRNCOI 
KAATVAaJTYDRSUWILNFCHSIAHAIETIJU^ 

TPOUO0LEiaiJCItFm^STlJCDL0SXVPtHti04SLYSPDrX XYTLCYOXIOaSOXCLJIM 
XMXEHX^SRAAPFtCTYCASPIMEXLYDXlMSIClMIIWC 

CPn.1037 1192386 1191123 

aroC-Chorismate Synthase
LHFSRCSIUlSrtXEIXItTSVSllSHYLVlCVMWSF^ 

gLEXJESDrVPAMCWUtPGWPCTSSiUCPffilVOILSCVY mUK, ' nVl 'PLSLOILWTDVDSS 
PYZNSEXLYRPGHSOYTraO^XVDPIQCSItSSAlteTACRVAA 
AYI^SMSLTLPtnflJCISPELIHKIHTSPrYSPLPiraiOEIL 
TSPXKDFTU«EFLFGXVKAI*LASAIJGXPAAJQGyEXuWF^ 

TLKSWCGCTtjCCXTXCVPXESRIAF KPTSSXKRPCATV lAllUai U RrPQTCRHDPCV 
AXRAVPWCAHXNLVLADLVLYORCSXL 

CPn_1038 1192750 1192199 

aroL-Shikimate Kinase II

WXLELRNVMT 1 1 LCCLPTSGKS SLQCALAXF LNLPFYDLDOL IVSNYSS ALYSSSAEIYX 

AYCDOKFSECEAJIXLETLPPEDALXSlJQOCTUrYEASYlU 

LOCim*E«JC£AMKTKPL£nLT£JUDM0^^ 

LLKS 

CPn.1039 1194011 1192665 

aroA-Phosphoshikimate Vinyltransferase

VCFTMliTYKVSPSSVYGHAF t PSSKSHTLRAXLUASVAEGKS X X YNYLDSPOTEAMXCAC 
KOMCASIKKFVQILEXVroPlAXFPKYTLXDACNSG tVUtf^ 
LQRRt^PUdALRNFCA5FHFSS0K5VLPFTMSGFUlSAY 

LAEGPC3FT I XEPKERPWF0ULUWt£KLHLPYSC5XnTYSFPGSSHP<^SYH>nrG0FS 

£JMFIAAAALLJK5Zj0PIRUWLCXl^X0CmtFFSLH0iajCASXOYtM 

OCSICMDGC t DALP ILTVL0CFAD5PSKLYMARSAK0K tSOR XLAXTCCLQKHQACIOPT 

HOGUAWPsTLYOAVLDSHDCKR IAMALT lAALYASCDSRIHNTACVRJCTrPNTVOTLNI 

MEARXEGCHDNYSrMSTHKRXVFARCSFG 

':Ftl_1040 1194076 11*407J 

No robust homolog present in Genebank/EMBL as of 11/7/98

RP'^SLFtJtTVSPSSSFREinVCAAPU.YPRRR8POYLFSPTGCFMSTTnnCHFXKTA5R 

WF.PVLKE rVAJNYWHAQWI rnLSFt£NSCVUCKISASDIPTEVKEEVlXKAAEEFRHCHYL 

VTQtSH ^ET^LPDYTSKNLtaUXTmiXLLOLRTCRVl^ 

T/A t ELRAiI ELY PLYHDI LK EAQSK XTVK5 X Z LEEOGHLOEMEftEUCDLPHGEELLGYAC 

yFECELOt A>FVERl.EOM I F0P3JJTFTKF 

'.'lli. 1 04 I I l-ifi jo * 1 1 -147 2* 

•lunA Atbfkmytah'r UiufiirhfN.AMuio. / -Ox'jnon.inoatft 

AMLItiAt.UI;tt<fr 

1 +t*R IK [tUt I KITi!l« VKFLVMFLDV:;eP5 r S R KYOd tLriXZLmLTT. FOY LH YOGK 
WV;»rtp|.AJJNN: '.kNO^r/MP UJlX 1 " !KPNI IK0NK K LLCPTHD j^HDKO^SCf IWHPFT 

*;:ald;;ti • t k t vt* ;» iaylyagjutrylha i JswwcNUtrattiiPY itkklceoaoklehv 

I KAWmiErAU:i.V:;KU\ri^*D3LEnFFFSONn:rry (CIAHK rAVCYYYtJ0*lKAK3HFV 
' a klNAYIf mttWlJlMt::! TTVPHlDUXParJT rAAPYYflKKELA iaoaktvfsesn I 
AAK ( YK1 1 J o W * Ml ITiTJ PD ;LKP. I LKLAKHYt WU.'f ADKII.TJIFX UtlCPLFAHEFTD I 
M1if li U*. a.1»» a-IArTVrntEIIIIWFV!^i»mrAUJKaiTFTrWPLOC^UASL 

I I :pu : i •»■) -i o.'i«.*« i ki« i v&FWM L*n www* kvi/ rr/i altiy paiwttyfotyrdhlw 



RFFLCUCVLiRPLOnXYVL 



lOCEDUIX ratLCDALJlCPC 



fTPn_l042 n "nr*J6«l I CD5?)4L, 

bioD-Dethiobiotin Synthetase

^sprrYrRANFFwixrvcrOTcrarrrv^iLAi^ 

HEi^CAYCHmYIUiaCPt^PHKA>OXC»l\^rC£SH:CAPI^^ 
KRLOCtVreSWSCSWXLVSOAYLMXNKTCtTVEAMWR^ 

Otxictpxxcr^KKcrrrrixscYAix^nceswrsNHOG 

:■• 

bioF-Oxononanoate Synthase

LCAJCSm.«^lCORIEXDLWYHMTESCLIFNIOT 

YIHASITDGIlttJWWSWWaiDtKXLX^ 

SEI£E»YSAYLIVDDOaVCVT UJUUeJLX V3AL^ 

SIt»«JIIPailTrTITAOPPMALTAXllAYtKN0RAnW 

LQLMIVrTTPXOSXCVSOSHIUUtOAAlAXONSCr^^ 

TlOlCIDKLUCTLEOiriCNVSSL 

CPn_1044 1198700 1197699 

bioB-Biotin Synthase

Aiugat£sT^$iAXRmHmrn.xtfKMM 

CAYCAOS5RY?rfTWT p rCPft«IVDWERAICRA 

K^rrDLOAEVCCAU>fl^EEOAJOa.YnAttYAY XTTItSYTCDRXM 
TLDVVMXSXSXSTCCCSXVl'K'XSEDKXIC^^ 

rWXSFMEVLItriATAirVVPPRSIMUAACM^ FYCOCLLTVEN 
I«ri)EnAEMIKLLaJPRPSFCrERQfPCYANNS 

CPn_1045 1199602 1198901

conserved hypothetical bacterial membrane protein

cmj»s»TS > 4iucrLVF5Yi^sTrrx4xvxA&^ 

SOVVNCXTOPKICARVKXnATrJUaiJlSSXVOIFMrr^ 

ASUATTVS001X)IVl.YTTFWnrTPKSSUaJlSNCSWrS0I POTTIV0TCXLYFQCL5 
rPOrl^XM^IfSYTYlCXTrCVLTITL^YlAV•^^ 

CPn.1046 1200675 1199590 

Tryptophan Hydroxylase

VHYCERTLDPXY XUCIALKLRQSL ^LFFONSQStQRAYSTFYSYYRI ILOKPOCDCQALA 
RHKCIS XLEm»lLXXVKLLSL5l0>rO1UXXSTt^ 

YCPirmmzAicu^DFLatoAviKFrnzncFm r 

ASWCirTLDKTONrSLTIWKSUd^^ 
l09ttASNLIAXVIOHPTVESCZJE>0{03UU^r^ 
tOIZSLFFVmPOtTX^X1tKPI)eLVQ.TSXLE 
00 

CPn_1047 1200537 1201043

dapB-Dihydrodipicolinate Reductase

DFSHPLLTOEVVAHLLISPKPLlI Ul lUP PUAOCSAHDSIJELTHXVIVWCPHAILCAY 

tHXJaVKa^IOnCFDIMRTTIOiRYKICDSl^CT 

lTI»5XICrXEVQ^SRUC»IPGWEmiSSCSOILVRirr^ 

pQpcr^sLcamAvijfiosccixiQmtiH 

CPn_1048 1201588 1202604

asd-Aspartate Dehydrogenase

OCnUECKRXAVLCAnCLVBOlCrVAlXHICm 

PHPlinntX^IItXXEEVOSDrVVSF^PSSAXSKEAYCX^OG^ I 

IPlVKSPHFWJEOPYPqCXITCPMCCVSCITUU^PLM^ 

fCVVSLDlJJUWFHXVGtmXIJIJETVKX 

VtTSRIVSLOnLYSYQtmEPT»nT0LYtXf^ 

RTXRMNVLX HKLVRCAACTUASMINYFFX9YXJQtOCLR 

CPn.1049 1202586 1203914 

lysC-Aspartokinase III

ECNVSKIVYXFCCTCIATAENlCLVCDIiaCriKPSFVVVSAIA^ 
EEVLWCIECrOiEEIVKl^IPFFVrrWreRIiPYIflHLEI 

RAVCST^QWDI^FIXAR^ILTODrnUlASPNU)^^ XGCPXGSNG 

LCETWXC3^SDYSATllAXIJkWTEVT«YTI7VT<:iYTMDP 

Nl^FCAKVt,YPPMl*FPCWlACIPXFVTSTTOPE)COCT 

Y0SFCSVprrVLCCCCl^LCXL£SHCXDPELraAO*«l\A^ 

VLSX^SVTIOKHSVAtlTMICWI^PKV^ITgKLRCre 

ELAECI IEXUWtTYVKOKAIVAT 

CPn_1050 1203884 1204798

dapA-Dihydrodipicolinate Synthase

LCIOTSYSRHVCRIKHU.TATVTPFFltCTIDFA5LERli^FODAVCNOV^^ 
SLTXKEK0ALXCFAO)L0LKWLrvCT?CT 

K^COOXLWFEAVLNAAKHPAtLYNIPSPJUkTPLYLCOVIUUAHHTO 

OSYKS I APH lOLYOCDDVFWSEKAACCAKCL T SVI^AWPEEAREYVLWPOEOOYRSLWM 

ITCRWVYTTTNP XC XXA XLAYKKAXTHAOUtU PU3 X EOFDLENVSPAVtSMLAWFKLItTS 

VFSYS 

CPn_1051 1204956 1205270

No robust homolog present in Genebank/EMBL as of 11/7/98
FFMTPKS tOOLH L I KT tDPVRK I S PVTTKKSSFFROSLLRFLELFWHFLYC IRS I R FHCV 
HIATFICRCLILFLTTLFLSHICILHFITI.PWICKCOPRI [RKNK 

CPn.1052 1205402 1206 16«» 

No robust homolog present in Genebank/EMBL as of 11/7/98

FF IOKHKYNSREKXKl'ALR II* J.T/C ITVFRKNf*!XSCYOK X PY^LSCYVFMIPKS ICRCR 

.''FCFFH' iKKTEVETKEVK t KD£ I RP*.TLF/^NDfVKVAEnFPFR KAAL&*U<*5Q:*r! IflNLCA 

i.WLfi:y^t.:r(NF:*4vTiv«WfFm^K;rrf*rv^B;EPFnYT^ 

iTJTPCLU/XllXWilVi :i IRTRATr^: r U'."VI If i #MVPR PUICTFY IW IKAP PIJ*FFFRLTSD 
VRROItKRKFRL.Kt'irKD 

'l-t._l«'*» I.Mi.|»M I .:*).. 70 1 

K tf .i'UililKAK 1 1 * MIILYLTt '[* ZA I WAi : I ■ I L; m * "I tTNY.'IKKA: ; | (BVI ,\T^ lit Klfl : 1 3REP 

. acuvm W0T%*Y: : t v.* u .i ■»« u MHftnx :t< x a« n »kn i n/Apr 1 1 iavi :^ « mi bake I PC 
::r;KBYAKKni.TAna»*t:;EKt^ffffrvv';w^ 

EU'.VK^AF 
CPn_1054 1307010 1 30746*

No robust homolog present in Genebank/EMBL as of 11/7/98

sRMiKRrtKQvu^POi^ppPOHSVCSisspsKLfnaArrrLVFtaau 

•VJICTU^LPKELOOIJTIttOEVFACLlUC^ 

C FEXJ*Q I VAO ECWDLNFLI NGGRSLKHTAESES LDLfHVSKRLCYLPSCOVFCICUCJtSA 
T rMPfltiritCetHKVAVAPtWN.Tr AMAEKAr « ALfifcLCeSVYRSLTCSYHOItrLCSC 

. ;it ITA?^tUV« - i "ALYKLi* LiAr>.UWK *" f'AVWKEO / :FL Itt&b X**7T 

YVKOKTOEFQKAGEIUJJUJtAlATCVSW 

omMfmvarii M Wn »i i( Tt ew e* >e^g ? rrMir.tJtM»g rTTilFT.BrB i TTJrT/irgA 

TAon2«ir.snA rirc i. rmEWMMHj tzg 

CAQOXJCVKPTCKESlAyVTSCTU-VSUSmiJUWYTM 
RmOfyETOCtBLVWILLCVSSJUJt^^ 
OSIMfTltSESIJUX^ACXniLMCUeDllCVUC^^ 
QORVAAFESTEVPEI FEAFUKFSUXUUUtSLrrUDKT 

CPn_1055 1205363 1210521 

No robust homolog present in Genebank/EMBL as of 11/7/98
CKYLYHHSYPPPPCHSMSAFTCLSJCFJWIAZTFLV^^ 

FCTflTGLSAl/SWLWyff.rn MKRgVFTVRPEEZ PECVSVAPSEEPALOATOKTtAJL 
PKttaQLDRYIQEVVSCUaCIJCPIJCITOGI J.jmAKZXLQVfIgVWIUJWl £f VELQQIH 

RRiOTRjcvAjrrrTONAYGVAJCTATEi^^ 

EKILRIGfLELRR 

CPn_1056 1210482 1211228

No robust homolog present in Genebank/EMBL as of 11/7/98
GEDIKML£KVEEXDOfUlVia*XXFX^^ 

TSExg^ostJOTLguiYXgY^KOT^ 

IFVSXWLPluWSCLVSTVHVPFltErirY^^ 

RETUXAXAPREErWLLCX3nCSKEICRLX^^ 

YSFTXRUC5 

CPn_1057 1211467 1213596

CT356 hypothetical protein 
IIHFYFTNFAKPEPLYTMCLITBCSPYUAYAJ^^ 
GOmSMCgVKWESrTNFKAAMIil^^ 

ETvswpxjn^TroOTrr^vNYLatfECKx^sTwz^ 

KVI^IASnaCgVreCXatPESSXJCRTVAAfcYOD^ 
LEYOES^rTVDRSISNVALQGVIuWXaQ^^ 

f cmyjiTif [ fyy p^y^ffgyi ^yjT 9fT Y5PfV^ftFY9?r0ftin^V??0t f, ^ w * <c " rrTg 
N«i3t)A£irCOrraiSRICFTO™ 

LKSISAQRSHItSICXnLSLTFNNCWflYTF AYAGRUJGEVEYZ EXGXKCGZFVRHSLYIOfH 
ELYWOmcaAKyRASIZmfGALILI^^ 

rY5VXXntDSTU.I1C0SPI^00£TXSGNALZCQCXJ^LKLZTOa«rW 

AjmOCPSSI£U^ASQNYr3PJCHVKVI^^ 

ODlUTVLPeySO.ZPKCOCTArriYVLEVDOCXItnCOLE^ 

CPn_1058 1213742 1214836

CT355 hypothetical protein
EVMttYCrWZVLVSTGCZFfcOHHOCT^ 
nCVOlXllWTAXJnttHKlNUJTfNOT 

AXAXKX ATDPTAVNQEIED4PCRDLS PLYAHFQISPNDI FNVI DKTLTAORVHuWVRSK 

VMLKVTFG KZRC YY»CLgE»SIUOaWC^^ 

KXtuVTALVZSOOOQLVCSEmRXNSELSGS^ 

UfKrSQSTEI^XMfESXIKOHl^ALEAESVEKOyXDR^^ 

CJSLL 

CPn_1059 1214848 1215678

ksgA-Dimethyladenosine Transferase

VTRSSFAOt£RJT£EZOWCPjaCSWNFXVDW 

T^EtlAAOAOVIAIEJWIWAWlJEELPIRIXZXmCICYn^ 

YKrrrPtXTn/UAPOPMCTVTVMVQOCVnMR^^ 

KVSASCTyPKPf^SAVXHHKVKETXPUDEEXPVFFTL^^ 

KaTVEQALKEUBXI^PlVI^^ 

CPn_1060 1217694 1215727 

dxs/tfct-Transketolase

YTCW^IHITKVXraSCPIiELIl^PADUCK^ 

SSNXfiIVELTIAlJ(YVrSSPXDICr:rOWHOTYPHKLLTCRNNB^ 

PTESDHDCJTSGKACTALSIALCHAOTTPl^inWIPILGOAAFSCCLTLCAUWXSn 

LSKFWI LNDNNMS X SKNVGAHSR X FSRWUOf PATNKLTKQVEKWXJUCX PRYCDSLAXHS 

RJU-S0CVKNLFCPTPLFIOFGI>YVGPIDGHNVM^ 

OUfcAONNPAItYHGVRANFNKRESAKHLPAIKPKPSFPDirXOTX 

AKSIGSRI^FK0KFPERFFDVGIA£CHAVTr5ACIAXACNPVICStYSTPXJflMI*tK/F 

HOWaCDLPVIFAXORACLAYCDCftSHHCI YDMSFLRAMPOM X ICOPRSOWTOOLLYSS 

LKWS5P3AIRYPNI PAPHCDPLTCOPNFLRSPCMAeTLSCXSEDVH IALCTLCPTALSIK 

HCLLAYC r SATV\,T5P I F I K PFDNDLFSLLLMSHSKV IT I EEHSIRGGLASEFNNFVATFN 

FKVD I LNFA I PDTFLSHCSKEALTKS IGLOESSHTNR I LTHFNFRSKKQTVCDVRV 

CPn_1061 1217932 1217666

CT330 hypothetical protein 
F(XtJfV£lKHKDPSLJeKLFAX^5LETU<St£DIVATY^ 
U^>VKSKGELUCSPSCDPXVOTTPIHPKH 

CPn_1062 1219835 1218150

xseA-Exodeoxyribonuclease VII

Kt JFPVM^prOAVASLTER XKTLLESNFCO X I VKfJ EL1INVCLO P.TIH LY FC I KDSQArLN 
- y\FFWFF_*;KYY0feKPK0nDAV I IHt'KLAVYAPRGQYQ IVAI lALVYAfiECDUjQXFEETKR 
Ul.TAECT FATCFJtKPLPFAPOCIOV ITCPTHAV tOO t LRVL2RRARNYK I IVYPVTVQCN 
^AAIIEI.IKA [CWftlAENLAOVLI I AROGC i*| EDLWAFNCE ILVKA I HA:T t P rVSAVWE 
TnYTU:i#FA:;WRAPTP:>AAEtVlTKS5EE^ 

imKliWAEFYTTAWCLDfl t EIA IQKCVCCK til ESKWYON UWWLOOJLVSPKtCRLOS 
I XKMLJOA^'-HKAI^WVRi:! IOLKKSLTYPRO IvXJA^OKUIIWROOLDTLlCPRLMYOKE 

wkYEN:;vAtfY:;AU(£oui:;LNPKN^^ 

I < «A I LT/TN I K C • K L I Kt : 



PCT/US99/26923 



tpiS-Triosephosphate Isomerase

PCRESMAlKFWDflCDIKKrPCv . U»MWWIcrtCEAK^Tr^U^EFJ^CT£3lA 

JPrrSWIHIVitff WAPW LQAQNWm^TO 

HtFCCSWtASXVKSVJkOAOArPVUrvOISlXVS^^ 

n* X AYEPVWAZC7TCKVAEA2I7VQO XKNFCftCWAEKFSCATACEX J UYGS5VJCVDKAOR 
FC0C3X7VtX^VCGASLIX»SFTfVAI(MFMV 

CPn.1014 122371* 1220H5 

,1.-,^,^ v ri-. •.-* •,".ii'-*\R'** -t: "•r.vrr?;;: T" :r:\* v y~ v :pKr* r >"*.T \ 
CPH.1063 1221140 U20928 

No robust homolog present in Genebank/EMBL as of 11/7/98

jUOU£BlMlUrrSOPCrLrYFSXFE££LFFCttCIU^^ 

YBCAIFWCLP 

CPn.1066 1221132 1:21488 

No robust homolog present in Genebank/EMBL as of 11/7/98
SKSuwioma^ATtJirLFijcviircLiLvo 

DXKjaCVTSNCAVAfCZOOXLSFS'tNX^SiaCIJUJCm^ 

CPn_1067 1221675 1222292

def-Polypeptide Deformylase

ZOWLVVRDmElgOAIWOWXWUJEYTCSPILI^ 

HRcvciJuu>ovG»ivsLrvKrvtmrmx:E^ 

SXFGLRCTVFIlPOXrrvrAMDUCKZrrEKUSCT^ 

mnukSLXKiwrntmasKOLys 

CPn.1068 1223267 1222365 

rnhB-Ribonuclease HII
KSQtPPPFVVTLmAOe»(LRX)OUCEia^rSQPO^ 
KaEPIETFIJPCIUnTTHAJWIQDWPXC^mE S a ^ 

AYIMXVODSlOIlJaTriCZASlARZ XRSlCVCDVIZLTTfXYNELYCXrQMXimiJIWAHA 

TlTDWIAPltPACDVTAISIXJrAASErrUJOkl^^ 

RWVQSZQlOXECnrQVQtP1GSA<»NVnCAACIVZAX0P^^ 

K 

CPn_1069 1223507 1223941

yfoA-HTH Transcriptional Regulator
VZHgHZHmJJDXKntSSRESQSLS XJffi^ 

lOMAauzzasxMVHULSStrszr 

CPn.1070 1225523 1224144 

No robust homolog present in Genebank/EMBL as of 11/7/98

RR^LKTrTCGMOCrifWPPPIffOOlPZPLOBOOQSCSOOSIW 

<^I»JVbGHVTOACSUJ^^ 

tlVUUPlJNPSMrtXXlUQOYGPZCVGMSfQQLPHCTOIUirjCEPU^^ 

HRJUJCAAOPROCESLV TnT O NNrJi.UMU^P PWSUU^AVSISA^ 

WltQPEQQPCPPPPmfTOT WVOWAJOQKIWAOCCIWTaO^inrn TQIT JW1V 

LSLESOYlCDPIi»AAI©Ivm.IKKIJCW.VA£Dt^ 

S3BKLWP WB0TW P R^P ^RZXJ XVLVrT r« 

WI^EIDUXJIVTOCBFCIPCOCLRCWWaJTEKRPHRWL 

CPn_1071 1227336 1225885

No robust homolog present in Genebank/EMBL as of 11/7/98
l^l lM V LP MNSWnMaWrMCEWEV U mi im SASDISEEACSSOCAAP irmPlIll 
TTCVEICRVOF>rrACXa3E5TIKMIOEACG.VDSII^ 

CRIJCCTYXACCXJ»E»«^ACLVHEIX5THC PI AVAIJUUCIMJLNUIttVTOriLgEX 
0nSPXQKCSEAICrOLYOTfQSl«S(7NFFCJS^^ 

IESIXAOTSSASNSTRIPACYILNTSPLTTSI^CBSROARJIPSSVCAXPOYV^^ 

ICMMIOWZCVTNLICrarSAX£PFGII»rVI^^ 

PNFKDrWUJ<LAICrrCPAKTOCTSVVDIKKro 

P5LVR0ESWPTPAE5VPIitSSUEC)O0^0EZXNlAr^ 

ZNXE 

CPn.1072 1227924 1228835 

No robust homolog present in Genebank/EMBL as of 11/7/98
KiaKILHAJMCCWXQHiJCZOnauCTS^ XOXX.YSPT 
KVSSYLWOIlASTIFCVI^RCIXOMRYUISPCWOEnT^ 

LAIOXrRNKECXXSCI PEOKEAXLANSSALVLC ISSFCITCI PATLMSLLRONtSPQKRS 
XASCSFXl^XDSAPSOA^FYKCVLFRCrrAIVXlALSOl^AOLXJLSP^ 
OAVCSACXCV«3<NFl^NnrYPAOESLFSYVHpYSTATEIOEAOCWXSDEVACLTI^^ 
PKMN 

CPn_1073 1229011 1229832 

Predicted OMP rcT37n 

MRJtYLFMVlAtXrLYRAAPLEAWIKXTX>AOAVlJCFAREKTI.VCFN X EDTWrPKOMVOOS 

AWLYNREIXUCTTUEEQAREOAFUI^XSFLWELV^ANLPjr/LTC 

SORPVHL I KNTLR X UlSFNIDFTSCPA ICEOCWLSHPTKOTT FOOAMAI EKN I LFVQSLX 

NCOPMDAALEVLLSC I5SPPS0I IYVDOOAERLRS XCAFCKKANIYF ICMLYTPAXORVE 

CYNPXLTAIOWSOIRKNLSOEYYESU^YVKSK 



RNA SECTION 



r.mHNA I ltt.ft i | itt074 

RiUttin.-l }' HNA i.HV l-lli »'.07M'> 

.: is ihna itut.;4 1*. inif..:/ti 

tKNA U»<". f»l tdtr.'.irt 
tRNAs



tRNA begin end



7 
8 

9 

10 
11 
12 
13 
14 
15 
16 
17 
IB 
19 
20 
31 
22 
23 
24 
25 
26 
27 
26 
29 
30 
31 
32 
33 
34 
35 
36 
37 
30 



69637 



29607 S 

296151 

409948 

462141 

672236 

677264 

739403 

781610 

784822 

784922 

636119 

643926 

677400 

108S60S 

1142034 

1173663 

1230026 

1137462 

1030603 

1000022 

961607 

607413 

786780 

71S971 

708441 

680239 

631443 

626987 

293477 

293399 

269142 

269063 

164389 

67522 



69728 
91070 



i ■ 



Type

Thr
Trp



296147 Val
296224 Asp
409922 Pro
462214 Arg
672218 Leu
677337 Arg
739486 Leu
781680 Gly
784896 Glu
784994 Lys
836191 Ala
643999 Pro
877473 Arg
1085676 Gln
1142119 Ser
1175944 Leu
1229942 Ser
1137389 Val
1030333 Cys
999949 His
961536 Gly
807341 Arg
766708 Thr
713989 (*u
706334 Ser
680176
631373
626901 Ser
293405 Thr
293317 Tyr
269070 Ala
268992 Ile
164318 Asn
87450 Met



Phe



GOT 

CCA 



TAC 

ore 

TOO 
CCT 
CAA 
TOG 
CAS 
TCC 
TTC 
TTT 
GGC 
GOB 
ACQ 
TTC 
TEA 
TAG 
CCA 
GAC 
OCA 
GTC 
GCC 
TCT 
CGT 
TAA 
OCT 
GAG 
CAA 
GGA 
TCT 
GTA 
TCC 
GAT 
GTT 
CAT 
What is Claimed is:

1. An isolated nucleic acid encoding a C. pneumoniae protein as set
forth in Table 3.

2. The isolated nucleic acid of Claim 1, wherein said nucleic acid has
a nucleotide sequence of an open reading frame in SEQ ID NO:1.

3. A probe comprising a hybridizing fragment of an isolated nucleic
acid according to Claim 2.

5. An isolated nucleic acid that hybridizes under stringent conditions
to the nucleic acid sequence of Claim 2.

6. An expression cassette comprising a transcriptional initiation
region functional in an expression host, a nucleic acid having a sequence of the isolated
nucleic acid according to Claim 1 under the transcriptional regulation of said
transcriptional initiation region, and a transcriptional termination region functional in said
expression host.

7. A cell comprising an expression cassette according to Claim 6 as
part of an extrachromosomal element or integrated into the genome of a host cell as a
result of introduction of said expression cassette into said host cell, and the cellular
progeny of said host cell.

8. A method for producing a C. pneumoniae protein, said method
comprising:

growing a cell according to Claim 7, whereby said C. pneumoniae protein
is expressed; and

isolating said C. pneumoniae protein free of other proteins.

9. A purified polypeptide composition comprising at least 50 weight
% of the protein present as a C. pneumoniae protein comprising an amino acid sequence
of Claim 1.

10. A monoclonal antibody binding specifically to the polypeptide of
Claim 9.
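
The open reading frames of Table 3 are given as a locus name followed by two coordinates, and the nucleotide sequence is printed as numbered rows of ten-base blocks. As an illustration only, the following Python sketch shows one way such rows could be joined and an open reading frame cut out by its coordinates. It rests on assumptions that the text does not state: that coordinates are 1-based and inclusive, that a first coordinate larger than the second denotes the complementary strand, and that the coordinates refer to the sequence being parsed. The function names parse_numbered_rows and extract_orf are hypothetical.

```python
import re

# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def parse_numbered_rows(text):
    """Join rows such as '1301 TAGAATGATC GGGGACGACA ...' into one string.

    Assumption: every sequence row begins with its position number.
    Characters other than A, C, G, T and N are discarded, so stray
    digits or punctuation inside a row do not end up in the sequence.
    """
    pieces = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, headers, and rows without a leading position.
        if len(fields) < 2 or not fields[0].isdigit():
            continue
        bases = "".join(fields[1:]).upper()
        pieces.append(re.sub(r"[^ACGTN]", "", bases))
    return "".join(pieces)


def extract_orf(sequence, start, end):
    """Return the bases between two 1-based, inclusive coordinates.

    Assumption: start > end means the ORF lies on the complementary
    strand, so the reverse complement of the segment is returned.
    """
    low, high = min(start, end), max(start, end)
    segment = sequence[low - 1:high]
    if start > end:
        segment = segment.translate(COMPLEMENT)[::-1]
    return segment


if __name__ == "__main__":
    rows = "1 ATTGTTCCTG TAAGAACACT\n21 TCCAAAGCGC"
    seq = parse_numbered_rows(rows)
    print(extract_orf(seq, 1, 10))
    print(extract_orf(seq, 10, 1))
```

Rows whose bases are themselves damaged cannot be repaired this way; the sketch only removes spacing and numbering.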
Contig4 63
Length: 273254.. 



1 ATTGTTCCTG TAAGAACACT TCCAAAGCGC ATTTAATCAT TTTTAGTAAA
51 AAATAAAAAT ATACTTTTAA ATGTTGAGAA AATTTTTAGC TAAACTTTAT
101 AAAGGGTTGT TGGTGAAACC TTTGGGTTAC TCCTCAGAAC GACTTTGTGA
151 TTCTATAGTA TTAAAAGGAT CTTGGAGTAT AACAAGTAAA GATCTTTGAG
201 GATAGCGTAG GGCCGTATTT TGAATAGCGT CCAATAAAGC GCGTTTGCAA
251 AACGCTTGAG TTTGGTTGTC CCAATAGAAA GTGCCTTCTT TAGGAAGAAT
301 CTCTTCTGGA GGCACTTCAT AGACCGAAGT AAAGAGAGGA AGAGCAACGA
351 TTGCTGCATG ACTTTCTATA GCTGCTTTAA GGCAGTTCTC GTACGCTAGT
401 AAAGCTTGGC GATAATATTC TTGCTGATTT GGTAACTCTT CAGATTTAGG
451 GCCGCATACG TGGCCTAAAA AGGTCGGAAG AATACTTTTC TTTTCTGCAG
501 AGCTTAAATT TAGATTAAAC GTTTGATCTA GAGCTTCGTT TGGAAGTTTT
551 ACTACTCTCA CTTCGGTAGG GGAAAAGGGG TCTTCTCCTT TTGCGGGACC
601 CCCTTCGCGT TGCTTGCATG TATCCCACAC GCTTTTATCT TTTAGGGTGG
651 AGTAAAGGAT AGTAGAGAGG TTCGTTGCAG TGTTGTCGAT CAGATTCGTT
701 GGGCCTACGG GATTAAAGAT GATCCCTGTG GATTGATTTT TTTCGATCAC
751 TCTAAGTCCA GTTAAGAAAG TAGGCTGAAA TGGTTGAGAC GCATCTGTTT
801 GTATCGCTAC CTTGAACTTA GGGTTCAGGT GATTATTGTA AAATTGCATC
851 TCGTTTGAGT AGCAGTCTAC GTTTTTTTCT TGCCACGCTT TTCCCAAAGG
901 CTTGAAGTTT TGCTCTAGAA CTTTCTGCCA GTTAGAAGAT ACCTTTGAGG
951 TCATTTGGTG GTAGACTAAG AAGGTTACAA CTGAGAAGAG GGCCGTGGTA
1001 ATGAGAAGAG CCAAAAATAC AGGGTTCCCT AATACTATCG TTAAAGAGAT
1051 TCCAGCCACC AAAGCTCCTA AAGCTAAAGA AGCTAGGATT GCAAGAGTGG
1101 ATATTTTTGC TATGGTAAAC TGTTTTTTAG GAGCAATTTC TTTATCCCGA
1151 GGCACATAGG ATAGTACAGA AACTTGAGAG CTCTCAGTAC GTGAGGGTCC
1201 TGACATAACA TTTTTTTTGT AAAATACTTT CTATAATTTT AACATATTTG
1251 TGTTTATCGA TCCGAGAAAA TTGGAGAGTG AGAGCGCATG TCTTGCAATT
1301 TAGAATGATC GGGGACGACA TCTAGAGCTA TGTAGACATT GCGTGCGTAG

1351 TGGGAGCAAA TATAGCGAGA TATAAAGTAT AAGGGAATTG CTGTTAGGAA

1401 GATAAAGGAG CACAAAGGGT GGATACATAG CCCAATAGCT ATGGTGGTAG
1451 CAATCAGAGC TATCCAGACG AGTGCAATCG CAATAGTAAC GAAGAGGGCA
1501 AGCTTGAAAT TATAGCGAGG ACGAGTAGCT GGGGGAAATA GAGAGGGAGC
1551 CGTTCCATCA AAACCGGGAG TAGCTGAAGA AGCCATAAAC TATTAAAAAT 
1601 TAAGTTTTTT TCGGAGCATA AAGCATTTTA AAGTAGTGGG GTCTTTTTTG 
1651 TCACGGAGAT GTCCTGGACT TCCCAAGCGT TTCTAACAAA GATACCTGCT 
1701 TTTGAGAGGA GAACTTTTGA AACTCCTGCA AGGTCATCCT TCCTTGGCAC 
1751 CAGTAGGTTT TTTCAGGAAA TCGCGGAAAG ATTTTGGCGA AAGCTCTTAC 
1801 AGTTGAAGGG CTTGTGAAGA TAATTTTTTT GTATTTAGAT AAAATATTTT 
1851 TTTTAAGTTT TCGCGGCTTC ACTGTGTAGT GAGGGTAAGA GAAAAAAGTA 
1901 AATCGATTGT AAAGAAATTC TCTGATCACA GGTCTTGCGA GGGAGGAGTG 
1951 GGGGTAGAGA ATGCGGGCTG AAGAGGGCAG TGCCTGTAGC AATGGGAAGA 
2001 TGCCTTCAGC GATTTCTTGA GTTGCTACTA CGTACTTCAC TTGTCCAAGG 
2051 AAAGAGAGAA GTCTTTCTTT GGTGGACTCT CCTATACAGA GGTAGGTCTT 
2101 TGTTTTTAGA GTGGCCTTAG AAAGAAGAGA AGTCATTCTG GAAAGGAATA 
2151 GGTGAGTGGA TGAGGGACTT GTGAGAATCA CATGGGTTGC TTGTGGAAGG 
2201 AATTGAAGAG CACGCTTATT TTGTGGAGTG CTTTTTGCAT AGGGGAAGAG 
2251 AGTTAGAATA GGCAAATAAT GAGCTTGGTA TTTACGAGCG GTTTTTTGAT 

2301 TCAATCCTAA GTAGAGGGTC ATGAGTCTTT TCGGGTAAAA GGAAGGCTGC
2351 CTAAGTTTTT GTACCTTCAA AGGGATATAT TGAAAATAAT TTTTCTTTTT 
2401 CCCTTGGTTC TTCTTGATCA TGCGTTGATT GACATTTTTC ACTTTGAAGG 

2451 CTAGGCTGGT TTTTTCTGGA CTTAGAGGTT CTCTCTATTA AGGCTTCGTC
2501 TTTAGAAGTC CTTGCTAAAA GTTTTTGAGA AATTTAAGAA ATTCGCAATA 
2551 GTGGAAATAT TTACAAAGGT GGTTGCGGTG GTTTCGTTGT TGCATAAGTT
2601 TTTAGAAAAT GCTTCGGGGA AAAAGGGACA AAGTTTAGCT TCGACAGCGT 
2651 ATTTAGCAGC TCTTGACCAT CTCTTAAATG CGTTTCCTTC CATTGGGGAG

2701 AGAATCATTG ATGAGTTGAA GAGCCAGCGT TCCCATTTAA AGATGATTGC 

2751 TTCTGAAAAC TATTCTTCAC TTTCAGTGCA GTTGGCTATG GGGAACTTGC

2801 TCACAGATAA GTATTGTGAA GGAAGTCCCT TTAAGCGTTT CTATTCCTGT 

2851 TGTGAAAATG TAGATGCTAT TGAGTGGGAG TGTGTAGAGA CAGCGAAAGA 

2901 ACTTTTTGCT GCGGATTGCG CTTGTGTTCA GCCTCATTCT GGGGCTGATG 

2951 CTAATTTACT GGCAGTAATG GCCATTCTCA CGCACAAAGT CCAAGGCCCA 

3001 GCTGTCAGTA AGTTAGGTTA TAAAACTGTA AACGAATTAA CAGAAGAAGA 

3051 ATACACTCTA CTTAAGGCTG AAATGTCTTC TTGTGTTTGC TTAGGACCTT

3101 CATTAAATTC TGGAGGCCAT TTGACCCATG GGAACGTACG TTTAAATGTG

3151 ATGTCTAAGC TTATGCGTTG CTTCCCCTAT GATGTCAATC CGGATACGGA 

3201 GTGTTTTGAT TATGCAGAGA TCTCCCGGTT AGCTAAGGAG TATAAACCTA 

3251 AGGTACTGAT CGCAGGATAT TCTTCCTATT CTCGAAGATT AAACTTTGCA 

3301 GTTTTAAAAC AGATTGCAGA GGATTGTGGA TCTGTCTTGT GGGTAGATAT 

3351 GGCGCATTTT GCAGGCCTAG TTGCTGGGGG AGTGTTTGTT GATGAAGAAA

3401 ATCCTATTCC TTATGCAGAT ATAGTGACAA CAACAACGCA TAAGACATTA
3451 CGCGGTCCTC GCGGGGGATT AGTTTTGGCA ACTCGAGAGT ATGAAAGCAC
3501 TCTCAATAAG GCGTGTCCTT TGATGATGGG AGGTCCTCTA CCTCACGTGA 
3551 TAGCTGCTAA AACAGTGGCT TTGAAGGAAG CTCTCTCTGT GGATTTCAAG 
3601 AAATACGCTC ATCAGGTTGT AAATAATGCT CGTCGATTAG CAGAGAGATT 
3651 TTTAAGTCAT GGGCTACGTC TTTTGACGGG AGGAACAGAC AACCACATGA 
3701 TGGTGATTGA TTTAGGTTCT TTGGGCATTT CTGGAAAAAT TGCTGAAGAT 
3751 ATCTTGAGTT CCGTAGGAAT TGCTGTGAAT CGGAATTCAT TACCTTCAGA
3801 TGCTATTGGT AAGTGGGACA CTTCAGGTAT ACGTTTAGGA ACCCCTGCAC 
3851 TAACGACTTT GGGTATGGGT ATCGATGAAA TGGAAGAAGT TGCAGATATT 
3901 ATTGTGAAAG TATTGCGAAA TATTCGTTTA AGTTGCCATG TTGAAGGGAG 
3951 TTCTAAGAAA AATAAAGGGG AACTTCCTGA AGCCATAGCG CAGGAAGCTA 
4001 GAGATCGTGT TCGCAACTTG TTGCTGCGTT TCCCGCTCTA CCCTGAAATT

4051 GATTTAGAAG CTTTAGTTTA GTTAGGAGAG ACATTATTTT ATGGCAGACG

4101 GGGAAGTTCA TAAATTACGT GATATTATAG AAAAAGAGTT ATTGGAAGCG 

4151 CGCAGAGTAT TTTTCTCAGA GCCTGTAACA GAGAAAAGTG CTTCCGATGC 

4201 AATTAAAAAG CTTTGGTATT TGGAATTAAA AGATCCTGGA AAGCCTATAG 

4251 TTTTTGTGAT CAATAGTCCT GGGGGATCTG TGGACGCAGG TTTTGCTGTT 

4301 TGGGATCAAA TTAAAATGTT AACCTCACCC GTCACTACTG TTGTGACAGG 

4351 GTTGGCAGCT TCTATGGGCT CGGTATTGAG TTTATGTGCA GCTCCTGGAA 

4401 GGAGATTTGC AACTCCTCAT TCTAGAATTA TGATTCATCA ACCTTCAATA 

4451 GGTGGACCGA TTACCGGTCA GGCAACCGAT TTAGACATTC ATGCGAGAGA 

4501 GATTTTAAAA ACAAAAGCTC GCATTATAGA TGTCTATGTA GAGGCGACAA 

4551 ATCAACCTCG AGATATCATA GAAAAGGCTA TCGATAGAGA TATGTGGATG

4601 ACAGCCAACG AAGCTAAGGA TTTTGGTTTA TTGGATGGCA TTTTATTCTC

4651 CTTCAACGAT CTCTAAATAT TTTATCTATT CTGGAGCAGG AAATCGTTTC

4701 CTTCTTGGTG AAACACTTCC TGAGGTTGAA GATGTTCGGT TCTTATGCCA 

4751 AGAGACGAGG GTTGATGGTT TTTTATATTT AAAGCCCTCT TCTTGTGCTG 

4801 ATGCGCAACT CATTATTTTT AATTCCGATG GATCACGTCC AACGATGTGT 

4851 GGTAACGGCT TGCGTTGTGC GATTGCTCAC TTAGCTTCTC AGAAGGGAAA

4901 ATCGGACATC TCTGTATCTA CGGATAGTGG TCTATATTCA GGATATTTTT

4951 ATTCTTGGGA TCGTGTGCTT GTAGATATGA CTCTCGCAGA TTGGAGAGCT 

5001 TCTGTTCATC GATTGGAGTC GCGTCCTGAT CCTCTTCCCA AAGAGGTCGT 

5051 TTGTATCCAT ACGGGAGTGC CTCATGCTGT CGTAATTCTT CCTGAGATTT 

5101 CTACTTTAGA TCTTTCTATC TTAGGTCCTT TTCTTCGCTA TCATCAGACC 

5151 TTCTCTCCAG ATGGGGTGAA TGTCAATTTT GTTCAGATAC TGGGACATTG 

5201 CCAGTTGCGC GTTCGTACTT ACGAACGTGG AGTCGAAGGG GAAACTGCAG 

5251 CTTGTGGAAC AGGGGCTCTA GCTTCTGCTC TTGTTGTGTC AAACTCCTAT

5301 GGATGGAAGG AGTCGATCCA AATCCATACT TGGGGTGGAG AGCTTATGAC 
5351


TGTGAGTCAA AATAGGGGAC 


GGGTA 


1 


GTAACTAGAG 


5401 


ATTTATAATT 


AGATGTGATT 


TTTGATTTTo 


I CA1 OtAAyu 


ATTTTAAAAT 


5451 


CTTGTTTAGG 


GATAGATCTT 


GCTCTCTAAC 


1 oooAI 1111 


PTATAATCGT 


5501 


AATTTATGAT 


GACGTATCCT 


GTACCACAAA 


AC.Ct.AV- 1 I C 1 


TTTAAGAATC 


5551 


CTTCGTCTTA 


TGGATGCATT 


CTCTAAGTc 1 


p a PP 1 AG A. 
oALuA 1 onun 


GGGACTTTTA 


5601 


TTTAGATCGT 


GTTGAAGGGT 


TTATTCTCTA 


CAIAoAl 1 1A 


GATAAAGACC 


5651 


AAGAGGATCT 


AAATAAGATT 


TACCAAGAAT 


1 AoAAoAo AA 




5701 


TATTGTTTGA 


TTCCGAAGTT 


GACGTTTTAT 


GAAGTAAAAA 




5751 


AACGTTTATC 


AATGAAAAGA 


TTTATGATAT 


CGAT, ACL AAA 


AGTTPP 


5801 


TTGAGATTTT 


GCAATCCAAG 


AATGCCCGTG 


AGCAGTI III 


A P a G TTT A TT 


5851 


TATGATCACG 


AGGCAGAGTT 


AGAAAAGTGG 


« » OOTl "ft M "1**1 "P 

CAGCAAITI 1 


A 1 1 MOrlOV-O 


5901 


TTCTCGAATT 


CGAATTATAG 


AATGG CTTCG 


C AATAA I AAo 


TTPP ATTTTG 
1 1 V-V-Ml 1 1 1 V? 


5951 


TCTTTGAAGA 


AGATCTAGAT 


TTCACAAAGA 


Al o 1 1 1 iubA 


A P AGTTG A A A 
r\V— AO 1 i o/vrvtt 


6001 


ATACATTTGT 


TTGATGCCAA 


GGTGGGoAAA 


bAAA 1 CMV- 1 \_ 


A AG PG CGTC A 


6051 


GTTGTTGTCG 


AACAAAGCTA 


AoAI I I AC 1 A 


TTPPA ATG A A 


GCATTAAACC 


6101 


CTCGTCCGAA 


ACGAGGCCGT 


CC 1 LLbAAot 


A ATPTGPTA A 


GGTAGAAACA 


6151 


GAAACAACAA 


TTTCGAGTGA 


1 A J. 1 


A/irtO 1 V—V- V- 1 >- 


AGGCTGCTCG 


6201 


TCGTTTCCTT 


TTCTTACCCG 


AG ATT AC- 1 IX- 


ACCC 1 C 1 1 V„M 




6251 


CAGAAAAATT 


TGATACGGAA 


GAAGAATTTC 


1 1 OC 1 AjHC 1 1 


GPGPGGTTPG 


6301 


ACTCGTGTTG 


AAGACCAGCT 


GAATCTTACC 


AATC I 1 ICAo 


AP Ar2^TTTP.P 


6351 


TTCT CTT AAA 


GAGCTTTCGG 


CTAAGCTTGG 


TTACGAC I C 1 


ti 1 1 LI nL i O 


6401 


GAGATTTCTT 


TGGTGATGAT 


GATGAGAAAG 


TGGTCAC I AA 


P A PP. A AGGGG 


6451 AGCAAGCGAG GCCGCAAAAA ATCTTCTTAA TCTTCTATTT TGTGAAGTAG
6501 TTTATTTTTA GACGCTGTTC TTATTGCTTC TTTACATGAT CTTATTACAA
6551 ATCTTTCTTA TTTCTATTTA TTGTTTTGTT AAAATTTTAA CAATAGCTAT
6601 TTATTATTAG TCATTTTTTT AATTAAAAAA CTGTTAAAAT TTTTAAAGCT
6651 AATTTAAGAA ACAGTGAATA GTTCATCATG TCATCACTAC TGAGCTGCGG
6701 AAGAATAGAG CCGACTCGGG TTACCTGTAG CTTAAAGACG TATCTTGAGG

6751 ATACGAGTCA GAATCAGTTG AGCACACGTC TAGTTCGGGC AAGTGTCATC

6801 TTTTTATGCG CATTGTTGAT CATTTTGGTT TGTGTGGCCC TCTCTAGTTT 

6851 GATTCCAAGC ATTATGGCCT TGGCGACCTC TTTTACGGTA ATGGGGTTAA 

6901 TTCTTTTTGT GATGTCACTT CTTGGTGACG TTGCAATTAT AAGTTATCTT 

6951 ACTTATAGCA CTGTTACGAG TTACCGGCAA AATAAGAGAG CTTTTGAGAT 

7001 TCACAAGCCC GCTCGCTCCG TTTACTACGA GGGGGTCCGC CATTGGGATT 

7051 TAGGACGATC ATCTTTAGGC ACAGGCGAGA TTCCTATAGT AAGGACGTTA 

7101 TTCTCTCCAT TTCAGAACCA TGGTCTTAAC CATGCCTTAG CTGCTAAAAT 

7151 TTTCCTATTT ATGGAGCATT TCAGCCCTGA GCCACCGAAC GAGCCTTTGG 

7201 TGGATTGGGC CTGTTTGATT CGGGATTTTA GGCCTCACGT CAGTTCTTTG 

7251 TGCTTTGTTA TTGAAAAACA AGGGTCATCG CTGAGGACTA AGGAAGGCAA 

7301 TACGATTTGT GAGGCTTTCC GCTCTGATTA CGACGCCCAT TTTGCTATGG 

7351 TAGATTGCTA CCGGTTGATC CACTCTAAGT TGATTATAGA GAAAATGGGA 

7401 TTGAAGAATA TCGATATCAT TCCGAGTGTC ATGGTTCGTG AAGATTATCC 

7451 TAGCCGTCCT GGGGAGGGCT ATCGCGAAGG CCTATTACGT ATGTATGGTG 

7501 GCAAGGGGGC TCTGTGACTT CCCTACTTTA GTTCCTAATG AGCGCTTGCC 

7551 CATAGGGCCT TTCTTTGTCC CGCAGCACAC TTCCGGTGCG AAGGGTAAGG 

7601 AGTTTGCTAA AAGGAATTTT TCTATAATTT CGGGATTGGA TGACATATTA 

7651 AAATTATGTA TTCTTCAAAG GCGTCCTTTT GCTTTGCAGT GGGATAACCT

7701 CTCTGTGAAA AGTGATTATG AGGAGGCTGG GCCCGCTATT GGGATACGTT 

7751 CTCTTGAGCC ACAAGTTTCT CAAATTTCTC CAGCCCACGG CCGGCTATGT 

7801 AGTACTTTGG TCCAGTGGGC CCCTATCCTT GGTTCTGAGG AGCAGCTAGT

7851 TTGGTTAGAA GAAACAATGA AGCGCCTAAA GTTTCCTAAA AGTTTAGGTA 

7901 GTAAGGACGC TGTTATTGTG GATTCGGAAA TGGTTCCTGT GAACGCCAAT 

7951 CCTACTCAAG AGATACCTGC AGCTTCCGAG ACTGTAGAGT CTTCACCTGT 

8001 AGCTCCAGGG AATACAACAG ATACCATGCC TGCAGCTTCG GGAACTACAG 


8051 ACACCACATC TGGGGTTTCA GAGGCTGCGG CGGCTGAGGC TGCCGTGGAT
8101 TCTACACCAG GGACAGAGGA GGAGCCGAGT TTTTCTCTGA GGTATGCGCT
8151 TGTAGTTCAA AATGTTCCCT ATCCAGAGCC GCCTAAAGAA CCTGAGGTGA
8201 TGTTTACAGA TGAAGAAAAA AGTCTGATTT TAGAAGCTAC TCGTGCGCGT
8251 CGTATGGAGT TGGACTTGTA TAATGGCTAT TTAGCTGATT ATGAACTTTC
8301 TAAGGATGAA ATACAGAAAC ACGTTCCTGA TTTACCTGAG AATTGGCGTA
8351 CGAATTGGCG TTGGTCGGAG AGGCTCTATA AATTTTTCTT TAAAACAAAG
8401 AAAGAAGGAT TAGAAGAAAT TTTCTTAAAC AAAGAGTTAG GGAATATGAT
8451 TCTTGCCCGA GGGCTGGCGG CAACTCAGTC ACAAGCACGT ATTAAAGTAT
8501 TCAATTCTTT AGTGGCATGG CTCTTGCAAA GCTTTAACGT AGGGAGGAGC
8551 TGTACAGCTA AACCTCTTCC TACGTCAAAA CTAGACCTCT TTAAATCGGA
8601 ATTCGAGTCT AAGCCTAAAA ATAACATCTT AACGGAATTT TTGGTGGCCT
8651 CTGATGAGGA GATTCTCTTT AAGGGGCTAC GGGTCCTAGA GCCTGGAATC
8701 GAAGGTTGGT ATGACCATCC TGATCAAGCT GGAGAGATTC GGTCGGTACT
8751 CGAGGGTCTG GTGCAGGCTG GACGTATTTC TGGATATTGG GAGAATCAGC
8801 CGTTTGGGAG ATTTGTCCTT AGAGGAGTTG GTGAAAGACG TACCGAGCTT
8851 GTAGAGCTTT TGGAGAGTTT AGTTGCTTCT GGTGAGATTA TGCAGTTCTT
8901 TGAGTCTTCG GATGAAGAGG GTGCTTTTAT TATCGATAAC GAACCTAGCA
8951 AGACTGCTAT GCTAAAACAG CGATTTAAGA GTTGTGTCAG GACGAAGCTT
9001 GTCGGGAGTT TTGCTGATGA GAGTCTTCCC AGAGGTAGGT TTACCATTTT
9051 AGTTTAGCGT GGGGTAGAGC ACTCCACGAA TCTTAGGGAG CTCCTTGCGA
9101 CCAAGCTTGG AGATCCTCCA TGTTTTATTG TTTCTCTAGT AGCCAAATCG
9151 TAGCCGCTCC TAGGAACAAT TTTTTCTTTT TCGCAATATA AAATCCTGAT
9201 TTAGAGAATA GGTCTTCAAG ATCGTGGTCC TTTGGAAGTT GCTGGATACT
9251 TTTGCTGAGA TAGCTATAGG CGTCGGGATC TTTAGAAACA GACTTTCCAA
9301 TCCAGGGGAC GACAGCACGC AAATAGAGCT TATGGGCACT ATAGGTAGGG
9351 TGTGTTTTTT TTGGAGGTGT GAGCTCTAGA ATGCCCAGTT TTCCAGAAGG
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9401 . CATAAGCACT CGGGAGATTT CTTGTAGGGC TTTATGTGGA TCCGAGAGGT 

94 51 TCCTGAGGCC ATAGGCCATC GCTGCTAGGG GATAAGAATG ATTCTCCAAG 

9501 GGCAGTTGAT TAATATCGCT ATGAATAAAA GAGCAAGAGC CCTGGGGAAG 

9551 GTGTTGTTTT GCAATGTCGA GCATTGCTGA GGAAAAGTCG ACGAGAGTTA 

9601 CTGATGCTTG AGGGTGTGCG GCAATATAAC GCTTCGCGAC TTTTCCTGTT 

9651 CCTGCGCAGA GATCCAGGAG AGAGTATCCC GACCCTAGGA TCTGGATCAA 

9701 AGAGCGATTC CAGAAATGGT GCATTCCTAA AGAGAGTATT GTATTTGTGC 

9751 GATCATACTT ACTCGCTATG GAATCGAAGA TCTTTTTACA GTCGGGCTTG

9801 TTGGTAGAGG GTTCCATAAT ATTCCCGGAA TTTTTCAAAG CTTTCGTAGT 

9851 GTTCTTCTCC TAGACGGTAC TGGCATAGGG CATAGTATTC TTGAAGAAGA 

9901 GAAGGGGGCA GACCTGTATG TTGATGAGCT TCTTTAAGGA CTTCTTCGGG 

9951 TGAAGATTCG AACTGTTGGA GGGCTTCTTC CATCGCAAGG TTGGGTAGGG 

10001 GATGTTCTTT CCAAGAGGTG CTGTGTAGAA GAAGAGCAAA TACAAAAGGT 

10051 AGCTTTGTAA GATCATACCA CCCCGAGGCA AGGTCATAGG TTACAAATCC 

10101 AGGAAGTACA GGATGTTGTA GCGCTGCATC TCCGATTAGG AGGAGGCCAT 

10151 CATAATTTTC AGGGGTTTGT CTGAGTACTT TTGTAGTTAT GAATCTTAGG 

10201 ATATGAGGAG TTGGGATGCG CCAGAGATGA CGACAAAGCA CTTTTAAGAG

10251 TCCTATAGAG GAGCGACTTT CTAAAGTTGC GGCAATCCGA GGTTGCGGTG 

10301 AGTTAAAGAA AGTGGGAGCT GCATAGAGGT TTACACTGAG GATACGTTGG 

10351 TTTGCTGCAA TTCCAAAGCC GGGGACATAC CCCAAGTTAT GAGAGATAGC 

10401 TCCTAGGGAT GAGGTCAAAG CAACATCGAG TTTCCCTTCG ATTAGCAAGT 

10451 TGAGGAGGTC TGCAGGGGGA GCAAGAACAC AGCGAATATC GTTTCTTTTT 

10501 ATGAGTTGTA GGGACAGCGG AAAGGAATTA ATATAACTTA CGCAGCCTAA 

10551 GCTTATACAT GGCTGGAGTT GGTTAGACAT GGCGTTCTCC CTTGTTGTGT 

10601 GATGAGGGCC GCCATTCCCT CAGCGTCCAT TTTAATAGGT TCTTTAGATG 

10651 AGGCCATCTG GAAAACCTTT TCCCCCATAT GTGTTGAAGA AAGGTCATTA 

10701 GCACCACAGG AAAGGAGGTC TAGAGCTGCC TCAATACCTA GGTAATTCCA 
10751 TAAGGCTTTC ATATTGGAAA AGTTGTCTAA GAAGATTCGG GCTACTGCCA

10801 TTAAAGATTT TAGAGGGATG GCATGACCCT GGCCTGATTT TCTTAATCTT

10851 TTTCCTAGGA CATTATTTTC TTGGGCGAAT TTTAGAAGTA TGAAGTTTTT

10901 AAAGCCCTGA GTTTCGTCTT GTAAGTCGCG GACTTTTACC ATGTGGGTGA

10951 CGAGGTCTTC AGGTCCTTCT TTATGATAGC AGAGCATGGT TATATTGCTA

11001 TGGATTCCCA GTTGATGAGC CATCTTATGG ATGTTGAGAA AATCAGAAGA

11051 AGAAAGGCGT TTGGGAGCTA AGAAATTACG TATTTTGTCG ACGAGGATTT

11101 CAGCTCCTCC TCCGGGGATG GAATCAAGAC CCGCATCTTT TAATGTGAGA

11151 AGAACATCGC GAATAGAAAG GTTATCAAGA TCTGAGAGAT AGGCATATTC

11201 AATGGCAGTA AGAGCTTTGA TATGGATCTG AGGATCGTAC TCTTTGATTT

11251 TAGTAAATAG ATCGGAATAG TATTGCAGAT TGCAGGAGGG GAAACAGCCT

11301 CCCACGATAT GTACTTCTGT AATTGGAGTT TTTATATTTT GGATTTGCTG

11351 TAGAAGATCA TCTGGGGAGT AGAGCCATCC TTTAGGGTCT CCAGGTTTTG

11401 CATAGAAAGA GCAAAATTTG CAGCTGAAGT CACAGAAATT TGTAGGATAG

11451 AGGTACAAGG TTGAGGAGTA GTATACAGTG TCGCCAACCC GTTGTTTGCG

11501 AACTTGGTCT GCAAAATTCC AGAGTGTGCG TTGATCTTCT TTATTCGTGA

11551 GGAGGAGGAG ATGAAGAGCG TCTTCACTGC TTAATCGTTC TTGGGCATCC

11601 AGTTTTTCGA ATATGGAGTA GAGGGGGGAA GTTTTAGGGG GCTGTGGGAG

11651 GCACGTCGTC ATTTGATGAA CACTTTGATG TACTATTCTC TCGAGATTTT

11701 GTAGCACAGT GCTCTGTTTT GTCACATGTT TTTTTTTGGC AGCAATCTGG

11751 TTTTTGACAC CCTTTAGAGA GGGGCCTGCC AAGCAAATGG GAACCTACAA

11801 GTAGAATACC CATCCCTAGA CCTAACAATA CTGTGGCACA GCAGATTACG

11851 AGAAAAAGTG TCATCATAAG AAATCCTTAG ATAGGATAGT TTCTTAATTT

11901 AAATCCACCC AGATTGGGGA ACTCCAGGCC ATAGCATTGT CTGCCTGAGT

11951 GACCCTGAGA TAGTAGAATA CAAAAGGTGC TTTACCGTTT GGATCTTTTA

12001 GGGTCACTGA ACTTAGGGGT ACCATAfCAT CGTATTCATA GTCCAGGTTA

12051 TTGCTATCGG GGAAGAAGGT ATGGAGAACT TCGCCATTGC GGATGATTTC

12101 TACAGTCTTG AGTAGGGCAG TGCCTGCCAC ATGACCAGAG ATGTGACGGT

12151 TGACGTTGAG TCCAGGTTTC GACCCTGTGG AGAGTTCGGA GCCCATAGGG 

12201 GCTGAAGTGA TGTTGAAGCT TAAGACGATC CTAGGTCCTG TTGTAGCGTA 

12251 GCAATGACGT GCGAATAAAG CTTCAACAAG AGACTCTCGG GTATATTTAT 

12301 TACAAATGAT AGCCGTCAAC CCTGGGGAAT ATTGCACTTG CGGAGAGTCA 

12351 AAGTAGTCTT TATAAATTCC TCGATCGTCG AGACCCCCAG CAACAAATCC

12401 GAAGCGGAGA TTCTTCTTTA ATCCTTCAAT TACTGTACCT CGAGGATCTT
12451 CGCTATCTTT ACCTTGGATA GGGAAGGGGT TGTTTAGAGC GGCTGTGGTT
12501 TCTGAAGATC CCCAGGCATT ATAAATTTCT ACAACTCTTT CGAACTCGGG 
12551 GTAGAAATTC TCAAAGTCAA AACCATGTTC TTTAGAAGCT GTGAACGAAG 
12601 GAATAGAAAT CATGTCGTGG TTGACAGTGC TTTTATAGAG CTTGGCGAGG
12651 GGAATATGTT TGTATTCTTT GTGTTTCGAG TGGGACTTTG TTTCCTTGGT 
12701 ATGAAGGATG TGACGCACTC CCTCGAGATG AGGTTCTCCG CTATATTGGA 
12751 ATCCGGATAG TGTGATGAAG CGATCTTCTT CATTAAAGTC GGAGACAGTT
12801 TGATTGATGA GCTTCCAAAT ATCTGGAGAG AGGTTCTCTT GATTTTCGAA 
12851 TGATGAAGAA GCATAGAAAT TCAGAGCGCG GTCATCTCGG AAATAACGCA 
12901 TACAAGTTTC AATATTTTCT TCAGAGTCGA CGCGTTCGGA TTCGCCGTGG
12951 AGGAGACCCC ACATAAGATT CGGGGCGGAG TCAGCGAAAC ATTTGATAGG 
13001 GGCAGAGATG AAAATTTCTT GTGTAGAGAG GTTTTTCAAT TGGATGCGAT 
13051 AAATTCCAGG CTCATTGAAA TAGAGATTAG GAAGAATAAC AAAGCCTGTT 
13101 TCTGGGATGA AGAGCTGCCA ATTTAAATTT TCTCTAAQAT GCTCGTAGGA 
13151 AAGCTCGATT CGGGTCTCTT CAGGAGAGAA GTTGGTGAGG TTCCCGAATT 
13201 CGTCTTCAAA TCGCACGGTG ATATCGAAGC GTTTGTTTTT AACGACATAG 
13251 GAGGGAGTAA AGATCTCTAT TTTTTTTAGG ACGTTTCCGC GGATATCCAT 
13301 AGAGAAGACA TCGGGTTCAT CATAGTTTCC TTCTCCTGTA GGATCGATGT 
13351 AGAGGTAAAA GGGTTTGCGA CGTTGTGCGA AAAGTTGGGC TCCGTTCCCA
13401 GCATCATCGA CTTGAGGATG GTTTGGAGAG GCTCCCATGA CAATAGTGAG 
13451 GGTTTCTCCT ACTTGAAGTT CGTAGGGGAG AGTAAACTCG AATTGTGGAA

13501 CGGGATTGTC TTTTACAGGA ATGGCGGTTG CTTCGATGAT TTCGCCTTCT 

13551 GGCATTTCTG CGTAGATTAC GTTTCTAGTT TGGGAGAGAT CTGTCGCGGG 

13601 GGCTTCCCAA TCTGTGGGTT TCCCACTTCC TGCTAAGTCA AATTTACATT 

13651 TGGTTCCAGC TGGTAGTGGT GTGGCAAGGG AATAAAGAAA TTTCCAAGTA 

13701 GAAATTTGCC CTGCTCGAGC TATCGAAGGG TTAACGTAAC AAACAGATCG 

13751 TCGCATAGTA AGAGGGAGGC TTTATATGAC TTAAAAGCGC CATCATATAC 

13801 TAACAGTGAG GTTTTTCTCA ATCCCCGTCT TTGTTTAGTG TTTGTATCGC 

13851 TTCATCCACA GTATTGAATA TTTTAAAGTA AGAAAGGAAT CCTGTAACAT 

13901 AGAGAGTTTG TTCTATGGTT TTTGGGACTG TAGTCAGGAC AATTTTCCCA 

13951 GAATGTTGTC CTACTTGATG GTAGCTTTGC AGTAGGACTC GGATACCTGC

14001 ACTGGACATG TAATCGAGGT GAGCACAGTC GAGAATGATA TTTTTGGATC 

14051 CAGCTGCTAG GGATTGGGAA ATATTTTCTT GTACTTCTGG AGAAGAAATT

14101 CCATCAAGTT TTCCGTGGAG ATGAAAGATT GTTGTTGAGC CGTGTTCTTC 

14151 TTTTTGGATA TCACTCATCT AGATAGTTCT CCTAACTATA CGGGAGCTTA 

14201 AGTTTTCACT CTGATAAATC TTTAGCTTTT TTGCAAAGAG ATTTTTATTG 

14251 GTGATGTTTG AGAATTTCGA TTGGGGGGAG GCAGGATGGG ATCGTGGAGT 

14301 GCAAGGAAAT CGGTCCTAGG ATGTTTACAT TCTTAGGAGA TATTGAATTT

14351 ACGTTTTCTT GGTGTGATTT TTAGTTTTCC GACATTTCGT TCTGTGCAGG 

14401 TAATGATTTC TATATCGAAG TTTTCGTGAT GGATACGCAT TCCTTTTTGG 

14451 GGAACAGCAC CCACTTTATG GAAGACATGT CCTCCTAGTG TATCGTAGCT 

14501 ATTTTCATGA TCGATTTTCA AATTGAAGTA CTCTTCAGCG TCGGAGATAT

14551 TCATTCTTCC ATCTACAATC CAAGAGCTTC CGATTTTCTT ATAAGGAGTA

14601 TTTTCTTGTA CGTCGTGCTC GTCTGCGATC TCTCCTATAA TTTCTTCGAT 

14651 AATATCTTCC ATGGTAGCGA TGCCTTCTGT GAATCCGTAT TCATTGACTA 

14701 TGATGGCTAG ATGGCGATGT TTTTGTCGGA ACTCTTGGAG AAGAGAGGAG 

14751 GCTTTTTTTA TTTCTGGGGC ATAGAATGGG GGTTTGCTAC TGAGGATATG 
14801 GGTTGGCTGA GGTCGTGGCT GCTTGTATAG AGCAGTAAGA GATCTTTAAC

14851 AAGAAGGATT CCTGTGATGT TGTCTAAGTT TTTTTTATAA ACGGGAACGC

14901 GACTGTAGCC TTCTTCGCTT ACGAGAACCA GAGCTTCTTG TAGTGTAGTT

14951 TCTTCGGGAA GTGCGAAAAT ATCTACTTTT GGGATCATGA CTTCACGGAC

15001 AATGAGGTTA TCAAAAGCGG AGAGGGCTTC GGAGAGCTGG CTTTGAAATG

15051 ATGTTGAAGA TCGTACTTGT TGGTTAGGGC GGCGTCTGTA AAAGAGCAGT

15101 TGCAGTGGGA AGAGACCGAG TTGGAATACC GAAGCTAGAA AACGGAGGTG


15151 [illegible] [illegible] CTTTTGTAGA GATCCATGGG GGGAGGAATC

15201 [illegible] [illegible] AGAGAGTATA GGGGCCAGAA TAGGAGATCT

15251 [illegible] [illegible] GAGGAGGGTA TAGAGTTTTG TCCCGAGAGC

15301 [illegible] [illegible] GCGTGGCGAG AATTGTAGGA GCACTGGGGA

15351 [illegible] [illegible] TTATCTTTGA AGAAGCGTTG GTTTAGGGTT

15401 [illegible] [illegible] GTGACAGGAC GGTTGCGTAA GCCCGAAGGC

15451 [illegible] [illegible] ATATGGCTAA AAGAATATGG AGCATGTTAA

15501 [illegible] [illegible] TTTTCTTAAC ATACACAGGA TTTGATTTTC


15551 TTTAACTCTC ATTTTTCTCT TTTCTTCTGA TGAGGTGTCG TCGTATCCGA

15601 GCATATGGAG AATAGAGTGG ACGAGGTATC TCGAGATTTC TTCGTAGATA

15651 TCCTCTTGGT TTGGGGATGT GTTCTCTAAA AACCTAAGAG CGGCCTGTGG

15701 GCTAATGAAT GCTTCTCCTA AAACATGAGG ATAAGCGGGA TCTCCGGGAG

15751 CATCAATAGG CAGAGTGATC GTATCTGTTA GAGAAGGATC AGCAAATACC

15801 TTATCATGGA GTTCTGCAAG AGCTTTATCT TCTAGGAAGT AGATAAAAAT

15851 TTCATTAGTT GTTACTTTTA AGTGCTCTAA GAGCGTAAGA ACCAGCTTCT

15901 CTACAGAAAC CAAATGAATA GGAATACATG TTTGCTCATT GGAAACATGT

15951 ATTTTGATCT TTTCTTGCGT CACGCGAATG AAATTCCCAT AGAGAAAAAC

16001 AAACTTATTT TAAAATAGGG GTCTTAGGTA AACCTGTGAC TTTTTTCGCT

16051 GTACTATCAT TCCAACGGCC CAACTTACGC AAGACTTCTA CTCGCTCAAA

16101 ACGCTTCAAA ACATTTCTTT TGGTAACCCC TTTGACAGAT TTACCATAAC

16151 TACGATGTCG AGACATAATC CTGCTCTAGA AATAAACCTA TTTTCGGGAT

16201 AAAAATGCTA TCACTGGTGC TCAATGCATT CGTATGCAAT TTATATACAA 

16251 TTCTTGGAGC TGGCGCTGGA TGCACAATGG CATACTTAGG TTTACGTTTT 

16301 TTAGGACTTT TCGCTCTGCG CCGAGCTTGT TTACTCATTC GTGTCATAAA

16351 TAACTCGATA TTGAATTTTT TAATTTCTCC AGACAACGGA AATTGAGGAT 

16401 ACGGAGTACT TTATAAGAAA AAGGATAGTA AAAGAAGAGT TTTTTTTCAA 

16451 GAAGATGACG TCTTTTAGCT GCCTTGATCT TGGTGTAGCT CGTCAGGGAG 

16501 GGGAGACGAG GGGCATGCCA TGGGGGTTGA GAGGACTCCA CAGGAGATAA 

16551 TATATTTGAA AGCATCTTCG ATTTTCATAT CTAGGAATAC GATATCAGAT 

16601 TTTCTAAATA GGGTAAGAAA CCCTGAGGTG GGGTTGGGTG TTGTTGGGAT 

16651 GAAGACCGTG ACGAGGGGGT CGTCTTCCTT TTCTCCTGTG CAGCATACTG 

16701 TGGGTGCGTC TCCAGCGACG AGACCGATGC ATTGAACATT TGCGTTAGGG 

16751 AAAGGAACCA TAACTACTTG TTTGAAGGAT CCTGATTTTG ATCCAAATAT 

16801 GGTAGTCATG ACTTGTTGCG CAGCTTTATA CACTGTTTTA ATGATGGGAA 

16851 TTCGGTGTAA GATTTTGTCG TAGATAGAGA GTAGGGATTT AAAAATCATA 

16901 ATTCTCGTGA GGAAACCTAG GAGCACTGTG GCGAAAAAGA GACCGAAGAG 

16951 TAAAATGATT TGCAATACGA ATTTTAGAAG AGCTCTATGT TTAGTATAAA 

17001 AGCTAAATTT CTCAAAGAAT TCCGAAGCCA AGCCTACGAA GGGTTGGGTT 

17051 AGGAAGTTCA TGATCATAGT AACAATAGCA ATAGTAATTG CTAGAGGAAG

17101 GAGAATAACA AGTCCTGTAA TAAAGTATTT TTTCATGATT CTCCTGCAAG 

17151 ATATGAGGAA ATGGGCATTT GTTTCTTTAC TATACAGCTT AAGATTATTT 

17201 AAGATAAAAC TTTTCCCGAA TCTTCTGGGG ATAGGAGAAA TCTCCATGGG 

17251 ACATCACGAT ACTCTTGAGC ATAATCGATG CCGATCCGGG CAGTTGCTGT 

17301 TAGAGTCCCA GAGATTTTTT CTTTGCTGAT ATAGAGAGCT GGGGTATTTA

17351 GGCGTTGCCT ATTGTTTTCC AAAGAGATTC CTAGAGCTTG GCACACTTTT

17401 CCGGGTCCAT TGGTGAGAAG GTGTGGGGGT TTATCTCTCC ATTGGCGGCG
17451 TTGGATCATA AGTTCTTTGC CTTGATCAGG AAGGATGGCC CGGATCAGGA
17501 CGGCATGGGG AATGTCCTCA GGTCCAGTGA CAACATTCAA TAGGTGATGC

17551 ATGCCATAGC AACGGTAGAG GTAAGCAGAG CCTCCTTTCA GGTACATCGC 

17601 TCTGTTCCTC TGAGTTTTTC TGTAGTTGTA GGCGTGGCAT GCTTTGTCAT

17651 CAGGGCCACG ATACGCTTCG GTTTCTACAA TGTAACCTGA AGTTATCAGA 

17701 CCCTCATGTG TTGTGATGAG TTTATGTCCT AAAAGCTGTT GCGCTAGTGT 

17751 AATTACATCT TCCGATAGAA AAAAATGTTC TTGTAGCACG TTACGAGGCT 

17801 CTTTTTTTCG TTCCTTTTTT CTTAGAAGGC GTTTTCTTTA TTTTCTTAGG 

17851 TTTATCTTCT GTGGTTGTCG CTATAGACCA GACGATTTTT TGCGTAAGGA 

17901 GATTCACGGA ATCAATAGTG ACTTTTATAG AAGCTCCAGG TTTCATTTTA 

17951 TCTGGGATAG ATTCTGGAAG AGCGTTTTTC TTTAGGGAAT ATTCTTTAGG 

18001 GAGTTCTGCT GCTGCAATGA ACCCTTCATG GCAGAATTCG GTCACTACAA 

18051 ATGAGAGTCC TTCATGATTT GCAGTGATGA TATACGCATG GTATGTAGTT 

18101 TTAGGTTGCT CTTGCAAAAA TTTATTTATG AACCGAGTTT TTTTGAGGTT 

18151 TTCGAAAGAA TTTTCTGCTT TTGCGGATAC TCGTTCTTTT GTAGAGCATG 

18201 CTCTTACGAT AATTTCGAGG TGCGTTTGGT CTATAGATAG GGGGTTGAAG 

18251 AGAAGCCTGT GAACAATAAG ATCGATATAT CTACGTATGG GACTCGTAAA 

18301 GTGGGTGTAG TAGTCGAGCT TAAGTCCGTA ATGACCTTTA TTTTCTGTAG 

18351 AGTAGGAGGC TGTTTTCATA CTTCGGACAA ACTGCGAGTG TAGAACTTGC 

18401 TCTAGGGGAT GTCCTGCTGA CGTAGTTTGC AAAAGGTATT GGTAATCAGG 

18451 TTCTTGTGTG GGAGTGAACG TGATATCAAA GCCCATGTTT TTTGCCAATT 

18501 CTTGGAAGGC GAGTAGGTTT TCATCATTGG GAGGTTCGTG ACTACGAAAA 

18551 GGTAGAGAAA CGCCTTGATG GGAGATATGA TAGGCGACCA CTTCGTTTGC 

18601 TTTAAGCATA AACTCTTCGA TGAGTTTATG GGAGAAGGTC TGGTGGTTTT 

18651 CTATCAGAGC TACGGGTTCT TGAAGATTAT CCAAGGACAT AGTGACTGAG 

18701 GGGAGGACAA AGCGAATGCA ACCACGTTCT TCACGGATAT CGGAAAACTT 

18751 TTTACTTAGA GTGGCCATCT CATTGAGGAT TTTTGAGAGG GGGTGGGAGT 

18801 GTTTCTTTTC AATGATGTTA TCGACTTCAT CGTAGGTCAT ACGATATTTG
18851 CTTCGAATGA CGCTACGGAA AATCTGGTAA TCTGAAAGAT GACCTGATTT
18901 TGTAAACGTC ATAAATACGG ATACAGCGAG TCTATCAACG TTTGGTTTTA
18951 AGCTGCAGAG ATTATCAGAG AGTGCTGATG GCAACATGGG AATGACTTTC
19001 CCTGGGAAAT ATGTAGAGTT ACAGCGTTTA GCAGCTTCTT TGTCTAGGTG
19051 AGAATGTGGG GTAACGTAGT GGGAGACGTC TGCGATGTGT ACACCAAGAA
19101 TGTAATTGTT ATTATGATCG TAGGTGAGGG AGATGGCATC GTCGAAGTCT
19151 CTGGCTGTGG AAGAGTCTAT GGTGAAACAG AGGAGATCAC GGAGATCTTT
19201 GCGAGAGTGG AGAACTTGGG TAATGTGTTT TTGAGAGAAA AGGCTTGCTT
19251 CTTCAATGAC CTCTGGGGGG AATTCTTCGG CAAGGTTATA TTCGGCTTGA
19301 ATTGCCTGAA AGTCCGCTTT AGCGTTGGTG ATGTGGCCAA TAAATTCGAG
19351 CATTTGTAAG GCTGGAGAGG CTCCTTCTTG GGGTTTATCT ACCCAGGGAG
19401 GAGTGCTCAG AAGAATGCGA TCGCCGATTT TGTAAGTGCG TCCGGGAAGG
19451 AGTTCTACTG GAATTAAAGA TTGGGATCCC GACATGCTTG TGTAGGCAAG
19501 TGCTGATGTG GGACTGACTA GTGAGGTGAT CGTTCCTACG AGTGTTGTTT
19551 TTCCTCTTGC GAGTACTTCG CTGATAGTGC CTTTGAGTTT TTGTCCGTCT
19601 CTTGGATAGG GAAGCACGGA GACAATCACG TGGTCACCAT CTAGAGCCCC
19651 GCGTAAATCT CGGGCGGGAA CAAAAATATC AAATGGGTAT TCTTCGGGGT
19701 TGTCGGGAGA AACAAAACCG AAACCTTTTC TAGCATGAAC AAATAGGGTT
19751 CCTGGAATAA AAATCTTCAA GGATTTACCG TATGTTCTTC TCCCTGGTTT
19801 TCTTTTTGGT TTTTTCAACA ATTGGGCTCC GCCTGTAAGT TTAGGACATT
19851 GAACGAGAAA TCCCGTAGTT TCACTCGTGA ACTGAGTGGC TCAACAAAAT
19901 TTTCCCTTTT AAGTGGGTTT TGTGATTATG AAAGGAGCAG TCTCAAATTC
19951 AAAGCCAATT GTACTAAAGA AGCTCTGTTA TTGCAACTCC TTGATCAGAG
20001 AATAAGAGAA TGGAACTGTT TTCTGTTTAT AAGGAAGTTT CCCATCCTCT
20051 TATGAGGATG GGAATAGAGA AACTTAAATT GAAAATTTTG ATTACTTATC
20101 GTCGTTATCA ATAATTTCTA CATCAGCTTC TTCGATATGG TCTTCTGAAG
20151 AACCGTTATT TGAAGGAGGC TTCGTACTGA AACTATGTTT TTTCAAATCT
20201 TCTGTATTGA TGTTAGGTCC ACCTTTAGCA TTGGCTGCCG ATGATGCTGC
20251 TGCTGATGCA GACTGCGATT GCATAGACTC TCCAATTTTT TGCATATGCT 
20301 TGCTTAGGTC TTCAGTAACC TCTTTAATTT TTTCAATAGG AGCGTCATCT 
20351 TTGAGTGCGT TGCGCACGTT TTCGATTCGC TCTTCGATTT CTTTAACTAA 
20401 AGTTTCAGGA ATTTGCTCCT TATAATCTTT AATAGCTTTT TCGGCTCTGA 
20451 AGATCATGCT ATCGGCTTCA TTTTTAGCAT CTGAAGCTTC ACGACGTTTT 
20501 TTATCTTCTT CCTTATTAAT TTCGGCATCT CGAACCATTC TTTGGATTTC 
20551 ATCTTCTTGA AGTCCTGAGC TTGCTTCGAT ACGAATTTTC TGTTCTTTAC 
20601 CGCTGGCAAC ATCTTTAGCT GAGACATGGA AAATTCCGTT TGCATCGATA 
20651 TCGAAGGAGA CTTCGATTTG AGGATGGCCT CGAGGAGCCG GAGGGATATC 
20701 TGTAAGATCG AATCTTCCGA TTTCCTTGTT ATCTTTGGCC ATGGGACGCT 
20751 CTCCTTGGAG AACTACGATG GTAACCGCAG CTGGTTATCA GCAGCTGTGG 
20801 AGAAGATTTG TTTTTTCTGT GTAGGGATTG TAGTATTTCT CTCTACCAGA 
20851 GTCGTCATGA CGCCTCCTAG AGTTTCGATA CCCAGAGATA GGGGGATAAC 
20901 GTCTAGAAGT AGAACATCCT TAACTTCTCC GCCAAGAACA CCACCTTGAA 
20951 TTGCGGCTCC AATAGCAACA ACTTCGTCGG GGTTGACTCC TTTATTAGGC 
21001 TCTTTGCCGA AGAGTTCTTT TACAGTTTCT TGCACTGCGG GCATTCTTGA 
21051 CATACCTCCA ACTAAGAGAA CATCATCGAT ATCCTTAGCG GAAAGTTTTG 
21101 CGTCACTGAG TGCTTTGATG CATGGAGATT TTGTTCTTTC GATTAGAGAG 
21151 GCTGCGAGTT TCTCGAATTG CGCACGTGTG AGTGTCAATG CAAGGTGTTT 
21201 AGGTCCTTGT GCATCCATTG TGATGAATGG CTGATTGATT TCTGTGGAAG 
21251 AGACTCCTGA AAGTTCTATT TTTGCTTTCT CAGCAGCATC TTTAAGTCTT 

21301 TGTAAGGCCA TATTATCTTT GCTAAGATCA ATGCCTTCTT GTTTTTTGAA
21351 TTCTTCGATC ATCCATTTGA TAATGACTTC ATCAAAGTCG TCTCCACCGA 

21401 GGAGAGTATC TCCATTTGTA GATAGAACTT CGAAGACGCC ATCACCGATT
21451 TCTAGGATGG AGATATCAAA AGTTCCTCCA CCAAGGTCGA AGACAGCGAT
21501 TTTTTTATCA CCGACTTTAT CGATTCCGTA GGCAAGAGCT GCTGCGGTAG 

21551 GTTCTGGAAT GATACGTTTT ACATCTAGAC CTGCAATGCG TCCAGCATCT 

21601 TTTGTGGATG CTCGTTGAGA ATCATTGAAG TATGCGGGGA CGGTGATCAC 

21651 TGCTTCTGTG ACAGTTTCGC CTAGATAAGC ATCAGCTGTC TCTTTCATTT 

21701 TCATTAAGAT TTGTGCGCCA ATTTCTTCTG GAGTGTATTG TTTGCCATCA 

21751 ACTTCGAAAA CGGCATCACC TTTAGATCCG GAGGTGACTG TATAAGGAAC 

21801 GGTTTGGATT TCCGAAGCTA CTTCAGAGTA CTTACGGCCA ATAAAGCGTT 

21851 TTGTAGAGCC GAGAGTTTTT TCTGGATTTG TCACTGCTTG ACGTTTTGCT 

21901 GGAATCCCCA CTAATTTCTC ATTACCTTTG AAGGCAACGA TCGATGGCGT 

21951 GGTTCTTGTT CCTTCGGATG ATGTAATTAC TTTAGCTTGT CCTCCTTCCA 

22001 TAACAGATAC GCAGGAGTTT GTTGTGCCTA AGTCTATACC TATAATTTTG 

22051 CTTGATTTTT TGTGTTCACT CATGTTTGGT ACCTAATCTC TAGGGGTTAT 

22101 TTCTATTCTT TATTTTCTTT GGGAGTAGGA GCTTTAGCGA CTTTAACTTT 

22151 AGCTACCCGA ATCGGGCGTT CTCCTATTTT ATATCCCTTT GCAAACTCTT 

22201 CTAAAATCGT CCCCTCAGGA ACTTCAGAAG TCTCTTCTGT TTGCACCGCT 

22251 TCGTGTAGGA AGGGGTTAAA CTTTTGGCCT ATTGAAGAAT ATTCAATAAT 

22301 ACCTTTTTCC TCGAAGATTT GTTTGAATTG GTTGAGAATC ATGTTGAATC 

22351 CGAGGGCCCA ATTTTTTACA TCGTCGGACA TTTGTGTAGC AAATCCGAGG 

22401 GCTTTCTCCA TGCTTTCTAT GGGATTGAGA AAGTCTATTA AAGTATTTTC 

22451 TAAAGCATAC TGCATAAGTT CTTGGCGTTC TTTTTGTAAG CGTTTTCTAG 

22501 AATTCTCAGA TTCTGCTAGA GCCATGAGAT ACTTATCGTT TTTTTCTTTT 

22551 AATTCGGTTT TTAGGGTGAC GATTTCCTGT TGCAAATGTT CAACTTCATT 

22601 TTCGTTTTGA ACATTGCTTT CGTGTTGTTC CTCATTTTCA GGTGGGGTAT 

22651 CTGTCATAAC GTCTCCTTAG AGGGTAATAG TTTTATAGAA GAGTACTCCG 

22701 TTCTTAAAAT AGGCTCATTC GAAAGCTTAC AGTTAGAGGT GAGTGGTCTT 

22751 CTGAAGGATA GTTTAAATTT GTAGAAACTT TGTGTCAGGG TTTCATTTAT 

22801 TTTATTCGCA AATAGTTTGA GCAAAGGAAG AGCTTCCTTA TAAGGAAGAT 

22851 TGATCGGGCC TAGGATACCT AAAGCTCCGA GTGGAGAGCG ATTCATATAA 
22901 TAGGGAATAG TAATTACAGA ACATCCTGGA TTCGAGGTCC CTAAAATATC

22951 AGAAAGCTCC TTCCCTATGA ACGCTGTAGC TCTTCCTTTA TGCATTCCTA 

23001 TATTTAGAAG CTCACACATT TGTCTGCGAT TTTCAAAAAG AGAGAGTCCT 

23051 AGAGCTAGAA CTTCAGGATC TTTAAACGCT TCGTATTTCA GTAGTTTCGA

23101 CATTCCTGTT TGATAGAGAT CTTCTTCACT AAAGTTGCAG TAGCGTGTTA 

23151 GATAGCGGAC AACCACCTCA TTATAGAGGG ACATGCTCAG GTGTTCTTCT 

23201 TTTTTCGAAA GTTCCTCATT TGTGGGGAGC TTTCGGATGT AGTTCTGCAG 

23251 GAATTTTTCT ATACGTTTGA TAGAAAGAGT ATCGCAAGCT TCAGGCAGCC 

23301 ATAGGGTGTC TGTGAAGATC TGACCAAACT CCGTAGAGAG GATGGTGACA 

23351 GCTCTTTGCT TATCGACCTG TGTAATTTGA ATATTGGTTA CGGAATCATT 

23401 TTCAAAGCGT GGGGAAGAAA AAAACGTAGG CAGGTCTAGG ATTTCTCCAA 

23451 GAAGTTCCGT AGCTTTTTGT AGATCCTTGA TAATATTGCG ACTTTCGCTA 

23501 GGAAGCTGAC TGATCTTATC AAAAATGGGG GCAGAAATCT CAGCTTCTGG

23551 GCATTCTTCT TGGTGATCTA CATAGTGACG TAATGCTAGG TCTGTAGGGA

23601 TTCTTCCTCC GGAAGTATGA TTTTTTTTTA AGAATCCTTC AGCTTCAAGT

23651 TCTGCAAAGT AATTTCTTAT AGTTGCCGTA CTCAAATCAG AGCAAAAACT
23701 TTCCTTTAAA GTTTTAGACC CTACAGGCTG CCCTGTTTTT AGGTACAACT 
23751 CTGTTGTAGC AAACAGGATA TCAAGGATTT TTGAATCTCG CTTTGAGACT 
23801 TTGGATCTAG CCATCTCGAG TCCTACTAGA ACAATCGTAA CTGAGAGCAT 
23851 TATAGGAAAA GAACCCGAGA AGGTCAAGAG AATTTAGCAC TCGAATTGAA 
23901 TGAATGCTAA CTTTTTTACG AGGAGGGCAG CGATCAAAGA GCTAGGCTAA 
23951 GTGATTCTGA CACCAAGTAG GGAAGGCCTC CGGGGAGACT GTATACTTTT 
24001 CTCCAGATCG GGATTCAATT TCGAATATTC CCGAAGATTG GTAGGACTTT 

24051 CCTAAAATAA GCTTATAAGG AATGCCGATA AGGTCACTGT CTTTAAGTTT
24101 AAATCCGAGT CTTTCATCTC GATCATCAAG AAGGGGCTCA TAGCCTTGAC 
24151 TTTGTAGCTC ATGATAAATA GTTTCCGCAA GCTCTTGAGA TACAGTGTCT 
24201 CCTCCGTTAA AGGCGATAGT GATAGAGAAG GGAGCGAGTG CTTTTGGCCA 
24251 AACAATACCA CGGTCGTCGG CAAGCTGTTC TACACAAGCG GCTAATGTTC

24301 TTCCGACTCC AATGCCGTAG GTCCCCATCC AGCACTGCTG GGTTTGCCCG 

24351 TGTTCATCTT GGAAGTTTAC CTCAAAACTA TCGGTATAGC GTGTCCCGAG 

24401 ATTGAAAATA TGAGCAACTT CTATGCCTTG ATAAATGCGG TAAGGATGGC 

24451 CAGGATTTTC AGGACATGTG TCTCCCTCTT CAGCGAGTAG AAAGTCACCG 

24501 TATTGGGGGG GGAGGAGGTC GCGATCCCAG TTTACATTTA CGTAGTGCTT

24551 ATCTTTAGCA TTGCCCGCAC AAACAAAGTT CGTCATTGGG GACGTTGTTT 

24601 CGTCTGCGAA AAAGTCTATG GGACAGTTTA GGGGACCGAT GAATCCTTTT 

24651 TCTGTGCCTA GAACGCGTTC GATTTCTTCA TCAGAAGCTA GAGCAATATC 

24701 ATCGGCATTC AGTTTGGAAG CGACCTTCAC TAGGTTGACT TGCCGATCTC 

24751 CTCTCATTCC AATGGCAATG AATTTTTCTT CATTTGAGTA GGAGAGTTTT 

24801 ACGACAAGGG TTTTTAAAAT TTTATGTAAG GGGATAGAGA AGAAGTTTGC

24851 TAGAGCTTCT ATTGTTGTAA TCCCAGGGGT GGCCACTTCT TCGACGGGAA

24901 GAAACTCGCG ATCGTAGGCA TGCTGTGGAG GAATGGAGAC AGCAGCCTCA

24951 ATATTAGCTC CATAGGAACC GCTGACGCAG ATCGTGTCCT CGCCTAGAGA

25001 GCAAAGGACC TGAAATTCCT CAGACTTTCC TTTGCCGATT TTCCCTCCAT 

25051 CAGCTGTAAC GATGACATAG GCAAGACCGA GACGATCAAA GATCTTACTA 

25101 TACGCAGAGC GGAGTTTTTC ATATTGCTCG TTCATTTGTT CGGGAGAGTC

25151 TGAGAAGGTA TAGCTGTCTT CCATAAGGAG CTCTCGAGAG CGAATGAGAC

25201 CGAATCGAGG GCGAATCTCG TCTCGGAATT TTGTAGCAAT TTGGTAAAGG

25251 TGGAGAGGAA GTTGTCTTTT TGAGGAGAGC CATTGTGCAA CAAAAGAGCA

25301 GATGACCTCT TCATGTGTAG GAGCTAGGCA ATGAGATTTT CCTTCGCGGT

25351 CTTTGAGAGT GTAGAGCAGT CCTTCCGAAG TAAATGCCTC CCATCTCCCT 

25401 GTATGTTGCC AAAGTTCAGC ATTGTGGAGA AGTGGGAGTA GAAGTTCTTG 

25451 ACCTCCAATC GCATTAAGTT CCTCTCTAAT GATGTTCATC ATCTTGGAGA 

25501 CCACGCGCCA TAACAGGGGT GTATAGGTAT AGACTCCTTT ACTTACTTTA 

25551 AATAGGTATC CTGCCTTTTC TAGGAGCTCG TTTGAGAGCA CAGCAGCGCT 
25601 TTTATTTGCA TTTTTTGAAG TCTTATAAAA GAGTTGAGAC GTTTTCATAG

25651 AGTGGGGCTG TTATCTTCAG TGATTGATCG TCAAGAGTCT ATAGAAAAAA 

25701 TTACAGGCTA TTGCGATTTT ATATCGAGAG TCTTTTAAGC TAGTAGATTT 

25751 AGTTTAGCTA TGAAGGAGGT ATCTTAACAC AGGGATTTTT CTAGATCAAT 

25801 GTTTTGCGTT GGGTTCTTAA CTTAACAATT GGGGTCAGGA TTATGTATTA 

25851 TTTTAATTAT ATTTTTGTTT TTATTTAGAA ATTTAATAAT CTCAACTTAT 

25901 AATTAAGTGT ATACTTATTA ATCTTTTATT TTTGTAATTG TAGTACTATG 

25951 TCGTCAGTAA ATCAAAGCTC TGGAACCCCG AATCCAGAAG AGGTAACTTC

26001 TCCTGAATCT ACGGAAGAAA ACAAAAATGT TGTTTCTTCA GATGAGGCGC 

26051 AAGCCACGCA TGCTGTGGCT CTTCCTATAG TCACTCAACT TTCTCTTCCT 

26101 GAAGGTGTGG GGACCTCATC TGAAGAAACG GCGAGTAATC CGAGGGTAGA

26151 CGAGATTGTA GCTGAAGTTT CTTCGAGTCG GGCGGTTGCT GATCAGATCT 

26201 CATCACTTGT AGAGCGTGTT GGAGAGCTTT TAGACGACCT TAAGGGTGCC 

26251 CAGTCCCTTT TCACTAGCTT TCAGTCAGAG TTGAAAAACT GTCTTCCGGC

26301 ATGGAAATCT TCAACGAGAA GACTCGAAAC TCGAGGTGCT GGGGATAATG 

26351 CGGATATAGC GAGGCTGGAA TTATTTCGTA GCGATTACGA GGCTGTCTTA 

26401 GGCCATGCGA ACCAGTTTCA TGGGAAGGCT CATCTCATTT TAAGTAAGTT

26451 AACAGATGTA CATCACAAGC TACAGGGACT CAGTCGTGAA GATCTTTCCC

26501 TGGCGTTTGA CAATAATGAT AGGGTTCTTG AGCATCTGGG TTCGTTAGGG 

26551 CTTGATGTAG ACGCTGAAGG TAATTGGTCT CTTTCTTGTG AGAGGGGGAT

26601 TCCGCGACTG GTGCTTACTG CTGACAGTAT GCTTGTCCAG ATCAAGAAAG

26651 TGAATCTACC TACTGTAGAA GAATTGCGGA CTCTTCAGGG AACAACGGAA 

26701 TCTTCGTCTG ATCCTAGGGT TGAAGAAAGT TTGTCTTGCT GTGAAAGATT 

26751 GCTCAATGAA TTACGTCGTC TTTGGGCGAA TTTTGTAGGT TTTATTTCGA 

26801 GTTGCTATGA CAACATCGTG TTTGTTTTGA TGTGGATAGT GAGACGGATT 

26851 AACCTTTTGC CTGGGCTGGG GTGTTTGCCT TTCCATAATC CCGATGCTTC 

26901 TCAAGAAGAC CAGAGGTCTT CTTCCGGAGA GCGTTCTACA AGGAGAGAAC 
26951 GCCTTTCTCG GCGATCTGAC TTATCTGAGG AAGAGATGAT TGTGAGAGCT

27001 GAGGGAGAGT CTATACATCC TGAATCTCCC CATGGAGATG GCCGTAACCA 

27051 ACCTAGTCGA GGTGATAAGC AAGACTCTGA TAGTGAGGAA GAGACGGAGT

27101 TATAATAAAG AATGATCTCT TCTGAGTAGA AACTATCGAA TCAGAATTGC 

27151 CAGCTTTTAC TCTCACCCAG TCCTATTTTT ATAGTTCAAA TTGCGGTGGT 

27201 TTGTTTATTG TTTTTTTGAA TTTTTGTTAT ATGGTGGCGT CTAATATTGC 

27251 CGATTAAGGA GGGCGCCCTT TATGAATAGA AGAAAAGCAA GATGGGTAGT 

27301 GGCATTGTTC GCAATGACGG CGCTCATTTC TGTTGGGTGT TGTCCTTGGT 

27351 CACAAGCGAA ATCAAGATGT TCTATTGATA AGTATATTCC TGTAGTCAAT 

27401 CGTTTACTAG AAGTTTGTGG ACTTCCTGAA GCTGAGAATG TTGAGGATTT

27451 AATCGAGTCC TCGTCTGCTT GGGTACTGAC TCCTGAAGAA CGTTTTTCTG

27501 GAGAGTTAGT CTCTATCTGT CAGGTTAAAG ATGAGCATGC TTTCTATAAC 

27551 GATTTGTCTT TATTACATAT GACTCAGGCT GTGCCTTCGT ATTCTGCAAC 

27601 GTATGATTGT GCTGTAGTTT TTGGCGGGCC TTTGCCAGCG CTACGTCAGC 

27651 GCTTAGATTT TTTGGTGCGA GAGTGGCAGC GTGGCGTGCG CTTTAAGAAA 

27701 ATCGTTTTTC TATGTGGAGA GCGAGGGCGC TATCAGTCTA TTGAAGAACA

27751 AGAGCATTTC TTTGATTCTC GGTACAATCC TTTCCCTACT GAAGAGAACT 

27801 GGGAATCTGG TAACCGAGTT ACTCCCTCTT CTGAAGAAGA GATTGCCAAA

27851 TTTGTTTGGA TGCAAATGCT TTTACCTAGA GCATGGCGAG ATAGTACTTC 

27901 AGGAGTCAGA GTGACATTTC TTCTAGCAAA GCCAGAGGAA AATCGTGTGG 

27951 TTGCGAATCG TAAGGACACC TTACTTTTAT TCCGTTCTTA TCAAGAAGCG 

28001 TTTCCGGGAC GCGTGTTATT TGTAAGTAGT CAACCCTTTA TCGGTTTAGA 

28051 TGCTTGCAGG GTCGGGCAGT TTTTCAAAGG GGAAAGCTAT GATCTTGCTG

28101 GACCTGGATT TGCTCAAGGA GTCTTGAAGT ATCATTGGGC TCCAAGGATT 

28151 TGTCTACATA CTTTAGCGGA ATGGTTAAAG GAAACGAACG GCTGCTTAAA 

28201 TATTTCAGAG GGTTGTTTTG GATGATTCAT GGATCTTAGA GGTTAAAGTC 

28251 ACTCCAAAAG CCAAAGAGAA CAAAATTGTA GGCTTTGATG GACAAGCTTT 
29701 CCTTCAGGGA GTTCTACCCA AAGGTAAGGG GCATGATCGC CACCATGAAC

29751 TGAGAATCCT GCAGTTTCTA AGCTTTTTTT AAGTTTCTGA GCATTGGTTA 

29801 GATATAAAGA GATGGCGGGA GGTGTCGGAA ATAAATCTAG GCCGTAATAC 

29851 CCTGCTTCTT GCATGAGGAG AGATGCTCCG TTAAATGTAG TCGCAAAGAG 

29901 CCGTTTCCAA TCGTTGATCA TAGGTTCGTT ATTGTCATAG GTGAGTTCTT 

29951 TAGGGATCAC GTTCCAGGCA AGGCGCATGC CAGTAAAGCC TAATGATTTA 

30001 GAGAAAGAGT TGATTTCTAT AGCACAATAT TTTGCTTCAG GGATTTCGAA

30051 GATGCTTTTA GGTAGGCTAG GATCTGAGAC AAAGGCGCTA TAGGCCGCAT 

30101 CAAAAATAAG AACGGTTCCG TGCTGATTCG CGTAGTTCAC AAGTGCTTGG 

30151 AGTTGTTGAA AGGTTAGAAC TGTTCCTGTG GGGTTGTTAG GATAGCATAG

30201 ACAAAGAATG TCTAGGGATT GTTGGTTCGG AAGTTCTGGA ATAAACCCAG 

30251 TTTCTTTTCT GCATGCTAGG GGGATAATGT CGCGGATTCC TGTAATGTGG 

30301 GCAATGTCTC TATAAGCTGG ATAGACAGGA TCCTGTAGAC CTAGAGTCTT 

30351 TTCTGAGCCA AAAAAAGAAA AGAGACGGAA GATATCAGGT TTGGCACCAT 

30401 CCGAAATAAA AATCTCTTCA GGGGAGATTC TATTTTCATA GACTTCAGAG

30451 GCAATTTTTG TGCGTAATTT TTCTAATCCG GTTTCTGGGC CGTACCCACG 

30501 ATAGGTCTCT TGTTTCTCTT GAGAAACGCA GAACTCTTTG ATTGCCTGAG 

30551 TAATAGAGCG GCAGAGAGGT TGTGTCGTAT CTCCGATAGA AAGATCTATG 

30601 ACAGAGATTT CTGGATTCTC CTTGCGAAAC TGAGCAAGCT TTTTACTAAT 

30651 TTCAGAAAAT AGATACTGAG GCTTGAGAAG AGAAAAGTGG GGATTTCTAC 

30701 GCATAGCGTG CTCCAGGTAA GGGCGTGATT CACTTGGAAG ACATCTTAAG 

30751 AAAGCCCCAG CTTTTTGGAT AGCCATTTTC TGATTTTTCT TTAACCGCCT 

30801 CTATAAAGGC AAGGAGTGCT CTGTAGTTTC GGTTAATTAT TAAGATATAT 

30851 TTATAATTCT GTTTATTAAA AGTTTTTTAA ATCTTTTCTA ATTTGCTCAC

30901 TATAATTAAA GGATAAGATT TGAAAAAATT TTTTAGGTAA TTATGATAAG

30951 GGTCAATCCT TATGGAAGTT ATAGGGGTAG GAATCCTTCT CCAGAAGATG 
31001 GGAAAAAGGA TGTACCCCTT TCAGGGAACT CTCGCTTGCA TCGTCGTGGT

31051 GGGATTCGTA GAAAGCATAA GAGTGCTTCA GTTGGGGTGA CCTCGGGTTC 

31101 TAAGACGGGG AAAGCTTCTT TAGAGAAGAA GGTCAAAGGC ATTTCAGAAG 

31151 CCCATTTCAA ATAATCCAAG ACAGAAGGTT CTCATTCGAA GACAAGCAAA 

31201 GGATTCGTTG GCAGATTTGT TCAATGGATT AGAACGTTTA CAGGACGTGG

31251 AAGCAAGAAG CGTTCTCCCT CAAGTTTTTC TCCAACGCAC CCTTACATAC 

31301 GTTTGCGAAC TTACACACGC AGTCCAAAAC AGAGTGGTGT AGAGAGAAAA 

31351 CAAGAAGATG CTGAGACCTC ATTTATAGAG ACACCCAAAG GGATCTTGAA
31401 AAAGCCTGGA AACAAAGACC CCAAAGGCAA GCACGTCCAT TGGAAAGACA 

31451 GCTAATCCGG ATCAGAGCTG AGATCCCTCC TTCTCTTCAT TAAGAGGGTT
31501 CAGAGTTTTA GGTCTCTGCG GTAGAAGGGA GGGAATCCAT AAGGAGAAGA 
31551 CCAGTGGATT TTCAACGGAG ATTCTAGAAG GAAACAAGCT GATTCTACAT 
31601 GGAGATGTTT GTAGGTCTGT TGTGAATTCC TCTCTTCACA GATCGGAAAG 
31651 ATCAATGAAA AGAGTTAGGG TCTCTTAGTT TTAAGAATTG CTATACAATA 
31701 TTGCGAAATA AATACTTTCT TCTCTATACA CTGTCTTTAC GATGAAGACA 
31751 GCTTTTCACT CTTGCTATTC TTGGTTTTGT TGGCTCTTTA GCTTCTTGGT 
31801 ACTCTTTGTG GGTGGCATCG CTGGGGGAGA GCCTTTGTGC CCCGATTGCA 
31851 AATACGAAAC TAAGTCTGTT TTACGTTCGG ATCAGCTGCC GGATCATCTC 
31901 TGGAACTATG AAAACGACTG TTATCTTACA GGTTATGTGC AGTCTCTTTT 
31951 GGACATGCAT TTTTTAGATA GCCGTACGCA AGTTGTTATT GAGAAGAATA 
32001 GAGCGTATCT TTTCTCTTTG CCTGTAGATT CGAGTTTATC AGAAGCCATT 
32051 ACCAACTTTG TTAGGGATCT TCCCTTCATA TGTGCTGTGG AGATTTGCGA 
32101 GCGTCCTTAT GGTGAATGCA TAACGAGATC TTCTGCGGAG CGTCCCTTAC 
32151 TCCCTAAAGA GAAAACTTTA GGAATGCCAA TTTTCTGCGG CAAAGAAGGG 
32201 GTATGGTTAC CTCAAAATAC CATTTTGTTT TCTCCTTTGA TTGCAGATCC 
32251 TCGTCAGGTT ACCAACAGTG CTGGCATTCG TTTTAATGAG AAGGTCGTGG 
32301 GGAATCGGGT AGGTGCTACC ATCTTTGGGG GAGATTTTAT TCTCCTGCGT 
32351 CTTTTTGATG TTTCTCGATT CCATGTAGAT TGTGATTTCG GAATTCAAGG

32401 AGGAGTCTTC TCAGTTTTTG ATTTAGATCA TCCTGAATCG TGCATGGTAA

32451 ATTCAGATTT CTTTGTTGCC GGACTCTGGT CAGGGGCTAT AGATAAATGG 

32501 AGTTTTAGGT TTCGATTGTG GCACCTCTCG TCCCATTTAG GAGATGAGTT 

32551 TATTCTTACG CATCCAAATT TCCCAAGATT TAATTTGAGT GATGAGGGCG 

32601 TCGATCTCTT CATTTCGTTT CGTTACACAC CACAGATCCG CTTGTATGGC 

32651 GGCTGCGGTT ATATTGTAAG TAGGGATCTT ACTTTTCCTG AGCGGCCGTT 

32701 TTACTGTGAA TGGGGTGCGG AACTCAGACC TTTTGGTCTG AGAGAAGGAA 

32751 ATCTCCACGC ACAACCGATT TTCGCGATGC ATTTCCGTTG TTGGGAAGAA 

32801 CAGAAATTTG GCTTGGATCA AAGCTATATT TTAGGCATGG AGTGGGCCAA 

32851 ATTTCAAGAA ATCGGAAGGA AAATCCGTGC TGTTTTAGAA TATCATCAGG 

32901 GATTTTCTAA AGAAGGCCAA TTCATTCGTG AACCGTGTAA TTACTACGGT 

32951 TTCCGTCTTA CCTATGGATT CTAAACGATA ACAAAACCAT CTTCAGGGAC 

33001 AGGGTGTTCA GGTCCTATGG GCAGCGTATG ATTGAGATAT CTGCGGAAGA 

33051 TTGCAATGCC TGCCTTTGCT GAGGATAGGC AGAATAGGCA GTTGTGTACC

33101 CATTCAGAAC CTCGAATGGT TCCTACAGCA TGATTGGAAT TATACAAAGC 

33151 TGTGATCTTG GATTTCCAAT AGTCTACAGG TCCGATTAGG AAGACGGGAA 

33201 CAAGAGCTTT TTTCCCTGTT TTGAGACTAA TAAGCTCCAG AAGGAGTTCG 

33251 AAATCGGTTC CCATGCCTCC GATAACAAAT ACAGCAAGGT CGACATGGAA 

33301 GTCGGCCTGA CGTTCTAAAA GATCAGGAAT AGCATAGCTC ATTTTAGCTT

33351 CTACATAGGC ATTCGTGGTA TCCAAGCTAA TTAGATTCCC ACAAGAGAGT 

33401 ATGGAGAGTT CTGTAGCTAC ACGATTCGCG AGTTCCATAG CTCCAGAACC 

33451 CCCTCCTGTA AGGATTGCTA ACGGTGTCTG TGGTGGAAAT TCTGGGATCG 

33501 TGAATTGCTG AGAAAGAGTA TGCATTCCTG TCAGGAGCTC ACGGAGAAAC

33551 TCATCATAAT CCCCAGCGAT TAGGCAGGAA CCATGAATTC CTATAAAGTA

33601 GGATTGAGCA AACTGTTCAG CTTGATGTTT AGGGACAAAC ATGCCCACAT

33651 CTTTATTTCT GCGTTTGATG TATTGTAAGA GTCGTTTCGA TTCTAAGTCT
33701 GCCCAAAATA CAGAAATTCC TGCAAAATAT AGATCGAGAA GGAAAGAGCG
33751 ATCTCGATTC GAGAAAAACT CTCCAGAAGT GGGAGAGGGA ATCTGAAAAT 
33801 AGATATGTTG CAGGTAATAG CGAGAGTAGT TAGAGAGGAA CATGCCCTTC 
33851 AGCGATGCTG AAGGGAAGTA GCGGGAAAAT AAAACTCCTT GGCTTGTGAT 
33901 ATGATCTGTT TCCATGGCTT TTAAAAAAGG GAAACAAGGT TGGTCTTCAA

33951 TGTGCTTTTG AATTTCCCTA GCATGTCTTT CATCTGATGG GGAGATTCGA

34001 GGTTTGATGA TCCAAGAGTC TTGGGAGAGC TCAAGCAGCT CACTACCTTT
34051 GGAGATAAAC ATCGCAGCTT GATCTTCGCC TTCCGGTATG GATTCAAAAA 
34101 CACGAAATAC CTCTTGAGGA GATTCTAAGG TTTCCTGGAG CATATCTCTA 
34151 TAGAAGAAAA ACGAATGCTC TTTGTAAGGC TCAAGAGTAA AAAATTCTAA 
34201 AGGTATTCTC TCAATAGGTT CTGAAGTGCT GCCGTAGAAT TCATAAATAT 
34251 CTCCAGATTC TTGTGTGGTA GGTTCGAGAA TATCCGCTGC GGTGTGACGA 
34301 AGCCCTTGGG GGAGTAAGTC CTGAACGACT CTTGCAAATA CGGTTCGGAT 
34351 GTGCAGAGGC TCTGTCTTTA TGAGAAGAAT TTTATGATCT TCGGGAACGG 
34401 GAGGACGATC TGTTACCATT TGATACAAAG GAAGAAACTT ACGTATTTTT 
34451 AAATGGGGAC GCGTGAGTGA TTTGCTCATT AAGGGAAGGA ACCCATAAAT 
34501 TGTCTCTTCG TAACAGATTG TTCCTGGAAG GATCGGAAGG AAGACAACAA 
34551 GCCGATCATT AATGATCTCT AGAGTGATGA AGTGCTCAAG TTTTTTCCCA 
34601 AAGCGTAGGA GCGGAGATCC TGTACGGTCT GTGTGCGTAA ACATCCTGTT 
34651 GAGATAACAA GGCGAACGTA CGAGTCGGCG ATCATCAGCA GCAAAGAGCT 
34701 TGCAGACAAA ACTTCCAGGC TCTAGGAGCT CCAACATAGC AGTGGCTATA 
34751 GGATCTTGGC TCATGAAGAG AACGTGTAGA CGAGCTTCTT TTCGGGCTTT 
34801 ATTTAGCTCC AAGTGGTTTA AAACGGCTTC GACACCTAGT TGGGCTAAGG 
34851 AACTTTTTAA ATTTACTTGT ATACACTGTT GAGGCAGATG AAATCCAAGA 
34901 AAGTACGCAG GAATATTCTC AATGAGGACC TCTCCTTCGT AAATGTGGGG 
34951 CGAGAGTTTT TTCAAATGGG AAACGAGTCG TCCGTCTGGG GAGGCTGCAT 
35001 CATGATGCGC GTGGAGTAGG TTATACATAA TTCATCGTAT TTTAAAGTTA 
35051 AAAGCAATAG GTAAGGTCCC TCTTTGCGCT TCTAAGACTA GCGTTTCCAG

35101 AAGAATTTCC TACAAGGACT TCATTCTGCT TGTTGTTATA ATTTTGTGCA 

35151 ATTTGTCAAG GAGGGGGTCT TGGAACGAGA AAATTGTGTT GTGGAAGGGA 

35201 ATTTTAGGAA AAAAGTTTCT TAGAAAATAG GGCAGAAACT TTCTCTTGGA 

35251 GAGATTGAGT AAAATGTAAA GAATAGTCTT TGCAATTGAG AATTATTTCT 

35301 CTCTGGTTAC AATGGAGGAT TGGCTAAGGA GGATAGTAGG TATGCAGATT 

35351 CCAAGAAGCA TTGGTACTCA CGATGGTTCT TTCCATGCGG ATGAGGTCAC 

35401 AGCGTGTGCT CTCCTTATTA TTTTCGATCT TGTGGATGAA AATAAAATTA

35451 TACGCTCTCG AGATCCTGTC GTATTATCGA AATGTGAATA TGTTTGTGAT 

35501 GTCGGTGGTG TTTATTCTAT AGAAAACAAG CGTTTTGATC ATCATCAAGT

35551 CTCTTATGAT GGATCTTGGA GTAGTGCAGG TATGATTCTG CATTATCTTA

35601 AAGAGTTTGG TTATATGGAT TGTGAAGAAT ATCATTTCCT TAACAACACT 

35651 TTGGTACATG GTGTGGATGA ACAAGATAAT GGCAGATTCT TCTCTAAGGA 

35701 GGGATTTTGT TCGTTTTCTG ATATTATTAA AATTTATAAT CCTCGCGAGG

35751 AAGAAGAAAC TAATTCGGAT GCGGATTTTT CTTGTGCTTT GCATTTTACC

35801 ATCGACTTTT TGTGTCGGCT AAGGAAGAAG TTTCAGTATG ATCGAGTTTG

35851 TAGGGGGATT GTCAGAGAAG CCATGGAAAC CGAGGATATG TGTTTATATT

35901 TTGATCGTCC TTTAGCATGG CAAGAAAATT TCTTTTTTTT AGGGGGAGAG

35951 AAGCACCCTG CAGCTTTTGT TTGTTTTCCT TCCTGCGATC AATGGATTTT

36001 ACGAGGGATT CCTCCGAATT TAGATCGCCG TATGGACGTT CGTGTTCCTT 

36051 TCCCTGAGAA TTGGGCAGGT TTGTTAGGTA AAGAGTTGTC CAAAGTATCA 

36101 GGGATTCCTG GGGCTGTGTT CTGCCATAAA GGTCTTTTCC TTTCTGTATG 

36151 GACAAATAGA GAAAGTTGCC AACGTGCTTT GCGGTTAACG TTACAAGATC

36201 GAGGGATCAT ATGACAGTAT TCAAACAAAT TATCGATGGA TTGATAGATT 

36251 GTGAAAAGGT ATTTGAAAAC GAAAATTTCA TAGCTATAAA AGATCGTTTT 

36301 CCTCAAGCTC CTGTTCATCT TCTTATCATT CCTAAAAAAC CTATACCACG 

36351 ATTTCAGGAT ATCCCAGGGG ATGAGATGAT TTTAATGGCA GAGGCTGGAA 
36401 AGATCGTGCA AGAGCTTGCT GCAGAATTTG GAATTGCCGA TGGGTATCGT

36451 GTGGTTATCA ACAACGGTGC TGAAGGAGGA CAGGCGGTAT TTCACTTACA

36501 TATTCATCTT TTAGGTGGGC GTCCTTTAGG TGCTATAGCC TAATTTTCTT

36551 TTGTTTCTGT GGATCCTTGT TCGGCTCGGA GTCCCTCCGT TATCAATTGT

36601 TGATCCAAGA TTTTGCAAAA GTTTCAGAAG AGGGCATAGG CCTTTTGGAG 

36651 TCTAAAGAGT ATTCTTTACT TCAGGCTAAG CTAGTTTTAA GGGCTCTGGC

36701 TCAAAATTCT TCTTTTGATG ATTGGTTTAG AAGTTTTAAG AAGTGTCAGA 

36751 TTTCCTATCC AGAGTTAGCT CATGATCGCG ATGTCTTAGA AGAATTTGGG

36801 ATTCAAGTTC TGCGTGAGGG AATCGAAAAT CCTTCCGTGA CCGTTCGTGC 

36851 TGTGAGTGTC CTTGCTATTG GGCTTGCTAG AGATTTTCGC TTGGTCCCTC 

36901 TCCTGCTCCA AAGTTGTAAT GATGACAGTG CTATTGTTCG ATCTTTGGCT 

36951 CTTCAGGTTG CTGTGAACTA TGGCTCTGAA AGTTTAAAAA AGGCCATTGT 

37001 AGAGCTTGCC CGTAATGATG ATTCTATTCA TGTTCGGATT ACAGCATATC 

37051 AGGTGGTCGC TCTTTTACAG ATAGAGGAGC TATTGCCATT TTTAAGAGAG 

37101 CGTGCTGAGA ACAAACTTGT AGATAGTGTA GAACGTCGAG AGGCGTGGAA 

37151 GGCTTGCTTG GAACTCTCTT CTCAATTTCT AGAGACGGGT GTAGCTAAGG 

37201 ACGATATTGA TCAAGCGTTG TTCACTTGTG AAGTGTTGCG TAACGGTATG 

37251 TTGCCAGAGA CTACTGAGAT TTTTACAGAA CTCTTATCTG TAGAGCATCC 

37301 TGAAGTGCAG GAGTCTCTCT TACTTTCTGC TTTAGCTTGG AGTCATCAGC 

37351 TACAGAATCA CAAAGAGTTT CTTAGTAAAG TGCGCCATGT GATGTGCACT

37401 TCTCCATTTG CAAAAGTACG TTTTCAAGCT GCTGCACTTC TCCATCTGCA 

37451 TGGAGACCCT TTGGGCAGAG ACTCTCTGGT TGAGGGCTTG CGCTCTCCTC 

37501 AACCTCTTGT GTGTGAGGCA GCTTCGGCGG CTCTCTGCTC TTTAGGAATC

37551 CATGGAGTCC CTTTGGCAAA GGAGCATTTG GAGAGCCTTT CTTCTCGAAA 

37601 GGCTGCTGCG AACCTCTCCA TTTTGCTTCT TGTGAGCCGT GAAGATATTG 

37651 AAAGAGCTGG AGATGTGATT GCTCGCTACC TCTCCAATCC TGAAATGTGC 

37701 TGGGCTATAG AGTATTTCTT ATGGGATGCA CAATGGAATT TACGTGGTGA 
37751 TACCTTCCCT CTATATTCGG ATATGATTAA ACGTGAGATT GGTAGGAAGC

37801 TCATTCGCCT TTTGGCAGTA GCTCGCTATA GCCAAGCCAA GGCTGTAACA 

37851 GCAACGTTCC TTTCAGGACA GCAAGCTCAG GGATGGAGCT TTTTTTCTGG 

37901 AATGTTCTGG GAAGAGGGAG ATGTGAAAAC TTCTGAGGAT TTGGTTACAG 

37951 ATGCTTGCTT TGCAGCAAAG TTGGAAGGAG CGTTAGCCTC GCTATGTCAG 

38001 AAAAAAGATC AAGCTTCCCT ACAGAGGGTC TCTCAACTTT ATAATGACAG 

38051 CCGTTGGCAA GATAAATTAG CAATCTTAGA GAGCGTTGCT TTTTCTGAGA 

38101 ATCTTGATGC TGTGCCTTTT CTTCTAGACT GCTGCCATCA CGAAGCTCCT 

38151 TCGCTGCGAA GTGCAGCAGC GGGTGCTCTT TTCTCTATTT TCAAATAAAT 

38201 ATTAATAAAA TTATTCAAGA TATAGAAGAA AAACCACGCA GTTGTAATTT 

38251 CTATTTCTTA AAAATAATTT TTCAGACTGA CTTTATTCTT TCATTTTTAA 

38301 GTCTTTGGAG ATAGAAAACT TTGTTATAGA TTTTTATCTG GTAGCTTTTA 

38351 TAATTTATGA AGAGCGTAAG CTCAGAGCCT GTATGTCATG CACAACCTGT

38401 ATATAAAATC AGATTTGTTT TTTGAATCTC TATTCTCGTT AAGATTTCGT
38451 TATTCTGGAC GTTATCTCCA TCACCACTCC TAATTTTCCT AGCATTTCTA 
38501 TCTTTAAGCT CAGCACCGTA GCTTGCTTAA AGGAAATATT TTTCATTTAG 
38551 GTTGTGGAGT TCTTTATTTT ATGAATTTTT CATTATTTTT ATTTTTCCTG 
38601 ATAGCTATTC AGGGAATCTG CTTGTACGTG GGACGTCGTG GTAGCAAAAA 
38651 GGTAGAAGAT CGCGAGAGCT ATTTTCTTGC AGGAAGGAGT TTAAAAATCT 
38701 TTCCTTTGAT GATGACATTC ATTGCCACCC AAATCGGTGG CGGTGTACTT 
38751 CTTGGGGCTG CTGAAGAGGC CTTCTGTTAT GGTTATGGGG GGATTCTTTA
38801 TCCTTTAGGA GTCGCTTTAG GGTTGATTTT CTTAGGAATG GGGCCCGGGA 
38851 AGCGGTTGGC AGAGGGATCG TTAACGACCG TAGTCTCTAT CTTTGAAGTG 
38901 TTTTATGGTT CTAAAAAGCT CCGTAAGATC GCATTTTTAT TATCCGCAGG 
38951 TTCCTTATTT TTCATCCTGG TCGCTCAGGT GATTGCTTTA GATCGGTTGT 
39001 TTAGCAGCTT CCCTTTTGGC AAGTACGTAA CCGTAGCATT TTGGATTGTC 
39051 TTAGCATCCT ATACCTCAAC AGGAGGGTTT CGCGGGGTCG TACGTACTGA 
39101 TGTGATCCAA GCAGGATTTC TTCTTATTGC GGTGCTCGTC TGTGGTGTTT

39151 CTGTATGGCT CTCTGTCCCT AAATCCTTGT CTGTGTTGGA TCCTTTCCAA 

39201 TCACTTCCTT GTGCGAAGCT TTCCAATTGG ATATTCATGC CTATGCTCTT 

39251 TATGCTTGTT GAGCAGGATA TGGTGCAAAG GTGTGTGGCT GCCTCCTCTC 

39301 CAAAACGCTT GCAATGGGCG GCTGTAGGCG CAGGCCTTGT TCTTCTTCTT 

39351 TTTAACTTTA TCCCTTTATT TTTAGGTTCT TTAGGAGCTA AAGCAGGCCT 

39401 TAAAGCAGGA TGCCCTCTGA TTGATACCAT TGCATATTTT TGCAATCCCT 

39451 CACTAGCAGC TGTGATGGCT GCTGCCATCG GCGTTGCGAT TCTCTCTACC
39501 GCGGACTCTC TTATGAATGC TGTAAGCCAG CTAATCGCTG AAGAATACCC 
39551 TACGTTGAAA GCCCCTTATT ATCGTTATTT AGTATTGGGT TTGGCGGTTG 
39601 CAGCTCCTCT TGTTGCTATT GGTTTTACAA ACATCGTAGA TGTCTTGATT 
39651 TTAAGCTATA GCCTGTCAGT GTGTTGTCTT TCAGTCCCTG TGGGTTTCTA 
39701 TCTTCTAGCT CCTAAAGGTC GCCGTGTGAG CGGAGCTGCT GCTTGGGCAG 
39751 GAGTGCTCGT TGGTGCTCTG GGCTATGGAT GGGTTCAGAT AGTCTCTTTG 
39801 GGGATGTTTG GGGAGCTATT GGCTTGGGTA GGTTCTCTAG TCGCCTTTTC 
39851 CTTTGTAGGA TTTATTGAGA TCACTTGGAA AAACAAAGTC AAAACGCAAA 
39901 CTTAGATAAC CACTGCATGA GAAGATATAA CTAAAATAGA TCCTGAGTTG 
39951 TTTAGGTTTC TCTTAGATCT GATAGGTTGC GCTTAGTAAG AGATCGTCAG 
40001 TTTTTTAAGT TGTGTTTAGA ATCTGATACC TCTCCTTCTT TTCCAAGAAG 
40051 AAGAGGGGTT CGTTTTATTT TTTATTATTC ATTCGTAGGG GCGGGAAGCT 
40101 GTTTTAACTG ATAGAGCAGG TCGATAAGAG AGGAGAGCAG GGCTTGATAG 
40151 AGCGTCTCTT CAGAATGGTG AAGAGTGCCC TTAAGAAGCT TTCTGCAATA 
40201 CTTTAATAGC CAATACAGAG CGCAAAGTAA ACACTGTAAA AGGTAAAATT
40251 TACACGCTTG TTTTATCATA AACCTCCAAA GAAAGACGCG TGTTGGCGTT 
40301 CTTCCAATTG GCTGTTAGAT AGTGAAAATA GTATTTACTA TTCAATAAAA 
40351 ATATTTATAT TAAATATAAT AAAACAAAAT TTCTAATAAA CTTTTTAAAA 

40401 GTATTTGGCT AAGTTTAATT TAAGAGTTCT CAAATAAAAG ATTTTTTAGT
40451 CTCTTTTATT TGAGAATCAT CAACCGATTT CAAGAAATCG CATGAGCCCG

40501 TGATTCTGAT GGGCAGAAGT GCTTTTTAGC AAATAGGTGA AGCTCCTCGG

40551 GCGTGATCTG TTGCTTCGCG TCACAAAGAG CCTTTTCAGG GTGTGCATGC

40601 ACTTCGATCA TCAGACCGTC GGCACCTACC GAGAGACCAG CAGAGGCGAG

40651 AGGAAGAACT AGAGAACGCT TCCCCGCTGC GTGGGAAGGA TCTACAATTA 

40701 CAGGGAGAGA AGAGATCTCT TTAAGGAGAG CCACGGTATT GAGATCTAGC 

40751 GTGTAGCGCG TAGAGTGCTC AAAGGTACGA ATTCCTCGTT CACAAAGGAT 

40801 TACCCCAGGA CAGGAGGGAG AAGAAGCAAG GATGTACTCC GCTGCGCATA

40851 GCCACTCTTC AAGAGTAGCT GCTGGACTGC GTTTTAGGAT AATCGGACGA 

40901 TGTGATTTGC TGACCTCTTG TAAAAGAGGG GTGTTATGCA TGTTTTTGGC 

40951 TCCGATACGG AGGATATCCA CATGTTCGGC AGTAATTTCA ACATCTCGGA 

41001 CATCTAAAAC TTCGGTTTCT GTAGGGAGAC CATGGATGCT CTGTGCTTCC 

41051 TTATGCCAAA GCACACACTC TTTCTCCCAT CCTTGAAACG AAAATGGGCT 

41101 TGTCCGTGGT TTTCTGATTG ATCCTCGGAA TACCTGAGCT CCTGCTTCTT 

41151 TAACTGTAAG AGCTGAAGAG ACTGTATGCT CGTAACTTTC TAAGGTGCAG 

41201 GGGCCTGCGA TCAGTATTGG CGATCCTTCT CCAAACGATA GATTTGGAGA 

41251 AATAGGAACG GTATGGACCT CGTCAGGATG CTGTTTGAGG GTGCGCGGTA 

41301 GGGGATAGGT AAACGTAAGA ATAAGTACCT CATGCAAAAC TAGGGATTTG 

41351 AGCTGTCCGG TTTTGAGTTC TGATCTTCAG TTCTAGGGTT TCTAGGCAAG 

41401 AGACAACCGT AGTGGTTGGG GAAGCGGATT AAGAAGATAC GAGCGTCTCT 

41451 TTCCGGAGAA ATTGTATTTA AAAATCCATG CATCGCAACA AAGTTTTCAA 

41501 TATTTTCTTC TGAGACAGTA TCTCGATCAG ACATGATCAG ACAAACAGCA 

41551 AACGGCTGCA AAGGTACAGT AGTCATAGCA GCGACGTCGT TAGCCTCAGC 

41601 TTGCTCTTTG ACGTCATCCA AGATATCACC GAGTTCATTA GAACTTAGGG 

41651 AGTGGAGGTA GCCTATCAGT GATTGTCTAA AACCAGAGCG ACAGAAACCA 

41701 TCAGATGCTG TGACCGTGTT TTTTAAGAGA TTACAAATCA ACTGATCTCG 

41751 AGCTCTAGAG TATTGATTTA TAGTCGCTGT AGCTAGAGGG AAACCGTATT
41801 TTAACGAAAG TAAAAGCTTA TCTTTTACTT GTGTAAATCT ATGAGAAGCC

41851 TCTCTTTGGA AAGCGTCTAA TCGAGCAGCT AGCGGAGTCT TTTTTTGTAG 

41901 AAGTTTAGGG TAGTTGTTAA GGAAACAAAA GATTAAAAAA GATTTCGAAA 

41951 ACTGATCACT TAAAGCTGTG GGAGAAGAAG GCGAACCCAG TTTTGATCCG 

42001 AAGAGATTTA ACCGTGTTAA AAAAACGTTC CACCCATTCC TAAAGGTACT 

42051 TTCGATTTCT TTTTCTAAGA GTATGGCCTC GGTTTCCAGT TGATAATTTG

42101 GGAAAGTAGC TCGTAGAAAC TGCATACAGT TCTTTTCAAG TATGGGATCT 

42151 AAAACCATCA CTTCAGGATG TCGTGACCAG AACTTGCAGA ACGTCTCTTT 

42201 ATCTTTATTA GAAGGGGTTG AAGGATAGCG TGCTGGAGGC TTGAGTACTA 

42251 GAGACCCTAG AATGGAACGT AGAAGTTTTT GGAGTTTCTT TTCGTTTTCT 

42301 ACTTTTTTAA AGGCATATTG TAGAAAAGGA AGTAGAAGCA TTTTTAACTC 

42351 TTTAACGGCA TCTTTAGTTT CTGGACGATT ATTCACTTGA GGATGCTGTA 

42401 AAATAAAGAA TAAGAGCTGT ACTGCCTGTG TATAGAGGAG GTCCTGCTCT 

42451 TGTGAGAGAC CACTGCCTTC ATCACGCGTT AAACGTAAAC TTCTTGCCGA 

42501 GCAGACAGTA TTGAAGTGCT CTTTTAATTC AAGCTTACTA TGAATTGCGA 

42551 CTAATAGCTT GTTTACAAGA TTTGAAAGTA AATCCGGCTG ATTGCTTAAA

42601 GGAGGGGTTT CAAAGGATGT GGATATTACA TTCTTTAATA GGTTTGCTAT 

42651 GCGTTTATGT AGTGTTTCGA GTTGAGGCAT AGAGCTCTCT AACGTCAAAC 

42701 TCTCATGAGT CGTAAGGATA GTCATGATGT TAGAAGAAAT CACCTCTCGT 

42751 TGCCAAGAAG AATCTAAATC TTCATTAAGG TAGCTGAGGG AGGCGTTACT 

42801 TATTTGTAAG ATCTCTTGTA TAGAGCGTTG GGGGGGCGCG GACGCTATAA 

42851 TGATATCCTC TAATGATTTG TAGAAAGTAG CGCAGAGTTT GTGAAATCTT 

42901 ACAGAGGGGC AGGTACATAG AAAAGCAAAG AAAGAAAGAG CTGGAGACCA 

42951 AGGGAGATCC CCAGATCGTC TGTTTTCTAT AGTTGCAAGA AAGCGAGAGT 

43001 ATTGACATCG GATTGCTTCA GGGAGGTGAT CTTTGACTAT AGCATTGAAT 

43051 CTCTCAGTAT CCCACCCTGA ATCAGCAATG CGTCGACTCA GCTTAGCAAA 

43101 AGCCTCTCGT ATTTCTCTTT CATATTGCTT TCGAAGTTGT TGATCTTCTG 




43151 GAGACATCTT TTGAAGCTCT GCATCAGTAA GAAAGAGAGC GGGCAGATGT 

43201 TCCAAGAGAA GAAAGAGTTT GTCTCTAGAT TGCATGTAAG GAAGTGCGGA 

43251 ATGATCAATG AGATTTTGGA ACAATTCCGA ATAGGGACGA TTTGCAGCAA 

43301 GAAATTCCAG AACACTGGGG AGGAGATCGT CCTGCATATC AGAAATAAAT 

43351 AAGGCTCTCG CTGTTTCTTC ATTACTTGAA GCTGCGATTT GTTGGCGAAT 

43401 CGCATATGCA GAGAGTTTTC TTAGAAAGGC TATCAGAGTT GCAGTGTGTT

43451 TCTGAGAAAG AATCACCCCG TCATAGAGGT CTATGAAGGA GCAATAAGTA

43501 CTGCATAACT GAAGGAGTTC AGCCATTTCT GCACAAAGAT TCGCATTTGC

43551 CGGAGAAGAG GAGGCGAAAG GCAGGTCAAG GAGACGTGTG GCTTCCTGCT

43601 CAAAGACAAT ATCATTTCTG CTGCTCTCTT CGTAGAGAGC AGATAGCCAT

43651 CCTACAGCAT AGGCACGATA AAAGCAGTTC CCATCTCCCG GTACATTCAC
43701 AAGGTAGTAA TTGTCATTTA GATAGAGAGC CTGTTCAAGA GAGAGTTGCG
43751 CAAGTCGCCG GTGTTGTTGA GGAAGATCCG GATTTTGTGC GATTTTCTTG 
43801 AACTGTTTTA TTTGGAAATA CATCGGTTCG TTATCAATCC GATTGGGATA 
43851 GGAAGCTACG AAATGAGGGT TAAAGTCGCC AAGTATTGGA TCAACTTGCA 
43901 TGGCAGGAAG AGGCAGAGGC GCTCGTCTTA GTACCATGCT CACCATGCGA 
43951 TTTAAGCATT GCCAATAGCC TTGACTAGAT GGGCGGAGAG GCATGGGGGT 

44001 AGCCACCCGT ACGGGAGGAG GAGGGGCCTC TGGTGGTGGA GTCGGCTTTT
44051 TATCAGCAGG TTGTTTAGGG ACTTTTGGGC TCGCTGGTGA AGGAGCTTTG
44101 GGGGGAGGCG GGGGTGTGTC CTCTGGGGGC GGCGTGCCCG GCTTGGGAAC 
44151 ATCGGGTTTT TTGTCTTCAC CATCCTTAGG CGGTTGTTTG GCAATTTCTA 
44201 TAGTTTTTGG CTCTGGTCCT TTGGGAAGAG TGGGAGGCGT TGGCAAGCCT 
44251 TCTTTTCTGA CAACCCGATG ATGCTTGTAG TAGTGAATCA GAAGAAGCAA 
44301 ACCAAGAGTA ATGATATGGA GCAGAACGTA TCCTATGGTA CGTAGAATTC 
44351 TAAGTAACAG AGGGTCTTTA GTATCAGTCG TTAAGTGGTA AAAATTATTC 
44401 TTGTTATTGG GCGGGCAATG TGGTGGGGAA TTGGATACAT ACGTTCAAAA 
44451 ATTGCTCGTT TTTTAATCAA AATTATTCAA AGTTAAAACT TTTTCGAGTT 
44501 TGATGTATTT ATTTTTTTAT GTAAACTTTA ACACATGATA GAATTTAGCG
44551 TATAGAGCGC AAACTTTCAT GATAAAACAA ATAGGCCGTT TTTTTAGAGC
44601 ATTTATTTTT ATAATGCCTT TATCTTTAAC AAGTTGTGAG TCTAAAATCG
44651 ATCGAAATCG CATCTGGATT GTAGGTACGA ATGCTACATA TCCTCCTTTT
44701 GAGTATGTGG ATGCTCAGGG GGAAGTTGTA GGTTTCGATA TAGATTTGGC
44751 AAAGGCAATT AGTGAAAAAC TTGGCAAGCA ATTGGAAGTT AGAGAATTCG
44801 CTTTCGATGC TTTAATTTTA AATTTAAAAA AACATCGTAT CGATGCAATT
44851 TTAGCAGGAA TGTCCATTAC TCCTTCGCGT CAGAAGGAAA TCGCCCTGCT
44901 TCCCTATTAT GGCGATGAGG TTCAAGAGCT GATGGTGGTT TCTAAGCGGT
44951 CTTTAGAGAC CCCTGTGCTT CCCCTAACAC AGTATTCTTC TGTTGCTGTT
45001 CAGACAGGAA CGTTTCAGGA GCATTATCTT TTATCTCAGC CCGGAATTTG
45051 TGTCCGTTCT TTTGATAGCA CCTTGGAGGT GATTATGGAA GTTCGTTATG
45101 GGAAATCTCC GGTTGCCGTT CTAGAACCCT CGGTAGGACG TGTCGTTCTT
45151 AAAGACTTCC CTAATCTTGT TGCAACAAGA TTAGAGCTCC CTCCTGAATG
45201 TTGGGTGTTG GGCTGTGGTC TCGGCGTAGC TAAAGATCGT CCTGAAGAAA
45251 TACAAACGAT TCAACAAGCG ATTACAGATT TAAAGAGCGA AGGGGTGATT
45301 CAATCTTTAA CCAAGAAATG GCAACTTTCT GAAGTTGCTT ACGAATAGAG
45351 fif? T ATTCTTA TGGCAACCTC TGTTCCTGTA ACTTCATCTA CTTCTGTAGG
45401 AGAGGCTAAC TCCTCCAACG AAAGATTTAC TGAACGAACA TCGCGAATGT
45451 ATTACGCAGC TTTAGTCCTA GGGGCTTTGA GCTGTTTAAT TTTTATTGCT
45501 ATGATTGTCA TTTTCCCACA GGTCGGATTG TGGGCTGTGG TCCTCGGGTT
45551 TGCTCTTGGA TGTTTACTTT TAAGCTTAGC TATCGTTTTT GCTGTCTCCG
45601 GTCTCGTTTT AGGCAAGACT TTAGAACCTA GTCGAGAAGC GACTCCTCCA
45651 GAAATTGTTG CGCAAAAGGA GTGGACTACA CAACAAGATG TCTTAGGGAA
45701 TGAGTATTGG CGTTCCGAGT TGATTTCCTT GTTCTTACGA GGGGATCTCC
45751 ACGAATCTCT GATTGTTGAT TCTAAGGATC GATCTTTAGA TATTGATCAG
45801 AGTTTACAAA ATATATTGAA ACTTGAGCCC CTATCTACGA CACTTTCGCT
45851 GTTAAAGAAA GATTGTGTCC ACATCAATAT CATTTTACAT TTAGTGAGAC

45901 AGTGGAACTT ACTGGGAGTG GATCTTAGTC CTGAAGTCAC TGCGCACGCC

45951 GAGGAACTTC TACTCTTTTT GATAGAAGAG CAGTATTACT CTCCTGATAT

46001 TTTGAAATTG ATTCGCTACG GAGATGCTTT ACAAGCAACG TCTCCTTTGA

46051 TGGATTGGGC AGATTCAGGT TCCTTTAGTG TAGACGCAGA CGGGGTATTT

46101 AGCTGTCGCA GAGAAGAATG TTCTCCTGAG GATGCTTTGG CGCAATTCGA

46151 TCTTCTTTTG GCGTTGGAAA ATCCCGACAG ACGCTTCTTA AAGGATTCTT

46201 TTCTTACCTA CATTTGGTCG TCTTCATTTT TTGAGAAGTT TTTACATCGC

46251 CATCTAGAGA GCTTGCAAAG AAAGCTCCCA GAGACAGCGA TCGATGTCGC

46301 CCGCTATGAA GCACAAATAC AAACATTTCT CTCTCGCTAT TTTCAGAAGC 

46351 TCGATTTGAT AAACGCAATG TCCTTAGATT GGGGATATAA CTGTGCTGAG 

46401 GGAGAAAAAT GTTATGAGAG CGCAAATCAA AGATTAGACA ACCTATTTAT 

46451 TGCTTTTTCT TCTTCTGTTC CTGCTATGAA GCGGCTCTTT GACAAATATG 

46501 GTTCTGTGGT ACGGGTAGAT CGTAGGCAGA TTCGTGAGCA GATTCTTTCG

46551 AACACTGAAA TCTTAGAAAA TGAGTCAGGG TTCCTCTGCA GTTTGTATGA 

46601 ATATCCTTTA TCCTATTTGA TAGATTGGGC TGTTTTGCTA GACTGTGTTC

46651 GCGGTACCGA AATCTCTCTA GAAGATCAGG CCGATTACAC CGTTTGTTTG

46701 CAAGGCTTGG ATTCTATGTT ATCTCAATTT GCGAGTCGTT TACAGTCTGG

46751 ACAAAAAGTA TTGAATCCTA GAGATGTTTT AAGTGAACAG GCTGCGGTTA

46801 TGCTTGTTCA TGGCTTGGCA GCACAGGGCG TGTCGTTTCA AGGATTGAAA 

46851 GCTTTGATGT ATTTGACAGC CGTTCCCCAA AGAATGTGGT TAGGAGCATT

46901 GCCTTTATTT GAATCTTTTC CTGTCTTTAA TCGGATGAAA GAATTTCTTG

46951 GGGAATCTCT GGGAGACTAG GTGAATTTGT ATCAAAGAAG GAACAAGATT

47001 GCATGTTAGG TTCTTTGCCA TGTTATCCTG GTGCTGGCAA TATTGAAGAA 

47051 TACAAAAATA GGTATTTCTA TTGTCAGTTA TGTGCTGAGG TCGTTAGTCC 

47101 CTATGTTGTT CCTGTTATTG TAGTTGATGT GCAAGGGGCT CCTCCTACAG 

47151 GTATCTTGCA GGTCTTGCGT TGTAAGCAAC ATAAATTTCA AGGCCTACCC 
47201 GTACATGGCC CCATTACTTC TTTATGGGCT TTGGAGCCCG TGGGTAAGGG

47251 AGCTCCGCAG CTGGAGTCTG CAATGTACGA GCTCTGTTCT CAAGTAAGGA 

47301 ATTTTGACAT CTGCTCTATT GTGAGTTGGG TCTTTGGTGG GTTGTGTATT 

47351 TTTGCAGGTC TGATTGTCGG GGTAATGGTT GAAGCCCCTT TGATTGCGGG 

47401 ATTAAGTGCT TGGGTGATTC CCTGTATCAT TGGAGGGGTT GGTGCCATTT 

47451 TATGCTTGTT TGCGATCTTG ATGGCGTACT TGGGAAGAGG GAGAGTCCGT 

47501 GAGTGGCTCA ATCTTTCACA CGAATATATA ACGCAATGTC ATTGTCGTCA

47551 GATACAGGCA CATTCTCAAA ACTATTCTGT GATCACAGAG TATCCTGCAA 

47601 CCTGTGCATT ATCTCAACCG ATTACAAAGT TACCTAATGG ATCACGCAGA 

47651 GATAACTAAG CGTGTTCGTC AGTTATTTCT CACATTTTCT CATGAATCTT 

47701 TTACTGCGCT GCACGAGATC CCTCTCGAAA ATTTTTAAGG ATAGATACTT 

47751 GGAAACTATG GTTTAAAAAG CTATAGAGGA TTCTAAATTG GGGTTCTAGC 

47801 AACTTCTTGA CTTTAAGATC CAAAGTTAAG AGACTGACTA ATTATTTTTG 

47851 TTTGCTTGTG TTTCCAGATG AGCAATTGGT ATGGTAAGAG ATATTCAGAG 

47901 TGAATCTATA GGGAAATTAG TATTTTTAGG CACAGGAAAT CCCGAAGGAA 

47951 TTCCCGTGCC GTTTTGCTCA TGTAGAGTGT GTCAAAACAC AGGGATTCAT 

48001 CGTTTACGAT CTTCGGTACT CATTCAATAT CAAAACAAGA CTCTAGTGAT 

48051 TGACGCAGGC CCTGATTTTC GTACGCAGAT GTTAGTTGCA GGGGTTTCCG 

48101 AGCTCGATGG GGTATTTCTG ACCCATCCCC ACTACGATCA TATCGGTGGT 

48151 ATTGATGATT TACGTGCGTG GTACATAGTC ACGCAGCGTT CGTTGCCTTT 

48201 GGTCCTTTCT GCAAGCACCT ATAGATTTTT AAACAAGGCT AAAGAGTATC

48251 TCTTCGCCAC TCCGAATGTA GAGTCTTCAC TTCCCGCAGT TTTAGAGTTT 

48301 ACAATCTTGA ATGAGGACTG TGGGCAGGAG GAATTTCAGG GCATTCCCTA 

48351 TACTTATGTT TCCTATTATC AAAAGTCGTG CCATGTAACG GGTTTTCGTT 

48401 TTGGAAATCT TGCTTATCTT ACAGATCTCT GTAGCTATGA TGCAAAAATT 

48451 TTCAGTTACT TAGATAATGT AGAGACATTG ATCTTGTCTG CGGGTCCATC 

48501 GGAAACTCCT ATTCCTTTTC AGGGACACAA ATCTTCGCAT CTTACTGTAG
48551 AAGAAGCCAA AGCTTTTGCG AATCATGCAG GGATAAAGAA TTTAATTATT
48601 ACACATATCA GCCACTGTTT AGAAGCAGAG CGTGACCAGC ATCCAGAGGT
48651 CACATTTGCT TATGATGGCA TGGAGGTCCT TTGGACACTA TAGATACGCC
48701 CGGGGAACAG GGTTCTCAAT CTTTCGGAAA TTCGTTAGGG GCCAGGTTCG
48751 ACTTGCCTCG TAAGGAACAG GATCCCTCTC AAGCTTTAGC TGTGGCTTCC
48801 TATCAAAATA AGACAGATTC TCAGGTCGTT GAAGAACATT TAGACGAGTT
48851 GATCTCACTT GCGGATTCCT GTGGTATTTC TGTTTTAGAG ACCCGTTCTT
48901 GGATTTTAAA AACACCCTCA GCTTCCACCT ATATCAATGT GGGGAAGTTG
48951 GAGGAGATCG AAGAAATCTT GAAAGAGTTT CCCTCTATAG GGACTTTGAT
49001 CATAGATGAG GAGATCACTC CATCCCAACA ACGGAATTTA GAGAAACGCC
49051 TTGGCCTTGT CGTTTTGGAT AGGACGGAGT TAATTTTGGA AATCTTTTCC
49101 AGCCGTGCCC TTACTGCAGA GGCAAATATC CAAGTCCAAC TTGCACAAGC
49151 ACGTTATCTC CTTCCTCGTC TTAAGAGACT TTGGGGGCAC CTATCTCGGC
49201 AAAAATCTGG GGGAGGTAGC GGAGGCTTTG TTAAGGGGGA AGGAGAAAAA
49251 CAGATCGAGC TAGACCGTAG AATGGTCCGT GAGCGTATCC ATAAGCTGTC
49301 AGCACAGCTG AAAGCTGTGA TCAAACAGCG TGCGGAACGC CGTAAAGTAA
49351 AATCTCGACG AGGAATTCCT ACCTTTGCTT TGATAGGGTA TACAAATTCA
49401 GGGAAGAGCA CCCTATTAAA TTTGCTGACG GCTGCTGATA CGTATGTTGA
49451 AGACAAGCTA TTTGCAACTT TAGATCCCAA AACGCGCAAA TGCGTACTTC
49501 CAGGAGGCCG TCATGTCCTT CTTACTGATA CTGTAGGCTT CATTCGAAAA
49551 CTTCCTCATA CTTTGGTAGC AGCATTTAAA AGTACTTTAG AAGCAGCTTT
49601 CCATGAAGAT GTTCTTCTGC ATGTTGTCGA TGCTTCGCAT CCTTTAGCTT
49651 TAGAGCATGT ACAGACGACC TACGATCTCT TTCAAGAGTT GAAGATTGAA
49701 AAGCCTAGGA TCATTACTGT GTTGAATAAG GTAGATCGGC TTCCTCAAGG
49751 AAGTATCCCT ATGAAATTAC GTTTGCTCTC TCCTCTTCCT GTATTGATTT
49801 CAGCAAAAAC TGGGGAGGGG ATCCAGAATC TTCTTAGTCT TATGACGGAA
49851 ATCATTCAGG AGAAAAGTTT GCATGTGACT TTGAATTTTC CTTATACAGA
49901 ATATGGAAAA TTTACGGAAC TTTGCGATGC CGGGGTTGTG GCCTCGTCAA
49951 GGTATCAAGA AGATTTTTTA GTTGTTGAAG CGTATCTTCC TAAGGAGCTG
50001 CAAAAGAAAT TTCGTCCTTT TATTTCTTAT GTTTTCCCTG AAGATTGTGG
50051 AGATGACGAG GGTAGAGGGC CCGTCTTGGA GAGTTCTTTC GGGGATTAGG
50101 TAGTTTTCTT CTAGGACATC GAATCTTTGT TAGTGAGAAA AAGAGTGATA
50151 TTTTAAAATA GCCACTCATC GCTAAATCTA TTGAAGTCTC TAGAGGTATA
50201 TGACGGTTGC GGAAGTCAAA GGAACATTTA AGCTGGTCTG TTTAGGCTGT
50251 CGGGTGAATC AGTATGAGGT CCAAGCATAT CGCGACCAGT TGACTATCTT
50301 AGGTTACCAA GAGGTCCTGG ATTCTGAAAT CCCTGCAGAT TTATGCATAA
50351 TCAATACGTG TGCTGTCACA GCTTCTGCTG AGAGTTCGGG TCGTCATGCT
50401 GTGCGTCAGT TATGTCGTCA GAACCCTACA GCACATATTG TTGTCACAGG
50451 TTGTTTGGGG GAATCTGACA AAGAGTTTTT TGCTTCTTTG GATCGGCAAT
50501 GCACACTTGT TTCCAATAAA GAAAAATCCC GACTTATAGA AAAAATTTTT
50551 TCCTATGATA CGACCTTCCC TGAGTTCAAG ATCCATAGTT TTGAGGGAAA
50601 GTCTCGAGCT TTTATTAAAG TTCAAGATGG CTGTA^TTCT TTTTGCTCGT
50651 ACTGCATTAT TCCTTATTTG CGGGGGCGTT CGGTTTCTCG TCCTGCTGAG
50701 AAGATTTTAG CTGAAATCGC AGGGGTTGTA GACCAAGGAT ATCGCGAAGT
50751 TGTAATTGCA GGAATTAATG TTGGAGATTA TTGCGATGGA GAGCGTTCAT
50801 TAGCCTCTTT GATTGAACAG GTGGACCGGA TTCCTGGAAT TGAGAGGATT
50851 CGAATTTCCT CTATAGATCC TGATGATATC ACTGAAGATC TGCACCGTGC
50901 CATCACCTCA TCGCGTCACA CTTGTCCTTC GTCACACCTT GTTCTTCAAT
50951 CGGGGTCGAA TTCAATTTTA AAGAGAATGA ACCGGAAGTA TTCTCGCGGA
51001 GATTTTTTAG ATTGTGTAGA GAAGTTCCGT GCTTCTGATC CTCGCTATGC
51051 CTTTACTACA GATGTGATTG TCGGATTTCC TGGAGAGAGT GATCAAGATT
51101 TTGAAGATAC TTTGAGAATT ATTGAAGATG TAGGCTTTAT TAAAGTGCAT
51151 AGTTTCCCTT TCAGTGCTCG TCGTCGTACT AAGGCATATA CTTTTGATAA
51201 TCAGATTCCC AATCAGGTGA TCTATGAGAG GAAGAAGTAT CTTGCTGAGG
51251 TTGCTAAGAG GGTAGGCCAG AAAGAGATGA TGAAGCGTTT AGGAGAGACT

51301 ACAGAGGTGC TTGTTGAGAA AGTAACGGGG CAGGTTGCTA CGGGTCACTC

51351 TCCTTATTTT GAAAAGGTTT CTTTCCCTGT TGTAGGAACG GTAGCTATCA
51401 ACACTCTAGT TTCTGTGCGT CTTGATAGGG TAGAGGAAGA AGGGCTGATT

51451 GGGGAGATTG TATGATAGAT ATAATGCAAC ATTTTAAGCC CTATACTATG
51501 GTCCCAGGAC AAAAACTCCC TATTCCTGGA TCTTTGTTAT ATGCTCAGGT 
51551 ATTTCCTACC CTGTGGCGTC TATTTTCTTC GAAACACGAA ATCTTAAATG 
51601 AGCAGACCTT ACAGGTGCAA GGGCCTTTAA AACGCTTTGC TGTTTTCCAA 
51651 GATTTACATC GTGGGGGGCT TGCAGTGACT TCTGAGCGCT ACAAGTATTA 
51701 TCTCCTTCCC TCGGGAGAGT GCACACAATC TATCAAAGGG AAACTGCCTT 
51751 CGGCAGCGCA AGCAGGGCCC CTGTTATCTC TTGGGGTGCA TAAGCATGCA 
51801 GATTGGCAAA AGGTCCGTTG TCGTCGTGAT CTTAAAGAAA TTCTTCCCCT 
51851 ATGGTTCCGT TTCGCCGCTA TGGCTCCTAA GGGATCCTAT CGGGATCTAG 
51901 AGACGACGGC TATCGGTAGC TTGGTAAAGA CTGCCCATCA AAGAGTTTTA 
51951 CATAGGGAAA CTACAGAGAT TGCTCCTGCG TTACTCTCCA TAGCCCTTGC 
52001 GGGATTTTCA GAGTGCTTTC TTCCTAGGAG CTATGATGAA GAGTTCCAAG 
52051 GAATCCTCCC CCAAGATGGA GATCCAGAGG GGGGAGTTCC TTTTGAGCTT 
52101 CTCTCGTATA GCTTTGGTAT GATCCAAGAT ATTTTTCTGA GACACCAGGG 
52151 ACAGCTAGTA GAGATCCTTC CTGCATTACC TCCTGAATTT CCTTGTGGCC 
52201 GCTTGATTCA TGTTGCCCTT CCTAATCTTG GGACTTTGTC TATCGTCTGG 
52251 ACTAAGAAAA CTATCCGTCA GGTCGAGCTC CATGCAGAAT ATAGTGGCGA 
52301 GGTATTTTTA AAGTTTTGTT CTTCACTATG CAGTGCGCGC CTTCGGGAAT 

52351 GGTCGGAGCG ACGTCTCTCT GGATCTAAGA GACTTTCTTT AGGAGAAACT

52401 CTGGAGATAA AAGCAGGAAC CACATATTTA TGGGATTGTT TTCATAAATA
52451 GATAGCCTTC CATGGTTGAT AAACTGATCC ATCCTTGGGA TCTTGATCTG
52501 CTCGTCTCAG GACGACAGAA AGATCCCCAT AAACTCTTAG GGATCCTTGC
52551 TTCTGAAGAT TCTTCAGATC ATATTGTTAT TTTTCGTCCA GGGGCGCATA 
52601 CGGTTGCTAT TGAACTTCTA GGAGAGCTTC ACCACGCTGT AGCTTATCGT

52651 TCGGGGCTCT TTTTCTTATC CGTTCCCAAA GGAATCGGAC ACGGGGATTA 

52701 CCGTGTGTAT CATCAGAATG GACTTCTCGC TCATGATCCC TATGCGTTTC 

52751 CTCCTCTGTG GGGAGAAATT GATTCTTTTT TATTCCATAG AGGAACGCAT 

52801 TACCGCATTT ATGAACGCAT GGGGGCAATC CCTATGGAAG TTCAAGGAAT 

52851 CTCAGGGGTG CTCTTTGTTC TTTGGGCTCC CCATGCGCAG AGAGTCTCTG 

52901 TAGTCGGAGA TTTTAATTTT TGGCATGGCC TTGTCAATCC TCTACGTAAA 

52951 ATTTCCGATC AGGGGATCTG GGAGCTTTTC GTCCCAGGCT TGGGAGAGGG 

53001 AATACGGTAT AAGTGGGAAA TCGTTACCCA ATCGGGGAAT GTGATTGTAA 

53051 AAACAGATCC TTATGGGAAG AGCTTTGATC CTCCACCCCA GGGTACAGCT 

53101 CGTGTTGCGG ATTCTGAGAG CTACTCTTGG AGTGATCATC GTTGGATGGA 

53151 GAGGCGCTCG AAGCAGAGTG AAGGGCCCGT CACGATCTAT GAAGTGCACT 

53201 TAGGCTCTTG GCAATGGCAG GAGGGAAGGC CCTTAAGCTA CAGCGAAATG 

53251 GCGCATCGCC TTGCTAGCTA TTGCAAGGAA ATGCACTACA CTCATGTGGA 

53301 GCTTCTTCCC ATTACGGAGC ATCCCCTGAA TGAATCTTGG GGCTATCAAG 

533 51 TGACGGGATA TTATGCTCCA ACATCAAGAT ACGGGACTCT CCAGGAGTTT 

53401 CAGTATTTTG TAGACTATCT ACATAAAGAA AATATTGGTA TTATTTTAGA 

53451 TTGGGTGCCG GGACATTTTC CCGTAGATGC GTTTGCTCTT GCCTCTTTTG 

53 501 ATGGGGAGCC TCTCTACGAG TACACGGGGC ATAGTCAGGC TCTTCATCCC 

53 551 CACTGGAATA CGTTTACCTT TGACTACAGT CGTCATGAAG TGACCAACTT 

53 601 TTTACTAGGG AGTGCTTTAT TTTGGCTCGA TAAGATGCAT ATTGATGGCT 

53651 TACGTGTGGA TGCTGTGGCC TCTATGCTGT ATCGTGATTA TGGCCGTGAA 

53701 GATGGAGAAT GGACGCCTAA CATCTATGGA GGTAAGGAGA ACTTAGAGTC 

53751 TATAGAATTT TTGAAACACT TAAATTCTGT AATTCATAAG GAGTTCTCTG 

53801 GAGTGCTCAC CTTTGCAGAG GAATCCACAG CGTTTCCAGG AGTCACTAAG 

53851 GACGTAGATC AGGGAGGTCT GGGGTTTGAT TACAAATGGA ACTTAGGTTG 

53901 GATGCACGAT ACCTTTCATT ACTTTATGAA GGATCCCATG TATCGTAAAT 
53951 ACCATCAGAA AGATCTGACA TTTAGCCTTT GGTATGCCTT CCAAGAGTCT

54001 TTTATTCTTC CTCTCTCGCA TGACGAGGTG GTCCACGGTA AGGGCAGCTT 

54051 AGTGAATAAG CTTCCCGGGG ATACCTGGAC CCGATTTGCT CAAATGAGAG 

54101 TGCTCTTGAG CTACCAGATC TGTTTGCCTG GGAAAAAGTT ACTGTTCATG 

54151 GGTGGGGAAT TCGGACAATA CGGCGAGTGG TCTCCTGATC GTCCCTTAGA 

54201 TTGGGAGCTT TTGAATCATC ACTACCACAA AACTTTGCGA AACTGTGTCT 

54251 CTGCATTGAA TGCGTTGTAT ATTCACCAAC CCTATTTATG GATGCAAGAG 

54301 AGCTCTCAAG AGTGCTTCCA TTGGGTAGAC TTCCATGATA TAGAAAACAA
54351 TGTCATTGCC TATTATAGAT TTGCAGGCAG CAATCGTTCT TCGGCGCTTC

54401 TCTGTGTCCA TCATTTCAGT GCGAGTACTT TTCCTTCCTA TGTTTTAAGG
54451 TGTGAAGGTG TAAAGCATTG TGAACTCCTT CTCAACACTG ATGATGAGTC
54501 TTTTGGAGGC TCAGGGAAGG GAAATCGGGC TCCTGTGGTC TGTCAAGACC
54551 AAGGGGTCGC TTGGGGTTTG GATATAGAGC TCCCTCCTTT AGCTACTGTG
54601 ATCTATTTAG TTACTTTTTT CTAAAAATTT AAATACTTTA TTTGTAAATT
54651 GTTGTGGGAT TGTTCTATTT TGTGGTGTAG TTGATATTAA TAATTTATTT
54701 TATAATTAAA AATAATTATT AGTATTTCTT TTATGTCTAC ATCACCAATT 
54751 AGCAACGATC CCCGATATTT GTCTTTGTCT AATGCAACTG AGAAAACTTC 
54801 TCTTCTTGCA AATAGCCGGA GTCTCTCGCC AGTACCAAAT TCCCTAGTTC 
54851 CTAGCAATCC TGAAGATACA GGATTGCGAA AAAGTATTTT CACCCATTCC 
54901 GTGACTTTAT TTGCTGGCCT GGTTGTTTTG CTGGTAGCGG TTTCTGTTGT 
54951 TGTTGTCGCT TTGACCGTCT TAGCTCCCGG AGTTCCTCAG GCTATTCTTC
55001 TTGGAATCGC CATTTCAGGC GTGGGTATTG GTGGATTTTC TATAATGAAG 
55051 AGCTTGGTTT ATATGGTCCG AGACTATATG TCCCCCAGGA TGCAGGAGTC 
55101 GAGCAGAATC AAAAGTGCTT TAGCTGTAGG GACTGGATTT ACTGTCATGG 
55151 GTTTGGTCAT GAAGGTGGGG GCGAATTTTG TTCCTGGAGG GTATGGGGGT 
55201 CTCGTGGGTA GCTTGGGATC CAGTGCGTAT TCCCGGGGAA GCCAAACCAC
55251 ATTAGCAAGC TTCAGTCATT ATATTTATAC TAAGTTTTTC CGTTCTGAAA




55301 AAGTTGCTAA AGGGGAGAAG CTTACAGAAG CAGAAACTAT AAAAGAGGCG
55351 AAAAAATTAC ACTATATCAC GTTGTCAATT GCCACTATTG GCGTTGGTCT
55401 TGCGGTTTTG GGGATTCTCC TTGCCATTGC AGGAACGGTA TTGCTAGGAG
55451 GCGCTCCCGC AACGATTGCT ATTATTTTAG CTCCCCCTTT AATTTCTATA
55501 GGGCTTACGA CGGTTTTGCA AACGATACTC CATAGTAGTA TCGGAAAGTG
55551 GAGAGCCTTT CTGCTTACTC AAGAAAAAAA AGATCTTTTT GTAGACACCT
55601 CCCTGAAAGA CATTCGCTTA GAAAAATTGC CCCCCAGTGA GGTGGAAGAG
55651 AGTGAAACTT CCCAATCTGT GATAGAAGTT CCAGATTCAG AGGGGATTGC
55701 AGAGACGAGG ATCTCTGCGG AAGAAATCGA TACGAGGCTT TCCCTGACGA
55751 CAAGACAGAA GGTCATCTTT GCTCTTGCGA CACTCTTGCT CTTAGCAAGT
55801 ATTGCTGCCT TCATAGTCAC GGGATTTGGT GGATTGACAG TCATGCAAGT
55851 TCTCCTTGTT GCTTCTGTAG GATCGGCGGT TGCTTCTGTA ACACTCCCTA
55901 TGGTTTCCTC AGGATTTTCC TACGTCGCCT ACCAACTGAA AGCAAGATTG
55951 AATATCAGTA AATTACGTTG GAAAGAAGCA AAAAATAAAA AGCGGGTGCG
56001 CCAGTTCTTA ATTGAGTCTG GAGTGATTGC CTCGGATCGA GAATTTAACC
56051 AAATGTGGAA GACAGTCTAC AAAAAACAGA TTCAGAAGAC TGACGCTGCA
56101 ATTCGTGAAG AGGTTCGCAA TTTTGAGAAG GGTGGGGAAG TGAACAGCGC
56151 CCTTGTTGGT GGAATCTTAC TTGGTGTAGG AACTGGGATC ATGCTTCTTG
56201 CCCTGGTCCC TGCATTTGCT CCTATCGTTC CTGGTATTCT TGCTCTTGGA
56251 GGATCGACGT TAGGAATCGC GGGATCGATT TTAATGAGGA AGTTTGTCAA
56301 CTGGCTCTAT GATGAGCTTG TGAAGCTCTA TGAGCGTCGA CGTAATCGCC
56351 fiTCSAGCTTCT CTATGGTCCT GAAAGTAAAA TGCGCTCCAT TGCTACGGAT
56401 TTAGTTGTTG AGGCTCTTGC TGCTAGCCAC GATCATCTAT TTGATCTTGA
56451 TGGTCCCGTA GATTTTATTG ATGTGGATGT AGATATAGAT GGAGCTGCTT
56501 AGGCCAGGTC CTTGAATGTA AGATCCTCGA GCTTTGGGAG CTTGTCTGCC
56551 TTCTCTAATT TTATTTTCTC TTTTACATCT AGTAGTTTAC TTAATTATAT
56601 AATCAGCATT CTTTTTGTTT ATTTTAATT1 ATATTTTGTT TTTAAAATAT
56651 TTTTTATTTT AACATTTGTT TAATAGTTTT TATTAAATAA TTTATCATTT

56701 TAAGGTTCAA TTATGGCAGT TGGTGGCGTA GGCGGCTCAA GATCTCCTTC 

56751 CCCCATTCCT CCTAATAGAA GGAATAGTGA GGATGGAAAA GTAAGTCCTA 

56801 AAGACAACTT AGGGGAACAT ACAGTTAGCA GTAGTGACAG TAGTCTTGCA 

56851 AGTCAGGGCC CTACAATAGA AGAGAGAAAA GCCCAGTTAG GCGGGACTGA 

56901 TAAAATTCGT TTGCCATCTG TCAAAGAACC CGGAGATTCT CAAACTTCAG 

56951 GACGTTCTGG GGTACTTCAG AGAATTTGGA AAGGCGTTAA AGGGGTCTTT 

57001 AAAAAAACCC CTCAAGCGCG TCCTGAAGTT TCTAGTCCAC GTCTTCCATC 

57051 CCATGTGCAA CATGGCCAAC GTCTTCCTGG ACTCGAGGGC TTTAGAGATC 

57101 GTATCCAGAA AAGATCTGAA AATCCAGAGG CAGATTTAGG GAAGATGAAA 

57151 CGTTCCTATT CTGATGGTGA CCTTGATCGA GTAGGACACG ATTCTAATGA 

57201 AGATTCTACA GAGGATAGCC GTTCTGAAGG AGGAGAGCCT TCTTCAAAGA 

57251 GTTCTTCCTT CTTATCAGGA GTTCGAGGAG CGGTGTCTAA AGTTCATGGT 

57301 GCCCTAGGTG ATATTAAAGG AAAGTTCCAG CGTTCTGCTT CCGAAGATGA 

57351 TTTAACAACT CAGGGCGAAG ATTCTGCCGG CGATACTGTA AAAGAAAGGC 

57401 GTTCCGAAGA AGCAGAGGCT TCTTCGAAGA GTTCTTCTTT TTTATCAGGA 

57451 GTTCGAGGAG CGACGTCTAC AGTTCAGGGA GCCTTAGGTG ACGCTAAAGA 

57501 GAAGGTTTCG GCGTTCGGAG AGCAGGCTGC AGGTGCAATC AGATCAGCAC 

57551 CAGGGAATAT CAGAACTAGA TTCCAACGTT CTTCATCGGA AGGTGATCTT 

57601 TCTAATGTGA ATAAAGCAGC AAAACATCTG CGTAAGGCTT TAGAAAATTT 

57651 GGAAAAAGTA GCTCCAGAAC AAGTGTCACC AGAGGTGGCT TCTAGGGTGC 

57701 AATCTCTTCT TGCACGCATG GAGCAATTGA CTCATCAGGA ACCTCCTACT 

57751 GTGGAGGATC TTATTACTTT CGTAGAATCC AATGTAGGTA GTGATTCTGT 

57801 GGAGTATGCA TCCATCGTAC CTCAAGATGG ATCGCAAGCC CCAGCAGAGA 

57851 CTGCGGAAGC TCCCGAAACA GGTGGGGTAG AGGGATCTGC AGCGCAGGGA 

57901 GCATGGAAAG CGTTACGGGA TTTTGTAGTT AGCATATTCC AAGCGGTAGC 

57951 GAGCTTCTTT AGGGCAATTG CTTCAAGATT AAGTTCAGCA CGACGTGAAT 
58001 CAGCTGTAGA TGATCTTGCA TCAGAAAGTA ATACACAATG GTTTGTGGAG

58051 CAAGAGGGCG TTTCAAATCC ATCGGCTGCA CCTAGCTTAT CTTTTGCGGA

58101 AGAGATCGCT CGTAGAGCTG CAGAAATGAG TAACAGAAAT GCCCAGAGTC

58151 TTGAAAAATT GGAATCAGGC AATGTGACTG ATCCTGTCAT TCAACAAGGC

58201 TTAGGATTAG CTAGATCATT TGCTCCAGAG GGACAGTAGT CGTTATCTCA


58251 CITGTTTCTCT ATGCGCAAGG GAAACTTGAA GAGTTTTAAT TAAAACTCTT

58301 PAATATGTTG ATTATTTTAA TATATTTAAA AGCATTTTTG TTGTTTTTTA

58351 ATAAAATTAA ATTGTTTPAG AAAAAAGATT ATTCTTTTTA GGAAGTGTTT

58401 ATP.P.PATPAG GAATPGGAGG ATCTAGTGGA TTAGGAAAGA TTCCACCTAA

58451 AGATAATGGG GATAGAAGTC GATCGCCCTC TCCTAAGGGA GAACTTGGCA

58501 GPPAPGAGAT TTPPPTGCCT CCTCAAGAAC ATGGAGAGGA AGGAGCTTCA

58551 r^o ATPTTPGP ATATAPATAG CAGTTCCTCT TTTCTACCAG AAGATCAGGA

58601 GTPTPAGAGC TPTTCTTCGG CAGCTTCTAG CCCGGGATTT TTTTCTCGCG

58651 1 rav*\J 1 1 \v 1 ova GGTAGACAGG GCCTTAAAAT CATTTGGCAA CTTTTTTTCC

58701 GPAGAGTCTA CGAGTCAAGC GCGTGAAACG CGACAAGCTT TTGTTAGATT

58751 ATCAAAAACC ATCACCGCGG ATGAGAGACG GGATGTCGAT TCATCAAGTG

58801 PTGCTGCTAC AGAAGCCCGA GTGGCAGAGG ACGCGAGTGT TTCAGGCGAA

58851 AATPPTTPTC AGGGGGTTCC AGAAACCTCT TCTGGACCAG AACCTCAGCG

58901 TTTATTTTPT CTTCCTTCAG TAAAAAAACA GAGCGGTTTG GGTCGGTTGG

58951 TAPAGAPAGT TCGCGATCGC ATAGTACTTC CTAGTGGGGC TCCACCTACA

59001 ^iapap-pgagp CTTTAAGTCT CTACGAGCTA AACCTCCGTT TGAGTAGTTT

59051 AP/1TP A12P.AP. CTCTCTGACA TACAAAGTAA TGATCAGTTG ACTCCAGAGG


59101 AAAAAGCAGA AGCCACAGTT ACCATACAAC AGCTGATCCA AATTACAGAA

59151 TTCCAATGCG GCTATATGGA GGCAACACAA TCTTCGGTAT CTCTAGCAGA

59201 AGCTCGTTTT AAGGGGGTAG AAACTAGTGA TGAGATCAAT TCCCTCTGTT

59251 CAGAACTGAC AGATCCTGAG CTTCAAGAAC TCATGAGTGA TGGAGACTCT

59301 CTTCAAAACC TATTAGATGA GACTGCCGAC GATTTAGAAG CTGCTTTGTC

59351 CCATACTCGA TTGAGTTTTT CTTTAGACGA TAATCCAACT CCGATAGACA

59401 ATAATCCAAC TCTGATTTCT CAAGAAGAGC CTATTTATGA GGAAATCGGA 

59451 GGAGCTGCAG ATCCTCAAAG AACTCGGGAA AACTGGTCTA CAAGATTATG 

59501 GAATCAGATT CGCGAGGCTC TGGTTTCTCT TTTAGGAATG ATTTTAAGCA 

59551 TTCTAGGGTC CATCTTGCAC AGGTTGCGTA TTGCTCGTCA TGCAGCTGCT 

59601 GAAGCAGTGG GTCGTTGTTG CACGTGCCGA GGAGAAGAGT GTACTTCTTC 

59651 TGAAGAGGAC TCGATGTCGG TGGGGTCTCC TTCAGAAATT GATGAAACTG 

59701 AAAGAACGGG CTCTCCGCAT GACGTTCCAC GCAGAAATGG AAGTCCACGT 

59751 GAAGATTCTC CATTGATGAA TGCCTTAGTA GGATGGGCAC ATAAGCACGG 

59801 TGCTAAAACC AAGGAGAGTT CAGAATCAAG TACCCCGGAA ATTTCGATTT 

59851 CTGCTCCCAT AGTGAGAGGT TGGAGTCAAG ACAGTTCCGT CAGTTTTATT 

59901 GTTATGGAAG ATGATCATAT TTTCTATGAT GTTCCTCGTA GAAAAGATGG 

59951 AATCTATGAC GTTCCTAGTT CCCCTAGATG GAGTCCTGCG CGAGAGTTGG 

60001 AAGAGGATGT TTTTGGAGAT TATGAAGTTC CTATAACCTC TGCTGAACCA 

60051 TCTAAAGACA AGAACATCTA CATGACACCT AGATTAGCAA CTCCTGCTAT 

60101 CTATGATCTT CCTTCACGTC CAGGATCGTC TGGAAGCTCA CGTTCTCCGT 

60151 CTTCAGATCG CGTACGAAGC AGCTCACCAA ATAGACGGGG TGTGCCTCTT 

60201 CCTCCAGTTC CTTCACCTGC TATGAGTGAG GAGGGGAGCA TTTATGAGGA 

60251 TATGAGCGGT GCTTCAGGTG CAGGTGAAAG TGATTATGAA GATATGAGCC 

60301 GTTCCCCCTC TCCTAGAGGC GACTTGGATG AACCCATATA TGCTAATACT 

60351 CCTGAAGATA ATCCATTTAC TCAGAGAAAT ATAGATAGAA TTTTACAGGA 

60401 GAGGTCAGGC GGTGCTTCCG CTTCTCCTGT AGAGCCTATT TATGATGAGA 

60451 TCCCATGGAT TCATGGCAGG CCCCCTGCTA CACTTCCAAG ACCCGAGAAT 

60501 ACATTGACTA ATGTTTCGCT TAGAGTGAGC CCAGGGTTTG GACCAGAAGT 

60551 AAGAGCCGCT TTGCTTAGCG AGAGCGTGAG TGCTGTTATG GTCGAAGCAG 

60601 AGAGTATTGT TCCTCCAACA GAGCCGGGGG ACGGAGAATC AGAATATCTA 

60651 GAGCCCTTAG GGGGACTTGT AGCTACAACG AAAATCTTAC TACAAAAAGG 
60701 ATGGCCTCGT GGAGAGTCGA ATGCTTAGGA TTTAAGTAGT TCTTTCGAAT

60751 CTCTAGTGAG GATGTATCGG GTTCTTAATT TTTATGGGGG AAACGTATCT 

60801 GTGGATTTCC CTTAGCTTCT CCCATAAGAT TCATGATGGT AGAGAGTGTT 

60851 CACTACTCTC TATTTGGTCT TTACAGGTTG CATTGTCTAT ATAACATGCT 

60901 TTTAAAACTT AAAGGTCGTT CCTGCGTGAA GGTAATGTGT CGTCGTTGAT 

60951 GAGGATACCG AGCCTTGATA GTCTAAGAAC ACCGAAAGTT TAGGGAAGAT 

61001 AAAAATTTGG TTTCTTCCTT TAAAAGCAAT GGCATTGCGA GCAAGGGTGG 

61051 TTCCTGATAG GAGCCATGAC GATCCACTAG ATTCTAGACT CACGTTGATC 

61101 TCAGGATTTT GTTGGTAGAG GACAGGCTGA TAAGCAAGCT CTATGTTCCA 

61151 ATAGGTAGGA AGACGGAACT TGGATTCCCA AGCGCTCTGA ATTCCCAGAG 

61201 GGACTGTCAG GTTATATAAG GGTTTATGAA CAGAAAATTT TCTAGCTTTA 

61251 TCTCCACTTT CTTGAAACGC AGTTTGATTA GAACGAACGG CAATTGCTTG

61301 GATAAAAGGA GTGAAGTGGA GAGGTCGTGA TCGCCATTGT AGAGATAGAG
61351 AGCAAGAGAG AGCCGCCCCT AATGTCGTAC TATAACATTT GCCTTCCGTT 

61401 TGTATTTTTC CAGAATATCC AGATGCTTTG ATATGGTGGT TGCTGTAGCT
61451 GTAGGCTAGA GATGCAGATG TAGAGAATCT CTCTTGCAGC CAAGGATTAT
61501 TGATCTGGAG CGCTACAGTT GTCGTATGCG AAGCCACGGA ATTGTCGGAG 
61551 TGGCTCTCGT AGAGATTACT GAAAAGTTGG GAGAAGTTTA CACCAAAGCT 
61601 ATGATTAGAA GCAGTGTTTG AGGTTGTTCC CAAAGAATAA CCCGTAGCTT 
61651 CCATATGGAA TCCTTTCGCA TCATTGTTGC TATTTTGATG CACGAAGAGT 
61701 CGAGTAGCTT CTCCAGAAGC TGTAGGTGCT ATTTGGCCTT GCTGTGTTTG 
61751 ATAACGTAGT GTCGCAAATA AGTTATGGAA AGATTGCCAG AAGGCAGATA 
61801 GGGCAATGTC TCCTTTGTTT TCTGGGTTTA CCTTATATCC TGTAGGTGTC 
61851 CAATCACCAT AAAGCTGGCG ATGTAAAGTA TTCACAGTAT CTTCAGAAGA 
61901 GGTATCAGAA GXTGTGATTG TTTCGATCCA GTAAGGGGAC CAAACGCCTT 
61951 GGTAGCCGTA GTGTTGAGTT GTATTTAGAC CCTCAGGGTA GAAATTATCC 
62001 GTATTAATAT GTTTAGCTGT GACGTCTAAG AGATACAGAA GAGGAACTTC 
62051 TGCGATAGGT TGGGCAAGGT CTGCAGTATC ATAGGGATCT AGGTTCTCGT

62101 CATCCAGTAG GCTCAAAGGT CCTGAGAGAT TGATTATAGG GTTATTATCT 

62151 TCGCTATAGG GTGCTGATGA ACCTGTGGGG CGAATCCATA GCTTGGGAGC 

62201 AACTCTGTTG CCTAAGATAG AGGGAAGGTT AATTGCAAGA TTATTGATGT 

62251 TAATTACAGA ACCCACACTA CTGCTACTTT GTTCTTCGTC TGTTGTAGAA 

62301 AACACAGCTC TACTGCCTAA CCGTAGAGTC CCACCAAATT GATCAAATTT 

62351 ATAGACTTTC CACTCTGCTC GATCTTCAAG AGCGAGTGTG CCGTTGTACA 

62401 GTCCAATGTG GTTTCTGAAA TGTGAAATGA AGTCATCACG AGAAGTCGAT 

62451 GTATCCGGAA TATATGTTGA GGAGAACAAG ATAGTTCCGA GGTGTTCTGG

62501 ATTAGGATTA AATTTTTGGA TAGAGTTTTG TATAGTATAT CTTTGTAGTA 

62551 TGGGATCATA GAAGGTAGCA GAATGACCTT GACTTGCTCC AACTGTTAAT 

62601 GAGACATTAC GCGTGCAGTT TACAGAAACA TGATTGCTGA AAGTATCTTT 

62651 GAAGTGTCTA TTATTATAAA AAATAATATC TCCCTGATCA GCAAATAAAG 

62701 TGCATGCACC ATCTTGACGG AGCATGATAG CGCCGCCCCA AGTTCCCTGA

62751 TTGTTTGTGA AATAGACGGG ACCACTGTCT TGTATAGTTA GAGATTGTGT 

62801 ACAGATAGCA CCTCCATCTC GTGCTGCAGT ATTATTATCG AAGGCTGCAA 

62851 TTCCTGGGTT GTCTTTTATA GAACAACTAA TGCAATAGAT AGCCCCTCCA 

62901 GAGGAATGGT TAGCAGAGAT GTCCGCTTCC ATGGCAAAAT TATTGTTGAA 

62951 GATCACAGAA CCGGTATTCT TTGTAAGAAT GCACTCTTGA TGTACTCTTA 

63001 TTGCACCACC CAGACCTGAT TGGTTATTCA AAAAATAGAT AGGCTGAGAA 

63051 TTATTCTCAA TTCTACAAGC ATTAGCGAAC AACGCGCCCC CCGCTGTTCC 

63101 GCCTGCAGCA TTATTAAAAA ACAGGCAAGG GCCAGTGTTG TCCTTAATGT 

63151 TTATGATTGC AGCTTGGATT GCTCCTCCTG AAGATTTTGC CTTGTTGTTA

63201 ATGAAGTATG CGGTTCCTTG ATTTTTTGAG ATTGTAACAT TTTTCGAACA 

63251 TAAAACAGCT CCCCCTGTAC AAGTATCAGC GAAATTACTT GCATTAGGAA 

63301 AGCTTAAATT CCCAGAGAAA ATGATGGAAC CATGATTCTC AGAAAGATCG 

63351 AAATTACCAT TCACATACAT CGCACCAGCT CTTTTAATAG CAAAGCTATT 
63401 TAGGAAAAGA ATTTGGTTTT TTGTATTCGT TATGGCAAGT GATTTGCAAG

63451 AGAGAGCACC GCCGTCTTGA GAGAAGTTTT CGAACCAGCT TTCTATGGAA

63501 TTCTGGTGAT CGAGGACAAT GTCTTGGTTA GTGTCATCCC TAACTCCAAA 

63551 AAGTGTTGCT CTATGAGAGT AGGGAGTCAT GTTAGTAAGA GTATCAATTA

63601 GAGGGAAGAG TGTTGTGAGT TGATTTGCTT GATTATCAAA ATAGTCAGAC
63651 AACGGAGTCG CATTAAGGAG TATTGTAGTT TTACCTAAAA TTAAGGCTCC 
63701 AACAAAGAAG GAAGATTTAC TAAGGGATCT GTTATTTTGC ACTGTTTATT 
63751 CCTTGATGTT TTCTTTGTGG TTAAAATGTG CAATGACTCT CAGCTTTAAG 
63801 GTATTGACCT ACAGTTGACG AGGAGACTTG AGCTGAATAA TCTAAGGATA 
63851 AGGTTACTCT TGAGAAAAGT TGGGAAGTAT TTTTTATTTT TGCAGCTACG 
63901 GAATTATAGG AGACGGGAGT TGCTTGTGTT GTCCATGTTC CATTGCTGAT 
63951 GAGTAGTGTC GTGAACATTT CTGGATTTTT TCTGTATAGG GTAGGTACGT 
64001 AGGATATTTC CGTAGTCCAT AGCATGGGGA TATGATGTGA AGTTTTCCAT 
64051 TCAGAACGGA AGCCTATGGG AGAGGAAAGA TCTGTAAGGG GATGTTTTGG 
64101 ATGGAATTTT CTTATATGGT CTCCAGTTTC TTGGAACGAG GCCTGGGAAC 
64151 AGCGCAGAGC AATGGCACTG ATAAAGGGCT GGAGTTCGAG AGTGCGGGTG 
64201 ATTCTAGCTG GTAAGAATGT GCAGTCTAGA GAGGCTACCA AAGTGTGGTT 

64251 ATTAAAGAAG GCTTTGGACG ACCCTTTTAA GATTTCTGTA TAGTGGCAAA

64301 GCATATGGTG ATCTCCGTAG CTATAACCTA GGGATAGCCC TGTAGAGATG

64351 AAGTCCCTGA AGAGGAGACT GTCGAAGCGG AGTCCTGCAA AGTAGTTGTG
64401 GGAGGAAGTC GTACTTGGAG ATTGACGTTC TCTAGTTTTG GAGAACATTT 
64451 GTGCGAATCC TAAAGAGAAA CTATGTCGTG CTGCAGTTTT TGCTGAGGTT 
64501 GTTGCTGCAT AGCCCGTAGT ATGGTTTCGG AAGCCTTTGC GTCCCTCGCG 
64551 ATTATGTTGG TTAATTAGAA GCCCGAGTCC TTGCAGAGAG GCTTCAAGGT 
64601 CATGCTCTTT GAGGTTTTGT GGAGGTAAGA TGCGGATTCC TAACAGAGCG 
64651 TTATAGGCAG ACTGCCATAA GGTATTAGCA ATAAATTCTC CGTGACGTTC
64701 CGGGTTAGGG CGGTATCCTA CAGGAGTCCA GTCTACGTAG AGCTGCCTGT 
64751 GGTTTGTATT GGTCTGTTCC GGTACTGTAG AGC I 1 b 1 1 vj I x fTPPT A PTT no 1 v.o i. /to J. l

64801 TCCATCCAAT AGGGAGACCA GATTCCCTGA TATCL ATAQj I bL 1 L.A1 V- 1 AA

64851 GTTCATGGCT TCTACAATGA GATTCGAAGT A J. LuAJ. 1111 TTT P. P & P. TP A ill oV»nv7 1 wn

64901 CATCGAGGAG GTAGAGGAGG GGGGATATCC 1 1 ILonwl i PIP. iP.CiP.ATP

64951 TAAGCTATCA TAGGGGTTTT CATTTTCATC bill Ab/Wift PTPA APP^TP

65001 CTGAGAGAGT GATAGTAGAA GAAGTGTCTT /-"•a/"* A AT A PPT LAunn 1 Abo I PP ATPPTPTT Vj i Vw>w lull

65051 AATGTAGGAT AAATCCAGAA CTTTGGAGL 1 P A PP PTTPTP ATTRTAAAAT

65101 AGAAGGAAGA TTGATCGCGA TTGCATTAAA A 1 1 1 A 1 uunu PTTP PPGGG C

65151 CTTTCGTCCT GATTAATGCT GCGTTTCC I A AAL.U 1 Aunn 1 P.PPPPPAP.TT

65201 TGCGATAGGG TTTTGCAAGA AATAGCAGCC fT* 7i TPTTP A A LoAlLl Itnn TAPPPAPPAP

65251 ACCCCTTTCA AGTCGTGAAG AGTTAGAAAA Till GA 1 Abb 1 IPTTPi 3iTP AAo 1 1 bnn J. u

65301 GATTTGTTGC GTTAGAATCT ACATTGATTC r>r>r* a a a a r 1 A & LbunnAnLnn P A PPP/TPPP A

65351 AGGTGATGGG GTTCATAATT AAATACTATA GGATbibl lb TPPTPTPATP 1 Lbl V. 1 on i Vv

65401 GTGATCTATA GGATCATAAA AGAG AATT 1 1 A 1 AACt-^- lbl PTTPPTPPTA

65451 GTTTTAAGTT AATCCCCGGA GCAGCATAGA /^mp f» A TTTPT bl uLnl 1 1L1 LTATPPPPPT

65501 TGAGGAGAAG AAGATGTGAT TGTATTATTG 1 i AAA 1 nunn TATPPPPPTA

65551 GTCTGCAGAG AGGAAGAAAT TTTG AoGAG 1 AV. 1 il«l«lrVlrV ppAPAAAGAT

65601 TGATGAGAGC CCCTCCTGAA GTCGCAGAGT 1 A 1 1 AA 1 AAA TPPTPTPPP A

65651 CCGTTATTTT GGAAGATGAA AGATCTCGiu Tr*T 2i T A P. P TP ibl Al nbL 1 U PPPPPPTAAP

65701 TGCTGCCGTT TTATTGTTAA AG AT AAG AL. L TTTrPP ATTP Ill uubnl 1 \-* TTPTP A ATP A

65751 CTAAGGAGGT ACACATGATA CCGCCACCAC LbobAl A 1 Ab TTTTPPTP A T

65801 GCTGTGTTAA TTGAGGATGC GGAGTGATTG /Tn/"» a Tf"*TPT & L IbAl L1L1A T A A TTTPTTT


65851 ATTTGAAGAG ATAATGACTC CTAGGGCAGA AAATATACCA CCCCTTGTCC

65901 TGAAGCATTA TCATTGATTT GGATGTTTTG ATAATTCCTT TCTATATTTA

65951 CGGCTCTGCA GAAGATGCCT CCTCCAAAGC TGGAATCTTC TAATGTTTGA

66001 TTTTTCTTGA TAACGATAGG ACCTAAGTTG TCTGTGATTG ATAAACTCAC

66051 CCCAACATAG ATAGCTCCTC CTCTATTTTT AGAGACATTA TCTAAGAATT

66101 GTCCTTCTCC TAAGTTATTT GTAATATAGC TATCTAGTGC ATGTATAGCC

66151 CCACCAAATC CTGAAGTTGT AGTAGTAGCT AAGGAAGCCG CATTTGAAAT 

66201 AAACGAGTAG TTCTGATTCT TAGAAATCCA GCATTCCCGA ACACTGTAGA 

66251 GAGCCCCTCC AGAACTGTGG GAGCTATTCC TTTCAAAACT TAAGTTTCCT 

66301 TTATTTTCAG AAATGAACAA ATTTTTACAT GCAAGAATGC CTCCATCCTC 

66351 GTAGTGGTAG TTATAATCTA TAACAGAATT GATAGAGTTT CCTGTAATTG 

66401 TGATGTCCTG GGTTATATCG TGACCGAATC CATTAAGAAG ATTAGAGGGA 

66451 CGAAAGGAGG AAACCAAGGG CTCTGGAGTT ACATTCAGAC GGAAGCTTGG 

66501 AGACGTGTGG AAAGCAGAGA TGTCTTTTCT AGACATCTGA CAAGAGGCGA 

66551 GGTTAGGGAC TTCATTTCCT GATAAGGAAC AACATAGCGC AGTGGATAAA 

66601 ATGCTGAGAC AAATGGGTCG CATAAACATC CAAACTTTGA TATGAAAATA 

66651 GAGTTTGTCA TAAAAAATCA TCAATTTGTT GTCTAGAAGT TTATAAATTT 

66701 ATATTTAAAT ATGCTTATTA ACATGTTGAA TATTACAGAG TTAGTAAACT 

66751 CATATATTTG AATGATTTTC TATATTTAAA TTTTGAGGGA GTCCTCTACC 

66801 GATTGGGCTA ACGTAGCTCA TCGGATAGTT GTTAATTCTA AAGATTTCAA 

66851 TTTTATCGTT CTTTTATTTT AGAATTTTAA GGTACTTCCT GCTTGGAGAT 

66901 GGTGCGTAGA TGTCGAGGAG GAGACCGATC CTTGGTAATC CAAGAATAGA 

66951 TCGAGAGAAC GGAAGAGCGC AGTTTGATTG TGGACTTTGT ACCCTAAAGC 

67001 ATTGCGAACA TAGTTATGGC CTAGGATATC CCAGGAACCT CCGCTCGCAA 

67051 GTAGCGTGAC ACCGATTTGG GGATTTTGTT GATAGAGTAC CGGTTGGTAA 

67101 GAAAGTTCTA GAGTCCATTC TGTAGGTACG TGGAATTTTG ACTGCCATTT 

67151 TCCTTGGATT CCTAGAGGTA AGGTCAGATT ATAGAAAGGC TTTTGAGAGA 

67201 CAAACTTTCG GGGATTGTCA CCAATCTCTT CGAACGCTGT TTGGTGAGAA 

67251 CGTATTGCAA TTGCCTGAAC GAACGGGCTG AGGTGAAGAT AGGATTTCTG 

67301 TTGCCAAGGG AAAGAACAGC CGATAGCTGC TGCTAATGTA TGGCTATAAC 

67351 ACGTCCCTTC TGCCTGTTCT TGATGTGAGG GATGTAGGCT GTGGAGGTGA 

67401 TGGTCCCCAT AGCCATACGC TAACACTGTG GATGTTGCAA AGGCCTCTTG 
67451 GAACCACGGA AGCTCAACAT AAAGTGAAGA GACTGTATTG TGAGCCGAGA

67501 CGTTGTTGCT TGATCCGATT TCTTTAGTGC GGGTGAAGAA CTGTGCAAAA 

67551 CCTAAGGAGA TTTTCTGATG TAAAGAAGTT TCGGAGGATG CTTGTAAGGA 

67601 ATACCCTGTA GATTGGATAC GGAATCCTGG AGCCCCGGGG ATGCTATTTT 

67651 GATGAACAAA GAGGCCGTCG GCAATCCCTT GAATTTCTAA GAAAGGCCTC 

67701 TCGATATCAG AATCACCAGT TCGATTATAA CTTCTTAATA GAGAGAACAT

67751 AGTATGAAAG GATTGCCATA GGGGAGTCGT AGCAAGATCT CCTTGGTATT 

67801 CAGGATTGAC CTTATATCCT AAGGGAGTCC AATTGGCATA CAGAGCTCTG 

67851 TAGAGGGTGT TTGCCGTCTC TATAGAAGCG TTATTTGTTG TTGTTATCGT 

67901 CTCTACCCAA TAAGGAGACC AGATGCCTTG ATAACCGTAA TGCTCAGTCG 

67951 CATTTAAGCT TTCAGGATGA AAGTTATCGG TATTGATATG ACGTGCTGTT 

68001 ACATCCGATA AAGAAAGAAG ATGAATGTTT TGTAAAGGCT CAGAGAGATC 

68051 TATACTGTCG TAGGGATCGC GGTTTTCCTC ATTTAAGAGT GTCAGAGGAC 

68101 CTGATAAAGT AATTGTAGGG TTATTGTCCT CTGTGAAAGG AGCACTAGAT

68151 TGTAGAGGAC GGATCCACAA GGTAGGAGCT TTTCCTTTTG CTAAGATCGA 

68201 GGGGAGGTTA ATCGCAAGGT TATTAATGAT GACCTGGGAG CCTACACTAG 

68251 TTGATGGAGT CTCAGAGTTG GCAGTTGTTG CAATACTCGC CGCATGCCCT 

68301 AATTTAAGGA TACCTCCTTT TTGAGTGAAC TTATAGAATT GCCATCCCGC 

68351 ACGATCCTCG ATAGAGAGGA CACCATTGCG AAGTTCAGAG GTATTTTTCG 

68401 AGCTGCTAAT GAAATTATTT TCGTAGTCAG AAGCTTCTGG GATATAGGCT 

68451 GAAGAAAATA AGATCGTTCC CTGATGGTTC GCATTGGGAT TAAAGATTAG 

68501 AGGATTTGTA GTTGGATGTT GGTGTTCTAT AGGATCAAAA AAAGCAGTCG 

68551 TATACCCCTT ATTAGCTCCA AGTTGTAAGT TGCTATTTGG TGTACAATGT 

68601 ATGGCGTTGT ATCTACCAAA TGTGGTGAGG AAAACCTCAT TATTTTGAAA 

68651 TGCGATATTT CCTTGTTCCG CGAAGAGTAG GCAGGTGCTG TCCTGTAGGA 

68701 GCATAAGAGC ACCTCCCCAG TTTCCTTGAT TGTTGGTGAA ATATACGTGG 

68751 CCACTATTTT TGATTGTCAA AAATTGTGTA CAGATAGCTC CGCCATCGCG 
68801 AATGCAGTAG TTATTATTGA AAAGAATAGT TCCAGGGTTA TCGTCTATGG

68851 ATAGGTTTGT TGTATAAATC GCCCCTCCTG AACCATTTCC TGAATTTATC 

68901 GAACCAGATA ACGCTGTGTT GTTATTGAAA ATCACCGACC CGGAGTTATT 

68951 TTTTATCGCA ACAGTAACGC TTGTTTGAAT GGCCCCGCCA TTGTTCCCAC 

69001 AGTTGTTCTT AAAATAAATA GGACGCGTGT TATCAGAGAT CGTTGTATTT 

69051 TCACTACGAA GCGCACCCCC TCCACTAGGG GCTGTATTGT TAAAAAAGAG 

69101 TAGAGGTGCC CTGTTGCTTT GGATGCGGCA GTGTCCATTG GTGGAGAGGG 

69151 CTCCTCCCCA GTTGTTGACG GAATTGTTGA CAAAGTAGAA AGTCCCTTGA 

69201 TTTTGAGAAA TCGTGAAGTC TCCATTACAG GCAATCGCAC CCCCACGAGT 

69251 TTCTCCTCCT GTACTCGCAT TGTTAAGACC TCGATTGCTG AAAAAAATAA 

69301 GGGGTCCTCT ATTCTTCGTG ATTGTGCAGG CTCCCTGGCA AGCAATCGCG 

69351 CCTCCAGTCC CAATCGCGAG ATTTTTACTG AAGAAGGCAT GGTCTTCAAC

69401 ATTTGATAAT AAGAAATTAT TACAGGACAC AGCTCCCCCA GCCGATGTCC 

69451 AAAGAAGAAG GATGTTATCA ATAGACTTGT AGTTAGAAAG TACAATGTCT 

69501 TGAGAGGAAT TATGTCTATT TCCAACAAAC GTAGTTATTG GAGAAAATCC 

69551 TGTAAGAGTG GAGAGAGAGT CTAAGAGAGG AAAGCTCGTA CGAAACTCTT 

69601 CATCCCTCTC TAAAGCAAAC TTTTCAAGGG AGTCCGTTTG TAAACTATAC 

69651 ACTGCAGGAG TCATCCCGAA CATGCAGGCT GTGAAATTCC CGAGATAGAA 

69701 TAAAAACTTA GGAGGAGTCT TTGACACTAG AGGTTCCTTA ATCTTTCTGT 

69751 TTTGGTCACT TATTGTTAAA ATCTCATTCT ACTCGCCACG TTTAAGTAGT 

69801 GACTCAGCGT GGAGGAAGAA ATATCCGCAG AGTAATCTAA GGAGAGAGTG 

69851 ACTTTAGGAA ACACCTGCAT GGTATTTTTC ACTTTGATCC CTAAAGCATT 

69901 GTAGGTCACA GGAGTGGCCT GCGTCGTCCA CGTACCTTGG CTAATCAGTA 

69951 ATTTCGAGTG GAGTTCAGGA TCTTGCCTAT AGAGAGTAGA GCGATAGGAA 

70001 ATTTCTGTGA GQCAGACTAG GGGAACTCGG TGGTGGTTCT TCCAAGAAGC 

70051 GCGGATTCCT ACAGGGAGGG AGACGTCCGT TAGGGGGCGG TGTAGGGAAA 

70101 ATTCCCGAGC ATGGTCTCCA GA7TCTTGAA ACGCAGCAAG ATTTCCTCGG 
70151 ATGGCTAAGG CAGTAATAAA GGGATAGATC TGCAGGGACT CGCCGTGAGG

70201 TTGAGGTAAG AAAACACAGG AGAGAGCCCC TGCTAAGGTA TGGTTGTGGA 

70251 AAGATCCCTG AGAGTTCCCT TCCAGGAGAC CCTGATACAT TGTATGGGTA 

70301 TGTTCCGAGG TAAACATATA AGCAAGAGAC ACAGATAGAC GTATCCACTC 

70351 TTTGAAGAGA GTATTTTCTA TGCACATTCC AGAGAAATAG TGGTGAGAGG

70401 ACGTGCTATT TTGAGATTCA TGTTCTTTAG CTTTGGAGAA GAACTGAGCA 

70451 AATCCTAAAG AGAAATTCGG ACTTTGAGAA GAGGTTGCTT CGGTGGTAGC 

70501 ACTATAACCT GTCATATGAC TACGAAATCC CTTAAAACCG TTTTTGTCTT 

70551 TTTGATGAAC CAGAAGACCA ATGCCTTGTA GGGAAGCTGC ATGACCCTTC 

70601 TCTTCATCCC AGGAGGAGAG GGAGTGGAGT CCTGCAAGAG CCGTATATGC

70651 CGATTGCCAC AAGGCATTCG TAATGAATTC TCCTCGACGT TCGGGATGAG 

70701 GACGGTAGCC TAGAGGAGAC CAGTTTGCAT AGAGCAGCTT GTGTTTTGTA 

70751 TTCGCGCCTA GTAGAGATGT AGGGTTCGTG ATTGTTGTAG TTTCTACCCA 

70801 ATAGGTCGAC CAGATGCCTT GATACCCATA GTGTTCGCCA GAATTTAATG 

70851 TGGATAGATC CAGTTGCGAA GAGTTAATTT TTTGTGCAGC GACATCGACA

70901 ATATAAAGAA GGGGAACTTT CTCAAGAGAG TGCGAGAGAT CCAGACTATC 

70951 GTAGGGATCT TCGTTGTTGC TGTTGCGTAA GGTGAGAGTT CCTGAGATTG 

71001 TGATTGTCGG GTTGGAATCT TCAGTATAGG TAGATCCTGT TTTTGTGGGG 

71051 TAAATCCAAA TTTTTGGAGC CTGAGCTTGA AAAGAAAGAA TAGAAGGAAG 

71101 GTCAATGGCA ATGTGATTTA AAGTTATAGT ACTTCCTACT GTCGTTGGTG 
71151 TTGAGGATGG TGTGGGAATC GTTCCTGCTG TCGTGATCAC CGCACCTTGA 
71201 CCTAGAAGTA GAGTGCCTCC TCGTTGGAAG AACTTATAGC AGGCCAGCCC 
71251 CGCACCATCT TCAACAGCAA GGACTCCTTG ACGTAGTTCC GAAGTGTTCC 
71301 TTAAATAGGA AAAGAAATTC ATTTCATCGG TAAAGTTCTG GTGTACATGT 
71351 TCCCCTGAAA ATAAAACTGT ACCTGTATGA CCGGTTTCGA AATTAAAGAG 
71401 TATGGGGAAG GAGGAAGGGA GCTCATGTTC TATGGGATCA TAGAACAGCA 
71451 CTCGATAGCC GGGACGGGCT CCTATTTGCA GATTCATATT AGGAGTCGAG
71501 TGAATGGCGT TTCTGTATGG AGGATTGAGG GCATGCTTGG AGGCCGTATT

71551 ATTGTTAAAG ATAATATCTC CATTATCTGC AGATAAGATG AAGCTTCCGT 

71601 TTCCAGAACC TGCTGATAAG TTGAGGAGAG CCCCTCCCCG AGTTGCAGTG 

71651 TTATTTAAAA AGTATACAGG ACCATTTTCT TTGATAATGA TAGATTTCGC 

71701 ATGAATGGCT CCACCGTTGC TCTGGCTTTG GTTATTGTTA AAGAGTACCC 

71751 CTTCTCGGTT GTTCAATATC GTGCAAAAGG TGGTAGTAAG ACCTCCTCCT 

71801 CCTGGATTGA AGTTCGATCC ATAGTTATTT GCGAACGCGG AATTTTCACT 

71851 GATTTCTATG AGTTTTTTAT TCGAGGAGAT CGTGAGTGTT TGGGTAGAAA 

71901 ATATGCCTCC CCCAGATCCT GAAGAGTTGC TTGTGATCTG TATAGCTCCA 

71951 GAATTTCCCT CTATATTTAG AGAGTTCCCA CTATAAATCC CTCCTCCTAA

72001 ACTGTCCGAA TTTAGTGCCC GATTCTGCTT GATTATGATC GGGCCTTTAT 

72051 TGTCTTTAAT AGATAAGTTC GTCTCAGTAT AGAGGGCACC CCCCTTATTT 

72101 AAAGCGAGAT TGTCAACGAA AGTTCCCTGT CCTAGGTTAT TAGTAATAGA 

72151 GCAATTTATG GCAAAGAGAG CTCCACCCAA TAGTGATCCC GCAGTGGCTG 

72201 TAGGATTGTC AGAGACCAAG TTTGTAGTAA ATGCATAGTT CTGATTCTTG 

72251 GAGATCGTGC AATTTTGAGC AGCATAAATT GCCCCGCCAG AATTGGGACA 

72301 GACATTCTTC TCAAAGAAGA CATTCCCTAT ATTTTCAGAG ATCAGAAGAT 

72351 TCTTACAGGT AAGAGCACCT CCATTCGACC GATAGTACTT ATAGTCCAAG 

72401 ATGAAATCAT TGTGATTCCC GACAATTGCG AGATCTTGAT TTTGGTTATG

72451 AGTAAACCCT ACTTGAGGGG CTGCTTGATA TTCAGGACTT AATGTAATAT 

72501 AGGTCTCCAA AGGAAGTTGG AGACCTTCAT TAGCCAATAC AAAAGTAAAA 

72551 GGAAGCAACA TTCCGAAGCA AAAAAAGCGC ATACTCGTTC ACAAATAGAA 

72601 AGAAATATTC TGAATAAATC AATGCTAGAT TTTTTGTCTA TAATTTGTTT 

72651 TATAGTGTCT AATTATGAGT AGTTTAAAGT TATCTTATTT AAGACAAGAC 

72701 GATTTTAATT TCCTTTTTTC ACAATAAAAT GCGAGTCCAA GAGTTTTAGT 

72751 AAACGTACAT CATACTTTGG AAAATGGCGG CAAGTAGTCA GAGGATCAAG 

72801 CTCTTTGCAT ACTTTGCTTA GAGCAGTTTT TGGTTCTCAG GGTCGTTTGC 
72851 CTTTAGGGAG CTTGTGATGT TTGGGATCGG AGAGCCTCCT TATATCCCTA

72901 GGGGATGCCC TAGGAACTCT TCCGAAACAC CGAGGGTCTG TTAGAGATAA 

72951 AAACAAAGGA CCATCGGGGA GACTTGTACT CATAAGAGCC ACTTATCTCT 

73001 ATTCCTATAG ATTTTTGCTG ATTTTGATTA TTAGAAAATA ATAGATTTTT 

73051 TCTGTTTTTT AAAGTAAAGT ATTTTTTAAA AGACTCATTT TTAATGAGTT 

73101 ATTACTTTTC TCTTTGGTAT CTGAAGGTGC AACAGCACTT TCAAGCAGCA 

73151 TTTGATTTTA CTCGCTCCCT GTGTTCACGA ATTTCTAATT TTGCTTTGGG 

73201 AGTGATTGCA TTGCTTCCTA TTATTGGGCA GTTGTATGTA GGGCTGGACT 

73251 GGCTCCTCTC TAGGATAAAA AAGCCAGAAT TTCCTTCCGA TGTGGATCAG 

73301 ATCGTGCGAG TAGAACACGT CGTGGGTCAC GACCATAGAA GTCGAGTTGA 

73351 AGATATTCTA AAGAGACAAA GGCTCTCATT AGAGCCTAGA GACGAGGGGA 

73401 AGGTTCACGG AGATCTGCCT TCAGCTCCTT TTTTTTGATA TCCAAAGTCT 

73451 CAAGTTCCTA CAGTTGTTCT CTGAGGGGAC AGCTCTAAAT TTATTTCGTA

73501 TATTTGCTCC ACTACGCAAC CGTGTGACTA CAGAATACAG TCGTGCTAGG 

73551 CAACCCGACC TACATAGAAT TGCCATCGTC TATATAGGAG TTCTCGATTC

73601 AGAAAGTTCC AAGATCCTAG AGCGGCTAAT CTCTTATATG AGTTGTATCT 

73651 ATTCTGAATC GCAAATGTAT TTAAGATTCT TTATGGGCAA GAATGTAAAT 

73701 CAAAGTGCTG TACTCTCAAA ATTACATGTA GAAAATCTGC ACATCCGTTG 

73751 TGGGTTTTTC AGCGAGGATG CTGTTCCAGA GAGTGAGCCC TTCGATCTCT 

73801 CCATCTACGT GCACACAGAT CGTAGCTGTC CTCTCCCTAC GAAAAAACGG 

73851 AGCAGCTCCT GGGAACTCCA AACTGTAGAA CTCCCAGAGT CAATATATCC 

73901 ACAGTCGGAA TTCCTATTGA TGAGACCTCG AATGCTTTCG TAGACTCTAT
73951 GATGAAACAA GGAGTCGGGC AGGATGCTAA AGAGCTATAC ACATTTCTAT 

74001 CTCGTGGGAA TGAGCATTAC CAACCGTGTC TATGGTTCAG TCTCGAAGAG
74051 GAACTCGGAT TCCTTTTCGA TGAAAAAATG CTCTGCGCCC CTCTATCTGA 
74101 GGATCACTAT TGCCACTCGT ATCTTGTAGA TCTAGTGGAT CAACATTTAA 
74151 AGGATTTAAT ATTATCGATG TTTTTAGATC CTCAGAATAT CTCAGCAGGA 
74201 GAACTCCTCA AGGTCTCTAT AAACGTTGGA GATTCTTTTT CTCCTCTACA

74251 ACAGAAAGAT TTCCTCTCGA TGGTCTTACG TGATGAAACG GGAAAAAACG 

74301 TCGTCGTGGT TTTTAAAGGA GTTCTCTCCT TACCCGCAAC CCAAGTCTGC 

74351 AAATTAGTAG AGGAATTGAA CTCTAAGGAC TACTCCTACC TCAATATATT

74401 TTCTTGTCAC GGAGATAGTA GTCCTCAGCT TTTATTCCGT AAGGAATTAG 

74451 AGGGAACTTC AGGGCGTTAT TTTACAGTGA TTTGCGCTTT ATATCTAGGG 

74501 GATACAGACA TGCGTAGTTT ACAACTTGCT TCTGAAAGGA TCATGGTCTC

74551 TAGAGAGTTT GATCTTGTAG ATGCCTATGC TGCAAGATGC AAGCTCTTGA

74601 AAATCGATCA TACAAATTGG AGACCTGGAA CTTTCAGTCG CCACGCCGAT

74651 TTCGCAGATG CTGTAGACGT ATCAGCAGGA TTTAACTCAA GAGAATTTAA 

74701 ACTGATTACG CAGGCGAATC AAGGGATCCT AGAGTCTGGA GAACTCCCGC 

74751 TCCCTTCAAA AACCTTCTGG GAAGGATTCT TAGCATTCTG TGATCGAGTG 

74801 ACTGTCACGA GACACTTCAT TCCAATGTTA GACGCCGCTA TAAAGCAAGC 

74851 GGTATGGACT CATAAACATC CCAGCTTGAT AGATAAAGAG TGTGAAGCCC 

74901 TAGACTTGAA AACACAGTGC TTGCCATCTA TCGTATCGTA CCTTGAATAT 

74951 GTCACAAACT CTCACGAAAA AACATCGAAA GGCCCGTTCA TACAAAAAGA 

75001 GATTATCGCA GACTGTTCTC CTCTTAAAGA GGCGCTCTTC CCAGGTTCTG 

75051 ATGAAGATGT TCCCTCTACC TCTGAGGATC CTTCAGATGA TCATCCTTCG 

75101 GATCTTGAAG ACTCTTAATT AGTTGCGATA GAATTCAATT TTTTATATAA 

75151 AAACTATCGT GTTGTTCTTA TTAAAAGATA GTTAATTTTC TATCTTTTTT

75201 TAAATCTTTA TATAGCCTGC GTACGCTTTC ATTTTCAATG TTGGTTTGAT 

75251 CCTATGGCAT GCTATATTTC TATTTGGATA TCTACAGTTA AGCAGCATTT 

75301 TATTAGGGCT TTTGATTTTA CACGTCCTCT TGGTTCTCGG ATTACAAATT

75351 TTGCTTTGGG GGTCATCAAG GCTATTCCCA TTTTAGGATG CGTTGTTATA 

75401 GGGGTAAGTT GGCTAGTTTC CACATGTTCT GCACGAAGGT TTGGGAAACC 

75451 GGCATTTACT TCTGACGTTG CTAGTATCGT GAAAATAGAA AAAACTCGAG 

75501 GTTATAATCC CCTTGCTTGG GTGGAACAGT ACTTGAGACA GCTTAGGGTT 
75551 CGACTTCCTG AAGGAGATTT AGGAAAAATC CATGGGAAGG TCTCCAGAGA

75601 TTATGTTTGC GACAGGACTC CCCAAGAAAA TCTGAATATG GTTCCTCATC

75651 AATATCTGGG AGAGCTAGGT CGCGCGTTTT ATGGAATCCG CAACCGAGTA

75701 ACCAAGGCGT ATCAACGAGT CACTCCTCTG GAAGTCCCTT GTCTTACGCT

75751 CGTCGGTTTT GACATTTTAG ATCCCGAAGA TCAGGTGAAT TTCGTTCGTC 

75801 TGGCTAACGG CATACAAACT CAGTACCCCC AAACTCAAAT AAAACTTTAT

75851 TTAATCTCTA TCCAAAAGAT ATGGAATCAG TGTGACGGTA CGATTTCTCA 

75901 AGAAAAAGAA CAGCAACTCC GCTCTCTAGG TTTGGATGCT AAAATCAAAT 

75951 GTGTGTCGGC CCCCGCTCTC CTGCTCCAGA AATATCTTCA ATCCGAGAAC

76001 TTGCCTTCCT GTGATCTTCT CATTAATTAT TACGGGAAAC AACAGTCCGT

76051 CAGAGACGTG GACTCTATAA AGAGTCTACT CAATCTTTCT TCCGAACATA 

76101 TCCCTGCGAT TTCTGTAACC TATAGACCTG ACGATCCTTT TTATAGCTAC 

76151 TATTTCTTTC CTGGTTCTCA AGGAGGAACG GCACCCGATC AGAGGATCCC 

76201 TTGGAGTGAG CAGGAGCATC TTCAAACGTA TACCACCCTG TCTAACCCTA 

76251 GATGTGATAG ATATGCTGTT CACTTGGGAA TGGAAGATTT TGCCTCTGGA 

76301 GTATTTTTAG ATCCTCTTAG GGTTTCGGCT CCTTTATCTG GAGAGTATTC 

76351 CTGCCCCTCA TACCTCTTAG ATTTAAAAAG TGAAGAGCTT CGTTGTTTCT

76401 TGTTATCCGC TTTTATAGAT CCCAACAATT CTGGTCAGGG AAATCCGCGT

76451 CCTATGTCCA TAAACTTTGG AAACTCTCCT TTGGGTCAGA GGTGGTCTGA 

76501 GTTTCTATCT CGTGTTCTAC ATGATGAAAC AGAAAAGCAT GTGGCTGTAG 

76551 TCTGCAATAA TCCACAACTT ATAAAAAAGA GTTTTCCCTC ACATTCTTTA

76601 TCTCTATTAG AGAACGAACT GGAAGAGTCA GGTTATTCTT ATTTGAATAT

76651 CGTTTCAGTG AGTCAGGAAC GCACGTGTGT TAAGGAACGT AGAATTTTAA

76701 GTTCTGATCC TTCGGGGAGG TCATTCACTG TAATCCTCAC TGATCTTCCT 

76751 GAAGGGAGTT CGGATATCCG CAACTTGCAG CTAGCGTCAG ATAGGATCTT 

76801 AGTTTCTAGT GCTCTCGATG CTGCTGATGC CTGTGCTTCT GAATGTAAGA 

76851 TCTTAGAATA TGAGGATCCC GAGCAAGAGT GGGCGCAACA GTATGCGTCG 
76901 TTCTATAGAA ACATCGACAG GGCAGGCGAT CTTCAACGTC AGGGGATTCC

76951 AGGAGAGCCT TTAGGGGTCT CAGCATCTAC GAGAGTAGTT TTAGAAAAGG

77001 ACATCGTATT CAATCTCAAT GCGGTAATCC AACAGGCCAT GTGGAAGTTT

77051 AAAAAACGGG ATCTTTTTGC TGTAGAAAGT CAGGCTTTAG GAGATGACAT

77101 GCGACGTGCT TTAGAAGGTT ATATCGGCAG CAGTCTCTTA GTTGAGGGGA

77151 CTATACAGCC TCAAGTCGCA TGTAATGTCA ATGTGAGTTT TGCTACGTTA

77201 GACGAGGCTG TGTGTGCAGC TTGTGACTCA GCTCAAGATG CACCTTCTGA

77251 GGAGAACAAT ACAGATGACT AAAGATCGCA ATCTTGTGAA CGAAATCGCA

77301 GATTGATGGG AACTAATTAG ACACACCTTT CTAAGGTGTT TGTTTTGATG

77351 AACCTTTTTA TTAGTCCAGC AGAGCTCTTT TTTGAAGATT CTTCTTTTTT


77401




TCTGGGTTTT 


TTGAAGGTAT 


CGAGGGTTCT 


TATTGTCTAG 


77451 TTGTCTATAG AGGGTATCGA GGTTTTTTCT CTTAGGTATC CCACGATTCT

77501 TTTGTATAGA AAAATTTTAT GAAAGCTTGA ACTCTTTACA CTGACTTTTT

77551 ATTTTTCAAA TAAAAACGTT TTTAAAAATA TTATTATCAT AATTAGATAC

77601 TTATTTGTTT TAATGTCTTA TTTGATTAAA ATAACTTTGT TAAAATTTTT

77651 ATACATAAAT TTCTATTGTG GCTTGTCCAA GTATTTCTTC TTGGTTTACT

77701 GTCGTTCGAC AGCATTTTGT AAACGCCTTT GATTTCACCC ATCCCGTTTG

77751 TTCTCGGATT ACAAATTTTG CTTTGGGGAT CATTAAGGCA ATTCCCGTAT

77801 TAGGACACAT TGTCATGGGA ATCGAGTGGT TGATTTCCTG GATTCCCAGA

77851 CACACCGTTC GTCATGGAAT GTTTACTTCT GATGTCTCTA GTGCTATTAA

77901 AGTAGAACAA ACACGGGGTC ATAATTGTTT AGCTCCCCTA GAAGCCTATT

77951 TAAGTAGCTT GAGAGTCCCC ATTTCCCAAG AAGATCTAGG CAAAGTACAC

78001 GGGAGAACCC CAGAAGATCC CTTCGTAGAT ATCACACCCA CAGAAATTGT

78051 CCAACTTCTC CCTGATGAAG AACTCTCTAC TGTAGATGAG GCACTGCAAG

78101 GCGTTCGTAG TAGGTTAACC TATGCCTATA GGTCCGTAGA GAAACCTATG

78151 ATTCAAGATC TTGCTCTTGT GGGTTTTGGT CTCCGAGATT CTGCGGACCT

78201 CATAAATTTC GTGCGTCTTG CTAATGGCGT GCAGAATCAC TATCCCCATA

78251 CTAAAGTGAA GCTCTATTTA GCGAAGAACT TGGCAGATGT CTGGGACTGT

78301 GAAATTTCTG AAGAGGAAAA AGGGCAACTC CGAGCTCTAG GTTTAGACCC 

78351 TAAAATAGAG AGTATATCCC TTACGAGTGC AGGTCTTCCT TCAGTGCCAG 

78401 AAGTCGCTAC TGTCGATTTT ATGATTACCT GTTACGGGAA AGATCAGGAA 

78451 GTCCAAGATC CCTAGGTGAT ACAACATCTT CTAAACTTTG CTCTAGAAGA 

78501 GACCCCTTCC ATTTCCGTGC AATACCAAGA ACAAGAGAAG CTCTCTCCGT 

78551 GCGATCATTC CCCAGAAATA GGTAAAAAGA AAAGATGGAA TAAGCTGGAA 

78601 TCCTTCTCCA CGTATTGTTC TCTGTTTATG TCTGTTAAGG ATCATTATAA 

78651 GCTGAATCTA GGAATTCAGA ATTCCCTGTC AGGGTGGCTT CTGGATCCCT 

78701 ATAGGGTTTG CGCGCCTTTA TCTTCACCGT ACTCGTGTCC TTCCTATCTT 

78751 TTAGATTTGC AAAACAAAGA GCTACGTCGT TCCCTTCTGT CAACGTTTCT 

78801 AGACCCTAAA AATCTCACTA GCGAAACATT CCGTTCTGTC TCTATAAACT 

78851 TTGGCAACTC TTCGTTTGGA CAGAGATGGT CAGAGTTTCT ATCTCGTGTT 

78901 CTGCACGACG AGAAAGAAAA GCACGTAGCT GTTGTTTGTA ATGATGCAAA 

78951 ACTTCTGGAA GAAGGATTGT CCCCAGAGGC ATTGTCTCTA TTAGAAGAAG 

79001 ACTTAAGAGA ATCAGGGTAT TCGTATCTAA ACATTCTCTC GGTGAGCCCC 

79051 GAAGGAGTCT CCAAGGTTCA GGAACGTCAG ATTCTAAGGC GAGATCTCCA 

79101 AGGACGGTCC TTTACTGTCA TGATTACAGA TCTTCCTTTA GGTAGCGAAG 

79151 ATATCCGTAG TTTACAATTA GCCTCGGATA GGATTTTAGT CTCCAGTTCT 

79201 CTTGATGCCG CGGATGCATG TGCTTCGGGA TGTAAAGTCT TAGTCTACGA 

79251 AAATCCAAAT GCATCCTGGG CTCAGGAATT GGAGAACTTC TACAAACAAG 

79301 TTGAGAGAAG AAGGTAGTGT TTCTTTCAGA GAATATTTCA GAGCCTATAT 

79351 GTGTGATAAA 7VTCGTGGCAC AGAAGAACTT CTTATTTACT TTAGACGCTG 

79401 TAATTAAACA GGCCGGTTGG AGATCACAAG AGAAACTCAA TTTATTTTAT 

79451 GTTGAAAGTC AGGCTTTAGG AAGAGAAATC AAAGTCAGCT TAGAGGAATA 

79501 TATTCAGAGT ATGGTCGGGA TTTTGGGATC TCAGAGAACC AAGAAAAGCT 

79551 TTAAGTTTTC TGTCGACTTT ACCCCTTTAG AGCAGGCTCT ACAAGAAAGA 
79601 TGCTCTTCTG ATGATGACGA AGATGCAACA GCAACTTCGA CCGCTACAGG

79651 GGCAACAGCA TCTCCGACTG ACATGCACGA AGATGAGTAA CGTTTGTCTG

79701 ATACCTTAAA AGTTCCTTGC AAAGGGCTCC CTGAAAACTA AATTCCCTCA

79751 GAATCTCGAA TTCTCCTGAC TCTGAAACAA TCTTAGGTTT TCCTGAATAG

79801 AATCTGACTG AAATTTCTGC TCGAATCTAA GGGCTGTTTC TTATTTTACC

79851 TCTAGATGAG GATATTAAAT CCAAGCTAGG ACTTCAAAAG TAGTTGGTTA




79901 TTAfiTTTATT AAAGAAAATA ATACTAAAAA TATTTAAAAG CTGTTTATTC

79951 * » TTTA ATTfi ATATTTTCTA TGTTGTTATT TAAAATTGTT TGTTTCTAAT

80001 TTTATTTTTT TTGTTGTTAT GCCAATTCCC TATATTTCTT CTTGGATTTC

80051 TAPPP.TTPGA CAGCATTTTG TTAAGGCGTT TGATTTCTCT CGTCCCTTTT

80101 uJ 1L1 nOOU x TAPGAATTTT GCTTTAGGGG TCATCAAGGC CATCCCTATT

80151 r'TAfSP, HP ATA TTGTCATGGG GATGGAGTGG TTAGTTTCTT CCTGTGTTGC

80201 P^r^n ATT ATT Louonl xnx x ACTAGGTCCT CCTTTACCTC AGATGTCGTT CAGATTGTAA

80251 AGAPTGAGAA GGCGTTAGGT CGAGATCATA TATCTCGAGT GGCGGAGATA

80301 TTGCAAAGAG AAAGGGGGAC CATAACTCCT GAGAATCAAG ATAAGGTGCA

80351 TGGGAAGTTT 1 VVjVJ/ulw AAA CCTGTCTGTC CTTTTGGTCG TTTAAAATCC GAGGAAACTT

80401 TAAAACTTAA GCCGGGAGAA AGAGAGGGAA CTTTAGATAC TGTATTTTCT

80451 PPGATTCGCA CGCGCGTGAC TCGTGCGTAC TTACAGGCCC CCCGACCCGA

80501 AATACGTACG ATTTCTATTG TGGGTTCGAA ACTTAAAACT CCTCAAGATT

80551 TPTPP.PA ATT TGTGAGTCTC GCGAATGAAA CGCAGAGACT GCATCCTGAA

80601 P.PPTTAGTTT X x tvo XXX GTCTGTATTT GACAGGCTTG AATCGCGAAT CTCAGATGTG

80651 PPATAPA APT APTGCAGAGA AGAAGCAGTA CCTACATAAC TCAGGTCTCG

80701 ACTCTAGAAT CCAGTGCAAA GACAGTAAAG AAGACGACGC fp r* r* f* fv r* /"* /"* *P

80751 GAAAATCCCG AACTTTGGAT TGGCTATTAT TCACGAGAGC AACAGCATAA

80801 TATAGACGGG CAGTATATTC AGCAGTGTCT AGGGAAGAGT GCAGATCCAA

80851 TTCCTTGGAT TCATGTTACT GAAGACACAA AGGATTTTTA TTACCCACCA

80901 AACTTTACTT CATACTCACA TACAAGACAA TCTACAGACC CAACATCGCC

80951 ACCAAGACTC CCTGAAAGTG AGGGGGATAA GGATTCCTTG TACGGACAAC

81001 TGAGTCGATC GTATCACCAT GAGTATATGC TTGGTTTGGG ATTAAAACCA 

81051 GAGGATGCAG GACTCCTGAT GGACCCGGAT AGAATCTATG CTCCTCTATC 

81101 CCAAGGGCAT TATTGTCATT CCTACCTTGC GGATATAGAA AATGAGGATC 

81151 TACGAACTTT AGTCCTTTCG CCTTTCCTAG ATCCTGGCAA TCTTAGTAGC 

81201 GAGGATCTTC GTCCTGTAGC ATTCAATATC GCTAGATTGC CATTAGAATT 

81251 GGACTCGTTA TTTTTCCGCC TTGTTGCGGG TCAGCAAGAA GGGAGAAACA 

81301 TAGTTACCCT TGCCCACGGA ACTCCTCGTC CAGAAGATCT TGATCCTGAC 

81351 TCAATGAACA TTCTGACCAG AAGATTACAA ATGTCTGGAT ATAGCTATTT 

81401 GAACATTTTC TCCTATAAAT CACGGAAAAT GATTGTAAAA GAACGTCAGT 

81451 TCTTTGGAGA TCGTTCTGAA GGGAAGTCTT TCACATTGAT CTTATTTGAG 

81501 GATCCCATTA GTGCAGCAGA TTTCCGTTGT TTGCAGCTAG CTGCAGAAGG 

81551 TATGGTTGCT AAGGATCTCC CCAGCGTAGC AGATATTTGT GCCTCTGGAT 

81601 GTTCCTGCAT TCAGTTTTCT GAGATGCAGA GTCCTCAGGC TATTGAATAT 

81651 AGACAATGGG AGGCACGTGT CGAAGATGAA GCAGGAGAAG AAGCCAGAGA 

81701 ACCAGTAATT TATTCTCAGG ATCAATTGAG CAGCATGCTC ACTACACAAC 

81751 AGAATTTTGT ATTTTCTCTA GATGCTGTGG TAAAACAGGC GATCTGGAGA 

81801 TTCCGTTCGA AAGGTCTTCT TACTATGGAA AGAAAGGCAC TAGGCGAGGA 

81851 GTTCTTAACT GCGATATTTT CCTATTTAGG GAGTCAGGAG CGTAATGAGA 

81901 ATATGGGGAA AAGAACTACC GAAGAACATG AGGTCGTTAT CAGCTTCGAA 

81951 GAGCTAGATC GCATGGTGCA AGTCCTCCCA GCCGAAGTCC CTGCAGATTC 

82001 AGGCAATGAT CCTACGCGTC CCGTTCCTAA TCCAGATAGT AACCCTGATT 

82051 CCTCGCAAAA TGAAGGCAGT TAGAAAGTAA AAATACTAGA GAAATTTCTT 

82101 ATCTCCAGAT GGAATCCGTG GTCCATGTAC CTAGGATTCC AGGAAGGGTT 

82151 TGCCTAGGAA TATCTTAATT TCACAAACCC CAQGAGTGCA CAAACTCCTT 

82201 AGAGGACTCT CTCTATATTT GTTCTTTCTA CACTAGTATT CCGTAGATTT 

82251 CTGATTTCCA GGATAGGATT CTAAATAGTT GTATCTAAGC GCTTTTTACA 
82301 ACCTTTCTCA GGTCTCCTCT TTCTAATTTT AAAAATAGCG AATTTCTTGT

82351 GGCTATAGGA GTCGCTAAGT ATCTTAGAGA TTGCTTTTTT CAAGTTTTTT

82401 TTAAATACTT CTTTCCTAAG TATTTCTTCC TAGTCGTGTT ATGGCTTCTT 

82451 GTTTATCTGC CTGGTTTTCT ATAGTTCGTG AGCACTTTTA TCGAGCCTTT 

82501 GATTTTTCTT TGCCGTTTTG TGCTCGTATT ACGGAATTTG TATTAGGGGT 

82551 CATCAAGGGG ATCCCTGTTG TGGGTCACAT TATTGTTGGG ATAGAGTGGC 

82601 TCGTTTCTAG GTATTTAGAG AGTTTCGTGA CCAAGCCGAC ATTTGTCTCT 

82651 GATGTGGTGA GTCTTCTGAA AACAGAGAAA GTTGCTGGTC GCGATCACAT 

82701 TGCTCGTGTA GTGGAGACTT TGAAGAGGCA GAGAGTCGCT GTGGCTCCTG 

82751 AAGATGAGGA TAAGGTCCAT GGGAAGATTC CTGTGCATCC TTTCGGGGGA 

82801 ATCCAACCTG TAGAAGTTCT CACTCTCTAT CCCGAAGTTC AAGATGCAAC 

82851 GTTAGGGCTT GCCTTCTCTA AAATTCGTAA TCGTGTAAGA CAGGCGTATT 

82901 TGCAAGCTCC ACGGCCAAAA CTGCAGAAGA TTTACATCAT AGGAAACGAT 

82951 ATGAATCCTT TTGAAGTTGA CGACTTCTTG CATCTAGCCC GTCTCTGTAA 

83001 TGAAACTCAA AGACTCTATC CTGACGCTAC GATTTCTCTA TATCTAACAG 

83051 CTTCTGGTGG TCGCAATGCT ATGGACAAAA AGAATCGGAA GTTACTTAGT

83101 GATTGCGAAC TAAACCCCAA GATTGCTTGT TTGGACTTTA ATCAGGGTGA 

83151 TGTAGTCAAA CAAGCAACTT GTGACTGTTG GATGGTGTAT CATGGGGAGA 

83201 ATGATCAAGG TACGTTGAAT CAGATTCAGG AAGAGTTAGA AAAGTCAGGG 

83251 GAGGAAACCC CTTGGATTCA TGTGGGGCAA AAGCCTCTTT CACAATCCTT 

83301 GTGGGATTTC TCTCCATTTT CATCTTTGGA GATGAAGGGA GATAAAGAGA

83351 AAGCTCTAGA GTACTCTGAA TTAGAAAAAG AACAGCTATA TTCTCGATTG

83401 GTATACGTAG GAGAGCGCTC TTCGGTTCTT AGTTTGGGGT TTGGAGATAG 

83451 TCGGTCAGGG ATCTTGATGG ACCCAAAACG GGTGCATGCT CCCTTATCTG 

83501 AAGGGCATTA TTGTCATTCC TACCTTGCAG ACTTAGAAAA TCCCGGGTTA

83551 CAAAAAACAA TTTTAGCGGC ATTTCTGAAT CCTAAGGAGT TGAGCAGTAC

83601 CATACTGCAA CCTATATCTC TAAATCTTAT CTTAAATAGC AAAACTTACT

83651 TAAGGCAGCA CTTTGGCTTT TTTGAGAGGA TGAGCAGAAG TGATCGCAAT

83701 GTGGTTGTCG TTGTATGTGA TTCTTGGTGG GGTACCGACT GGAAGGAGGA 

83751 GCCAAGCTTC CAACACTTTA TTATGGAGCT AGAGTGTCGA GGGTATTCGC 

83801 ACTTCAATAT TTTTGCCTTT AGATCTAATA GCATGTGTGT AGAAGAACGT 

83851 AGGATCTTAA ATGAAAGTTC TCAAGAGAAA GCCTTTACCA TGATTTTCTG 

83901 TGAGGATTCA GTATCTCAAG GAGATATCCG CTGTTTGCAT TTGGCGTCTG 

83951 AAGGAATGCT TTGTGGTAAA GAGTGCTATG CTGTCGATGT CTATACGTCA 

84001 GGATGCGCGA ACTTTATGAT GGAAGAAGTC TTAACTTTGG AGCGAGAATC 

84051 TAATCTGTGG AATAGAAAGC ATGGTCTTTG GAAAAGAGAA GTTAGAAAAC 

84101 AGAAACAAGA AGCTGCTTTG GATCAAGACG AGAGCGAGAT TTACGTTTGT 

84151 AATCAGCTGA CGGCGCAACA GAACTTCGCT TGTTCTTGAG ATGCTGCAAT 

84201 CCGCCAGTCT ATATGGAGAT CCCGTATGCC AGAACTTCTC TCTATTGAGA 

84251 GACGGGCGTT AGGGGAACAA CTCTTTACTA CTGTACATCA CTACCTAACA 

84301 ACGCAAAAAA AGATCCTCAG GGGAATCTAG AAACGCAGCA ATCCGCGCAA

84351 TTGTCTATAG ATTTCACAGC ATTAGATGAA GCTGTTGAAT CTCTAGGATC
84401 GACTCTTAGC AGAGCTCCTT CAGAAATATC TCCAATTCCA GAGGAGGAAG

84451 CTCACTTAGG AGCCAACAAA TAGAGACAAA GAAAATTCGA CGGTTTGAGG
84501 ATAACGATAC GCAATGTCTA AGCTTTGAAT CAGGATACTC TGCTTTACAG
84551 GCGTGTCTTT GTGCCTATGG TCTCTCTCTC ATAACAGAGT CTCTCAAATC 
84601 TAATTGTCAG AACCTATTTC CCCTAAGAAT CGATAACGTA TTTGTTAGGA 
84651 AAACGTGCTC AACAAGAGCG TTTTATTTTC AGTGTACTTG ATGTCTAATA
84701 CAAATTGTTT ATTTTGTATT TTTGGCACAT CATTTAAATT CCTTGTACTT 
84751 TTGAATCTAA ACGTAAATTT CTTATGACTC ATTGCTTACA TGGTTGGTTT 
84801 TCTGTAGTTC GTCATCACTT TGTGCAGGCG TTTAATTTCT CACGTCCTTT 
84851 ATATTCTCGA ATTACCCACT TCGCTTTAGG GGTGATTAAG GCCATCCCCA 
84901 TTGTAGGGCA TCTTGTTATG GGAGTCGATT GGTTGATCTC TCATTGCTTC 
84951 GAGAGGGGAG TCTCACACCC TGGGTTCCCT TCAGATATTG CTCCTATACT 
85001 GAAAGTAGAA AAGATCGCGG GCCGAGATCA TATTTCTAGA ATCGAAAATC

85051 AGCTAAAGAG CCTTAGGAAA ACTATCGAGG TTGAAGATCT AGATAAAGTC

85101 CACGGGCAAT ATCAAGAGAA TCCTTATGCA GATATGGCCT CTAGTGAGGT

85151 TCTTAAACTC GATAAGGGAG TTCATGTTAG CGAGCTTGGC AAAGCCTTTT

85201 CTAGAGTTCG CAATCGCATC ACCAGATCCT ATAGTTATGC CCCTACTCCT

85251 CAGTTGGACT CTATAGCTAT TGTTGGTATA GATCTCGTCA GTCCTGAAGA

85301 ACAAGAGAAT TTAGTACGCT TGGCGAATGA GGTCATTCAA CTCTATCCCA

85351 AATCAAAGAC AACTCTATAT CTTCTTATCG ATTTTAATAA GGAGTGGGTA

85401 GGGGATATCT CCTCTGATAA GGAAAAACAG CTCCGTTCTC TAGGTCTACA

85451 TTCTGAAGTT CAGTGTCTTT CCGTCTTGGA ACCTCAGGGT GCCGAGGGCG

85501 AAGATACGAA ACACTTTGAC CTTATGGTCG GCTGTTATGG GAAGGATTCT

85551 TACTTAAGGG AGGGTAAAAT TTTACAGCAG GCCCTAGGGA CTTCGTTAGG

85601 TACTGTTCCC TGGGTGAATG TTATGCACAC ATTGCCATCT AGGTATAGAT

85651 CTCGGCTTTC CTTACCTATA AATACCGAAA AGGATAAGAC AGAGCTTTAT

85701 AAAGAGATTT CTCGTACACA CCATCAGTTG CATACTTTGG GAATGGGACT

85751 TGGAGCCCAG GATTCAGGAT TGCTCTTAGA CCGGCAACGA CTCCATGCTC

85801 CTTTATCTCA AGGGTCTCAC TGCCATTCCT ATCTTGCAGA TCTCACCCAT

85851 GAAGAGCTGA AAATTTTGTT ATTTTCAGCA TTTGTGGATG CTAAGAACAT

85901 AAGTAAGAAA GAGCTTCGTG AGGTATCTCT AAATTTTGCT AACGATACTT

85951 CCGTAGAGTG TGGCTGCGCT TTTTACTTTT AGTGTCCTAT GATGAGAAGG

86001 AGAAAGACGT AGTTGTCGTT TGTAATCATT CTGAACCTAA TATCCTCGGC

86051 CTGCCTCCTG AAGCAGTCTC TCAGCTTATT GAAGAGCTTA GCGATGAAGG

86101 CTATAGCTAT CTGAATGTAG TGCGTTGTGA TCTCTCCGGG GAGACTACGG

86151 TTCAACAACG TCTGCTATTG AATGCCGATG AAGGGAGATC TATGACGGTG

86201 GTGATCTCAG AQCTTCCTGA AGGGCACCCC GATATTCGGA ATTTGCAGTT

86251 GGCATCCGAA AGAATTTTTG TTTCTCGTGA AAAAGAAGCT GCTGATGCCT

86301 ATGCTTCAGG ATGTAAAGTG GTCGCTTTCG ATGATGAGCA TCTCCCTTGG

86351 GTCTCCAGTC ATATTGCCTA CGCGGAGGAG ATCAGAGAGA AACAAGAACA

86401 AACAATGCAA GGGTCTTTAA CTGAAGAGCA GTTAGGAGCA CTCCTCTGCA 

86451 ACACAGTCTC CACAGAGAAA AATCTAGCCT TTGCTCTAGA CGCCGTGATA 

86501 AAACAGTCTG TGTGGAGATT CCGCAATCCG GATCTTTTTG CTTATGAGAG 

86551 AGAAGCTCTA GAGGCTTCAG TAACAGATGC TTTAGTATCT TACGTTTCAA 

86601 ATTTAGACAT GATACCGTAC ACAAGTTCTC AGGGCATAGT CATAGAAGAT 

86651 AGTAGTATCG TCCGTACCTC TCAAGAGCAT ACACTCATTG TGAACTGTGC

86701 AGCATTCGAT AAGTTAGCGA GCCAAATAGA GTTCTTATGC CCCAGTGACG 

86751 TGTTGCCCAT TTCTGGTAAA GACCCTTTGA TTTCTGATGA TGAGGATGAG 

86801 GAACTGAATC CTAAAGTTTC ATCTGCTGCA GACTCTAAAG ATAAAACCTA 

86851 GGGAGTGAAT TCTACACGAG AATCGAGAGG AGAGCGAGTC TTTCAAGGAT 

86901 TCATAATCCT TGTTAACGTA TGCATAAACA AGTGAAGCCA TTGCAACGTG 

86951 AAGTAATCGC ATTGATGAAA GATGCTTTCC CTAGGGATGC AAAGCAGATC 

87001 GTATTCCCTT CTTTCCAAAA CAGACTATAG ATTCAAAAGA TATTCTTTCT 

87051 TTTCAAATAG ACTTGAGAGA GGGGGGGGGG TTCTCATAGA GTGAGAATCT 

87101 TGGCCTTCAT TGCTAAGTTC TTCGATGATG GATAGAGGAT TCTAAGACGA 

87151 CCGGGGCTAC AGAAAACTCT AAGCAGAGCT TAGAGTTTTA AAATGTGGAT 

87201 TTTAGTCCTG TAGACACTCG GTGGTTTGTA AATCCATTTT TCCCGTCAAA 

87251 GGTATAGTTT AGAAAGGCCT GAGTGTCCTC GGTGAGATCT ACTACATCAT 

87301 GGATTTGTAC AAACAAACCG TGGCGGCGTA GATTTGCTCC TGAGATCGAA 

87351 GTGCTCTCTT GGTTTGAGAC GACAGTCACA ATATTGTGAG GGTTGACACG 

87401 ATAGATATCA GGCTTGTAGG CAAGTTTGAT GGTCAGTGTA GAAGGAGCCT 

87451 TCTTAAATGG TGTGAACCAT TGAGAAGAAC ATCCTATCGG TAGGGAAACA

87501 TTGTACCCTT TACCTCTACT AAAGCTACGT TGCAGATCTC CAGTTTCTGT 

87551 GAACTTACTT TGCCAACCAC CTAGGAATTC TGCGGAAATA AAACCTGAAA 

87601 GATCCCAAGC TTGAGCCAGA GGTCTTGTAA GAAGACACCA GTTTAGGAAA 

87651 GGATGTTCTG CAGAAATAAG AACATAGTAA CTATTGTTAT GCCATTGCCC 




87701 TTGAGATTTT GGAGCTTTGT CAGGTCTGAG GTAGGTGGTA TTTAGGTGAT 
87751 TTTTGGAATA ACCATAAGCT GCTTTCCAGG AAATTAAGGC CTCGCTCTTT 
87801 TGAGTCACGA TAGGGAATTG ACCAAAGAAC GAGAGTAAAT ACATTTGTTC 
87851 TGAGCAACGT GAATCGTAGG GGTTGGCGTT AGTTTTTCCA TAAAGCTGCC 
87901 CGAAAGAAAG TCCTAACGTA GTGTGGTCCG TGTAGTTCAT AGATAGCGCA 
87951 GCTTGGTAGC CTCCATAGCG ACCTGAAAAG CCCTCATGTC CTTGTCTTGG 
88001 TGTGTGTTCG ACATAGGCTC CTAAAGCTTT CGCGGTTATG GACAACCCGG 
88051 GATGATCTAT CAAAAGAACA TCTTGGAGAA TATCAGAGAA TGCCTGATTT 
88101 CCTAAGAAGG AAATCCATAA GCTGTTGCTG ACAATTTCTC CGTAACGCTC 
88151 GGGATCTAAG ATATAGGTAG AACGCACGAG AGTGTCTGAA TTCCATACAG 
88201 CATAGAGAGT ATTTGCGCTA GGAGAGGGAC CTCCAGGAAA TCCTCCATCA 
88251 GGAGCTGGAA TTAACAGGGG ACGGGACCAT GTGTAGGACC ACTTTCCTTG 
88301 GTAGCCGTAG TGGCTTGGAG TCGCAATCTC CCCATCAGGA AATCCTGTCT 
88351 TAGTAACGGT TGCTCCTTTG AAAACAGCGA TAGGAATTGC TACTGGAGTT 
88401 TGTAATGACA CCATATCATA AAGATCTGTA ACGTCATGTT CATCAAGAAC 
88451 CAGAGCTCCT GTTAAAGTGA CGTTTTTTGT GCCTGCATTT ACTGATGCTG 
88501 AAACAAAATC TCTTTTTAGG AAGGAAAAAG GATCGAATGC TAACTTTCCA 
88551 ATCGTAAAGT CTACAGCGGC AGGTGCTCCC GTGGGTGTTG CCAGCCCTAA 
88601 GGTTCCTCCA GAGCCCAGGG TAAGCTGACC TGAGCCCTGA GTAGCGAAGC
88651 CAAGAACATT GACAACCGCA TTGTCAGTAA TCTTCAGTTC TCCACTAGCG
88701 ATCTTGACTG TTCCTAGAAG TATAGTTGTC GTGTTGGCAG GCAACAGGAG 
88751 TTCTGTAGAG GAGAGTCCCT TACTTGTAAA GACTACAGAT CCTGAAGCGC 
88801 CATTAGCGTT GATTGTAATG TCTTTATTAG ACGGACTTGT GGTTGGGAGG 
88851 CTATGTGTAA TGGGATCATA AAATACAAGA CGTGAGCCTC CTTGTGCAGA 
88901 TAGAGACACA ATCTCTCCCC CTGCTTCTAC AGTGATGGCA TTGCGGATTC 
88951 CAGGTTTTGT ATTGAGCATG TTTCCTTGGA ACGCAATATC ACCTTCGGAA 
89001 GCCAAAATCG TAAGTGTCGA TGTTTGCTTC GCAGGGTCTC CAACAGAGGG 
89051 GCCTATGAAA ATAGCTCCAC CCTTCTCCGC GCGGTTTCTA GAGAATTCAA
89101 TAGGACCGCG TCCTTGAATA TTGAGCACTT TAGCACAGAT GGCTCCGCCA
89151 TTTCTCGCTG CAGTGTTGCT ATCTAGGAGC AAGGCACCCT GGTTCCCTAC
89201 GATGTTGCAG GTTTCGGCGT AGATCGCACC CGCATCATTT GGTGTACCAT
89251 TATAAGAGAA GGTGCACTTC CCCTGATTGT TTTTTAATTC GAAAGTTCCC
89301 GTAGGGATGC AGATGGCTCC GCCACTTCCT AAAGCATAGG TTCCTGATGA
89351 AGGGGTGACG CCTTTTAAGC TGTTCACACA GGAGTTGGCG GTGAAGATGA
89401 TGCATCCAGA ATTGTTTTCA AAAAGGAGAG AGCTTCCTCC ATAAATCACG
89451 CCGCCTCCTG ACGAAGTAAA GTTATGGGAG AAGCTCATGG TAGCGGTGTT
89501 ATTGATGAAT TTAACGACTG CCGTGGGAGA ACTGCTAATT GCAGAGCCGA
89551 AGTTGGCAAG GTTCCCAAAA AACTTGATCG ATTGGGATAT GTTTTCGACA
89601 GTGAGTGAGT CTGTAGGTTG GAGAGCAGAG GCTCCTGTAG TTACTGTAGT
89651 GATTGCGGGA GTTGCCGACG TAGTTACAGA TGCTGGAGAA TAGGCAACAC
89701 GGTTCGTAGT GAAGATCAAA TCTTTGATAG ATTGGAAAAC AATATCTTTT
89751 CCGTAGATGA GGCCGCCAAG TCCTGTCGAT TGGTTCCCTG TAAAGTTTAT
89801 AGAAGAAAAA TTCTTGAAGG TCAGAGTCTC TGCGGCAGAA AGTAGCGCAT
89851 AGTTACTATT TGTAGGAGCT TGACAAGAGG TAAACGTTAA GGAAAGGCCC
89901 TTCCCTAATA AGGAGAGATT GCTGGAACTA TTGATAAAGG AACTTGAAAA
89951 ACTCGCCCCC AAGAAGTTTG AACAAACAAA ATCTGAAGTG AGTAAGATCT
90001 CTTCACCCTG ATTCGTAATT GGAGGAACAA AGTTCCCTCC GAGTCTAGTC
90051 TCAGCAAACG CGCAACTTGC ACTACATAAA CAGGCAAGTA GACAAAAAGA
90101 TGAAGATTTG AAAGAAAGAG GCATGCCTCA ACCCTGTCGT TAAATAAGGT
90151 TTAAAATCGT AATTTGCTTC CTATATCTAG AGTATATTGA CGGGAGGTTC
90201 TGCGAATATC ACATCCACAG TGCCCGAAGA TCTCTAGAAC ATCATTTACT
90251 GAAGTATGGC TGGATGCTTG TACTAGCAAA GTACTTCTGG TTAGATTATT
90301 CCCTATAGAG GTCCACGTAG CTCCATTAAT AGGTAATGTC GTATCGCAAT
90351 CAGGATTGTG ACGATAGACA TCAGGAGCAT AGGCTACGAT TATAGTGTAA
90401 AAATCTGGTC GATTATGAGA ATTTTTACCA AAGCGGACGC CTACGGGAAC
90451 TGCAACGTTG AGTAGATGAC CGTGTCCAAA AATCCTCCCC TCGGGGGTAT 
90501 TTTCTTGGAT GCCCCCATGA GTCGCGTAAG CAACTTCAGC TTTTACAAAG 
90551 GGAATGATCT GCTTGAGGTT TAAGATGCGA GAAGAGAGAG TGATGGGAAG 
90601 GTTCCCTTCG AGTTCTCCTA ACCAGCAATT GTTACTCCAA GAGCAGCGCC 
90651 CTTTAGGCAA TTTTGTATAT GAAGTCTTTA CTTTCTCATT GCTACGGCTA 
90701 TAGGTAACTC GAGAAGTGCC TCCTGAGAAG AATCTCGATG ATCCAAACAG 
90751 AGACTTGGTG ATGTTAGAGT ATACTGTAGC GAAATAAACG TTAGAATGAC 
90801 CGTGACCTAC GAGGTAATCC TTAGATTTTG TAAACAGCTG TCCAAAACCT 
90851 AGACTTAACG CAGCATCTGG AGTGATGCGT GTGTAGGTAT TGATGAGGTA 
90901 GCCTCCACCC ATATGGCGGT AGCTGCGTGC ATCACCGGTA TGATTCGCAT 
90951 GGAAGAAATT TGTAATTCCT GTGATGCTCA GTTGCTTCCC AGGGACATCT
91001 TCGCCATCAG CTGCTGACGC TTGACTTACA GCTCGTAAAT CTATGACGTT 
91051 TGCCCATAGG CTATTAGGAA TGAGGGGAGC AAGACGCTCC GGATGAGGAA 
91101 GATAACCGGT TTTTTTCCAA TTTCCTGTGA CCGTATGGGT TGTCGTGTCT 
91151 ATGGTAAACT CCCAAGTTCC TTGATACCCA TAGGGAGATT GCTGATAGCC 
91201 GTTTGTGCCG AGACTGAAGT CCGTAGTGGT TACAGTATTT GAAGTCGCTT 
91251 TGAGTTCTAA AATCGGAACT TGCTGTAAAT CTTTATTAAA CATCCCGTGG 

91301 TTGTCACAGC AATCTTGAGA GTTTTTCACA AGTCCTAAAG TTCCGGATAT
91351 AGTGAGAGCT CCATTGGTAC TCTGCACATT AACGACAGCC GCTTTAGTGC 

91401 CATCCAAAGA ATCCAGATTG ATTACAAGCT TGTTTAAGGT GATAGCACCG
91451 TCAGTATTAT TAGCTCCATT TGTAGTTGCT AATGTGGTCC CTGCATCCAT
91501 GATGACGACG GACTTTTCAT CTTGCGTGAA GTTATGAACA TTTAAGGTAG 
91551 CACCGTTTCT TAAAGCGAGA GTACCGCCTT CAAGTTCTAG CTTTTGGTTT 
91601 AATGTAGATG TAGCATTTGC AGGGGTTGCT GCTTCGGTAG CAGTGAGGGT 
91651 TTCTCCTGAA AAGACAATAG TCCCTGAATA CGCACCATCT GCACTGGCTT 
91701 TGGGATTGAC GACCACAGTA GCGGCTGCGG ATGCAGCAGA TAAATCATCA 
94451 TGAAGGACGA CTCGCCTTAG AACCTAAAAG GTAGGGAGTA TAAATCGCAG
94501 CTCCACTCCA CGAAGAGTAG TTCCCACAGA AAGTCACTTC CTCACTATTT
94551 TCGATCATCA CGGAACCAAG ACTATAAATC GCTCCTTGTC CTTGAGGTAG 
94601 TAGAGGTGCT GAGGTGAACG CTAAGTAAGA AAAATTAGAG AGAGTGAGAG 
94651 TGGTGTCTCC AACGCGGTTC GAAATGGCAG CGCCAAAACC CTCGGTCATA 
94701 AGGTTGTGAA AAGTGAAGTT GCAACGGTTG CCCATGAAAA AAAGATTCCC 
94751 AGATCGATTT ATAAAAACCC CAGCATCTTC TTGATCATGC TTAACGTTGG 
94801 AAATCCTCAC GTCATCTAGA AAGATGTAAG AAGTTCCTTC TGGATAACAG 
94851 GTAATTTTAG GTTCTAAGCT TTTATTATTG ATAGCACCGT TATAACCATC 
94901 ACTTTCATGA AGATATACAA CTTGTGCTGC TGCAGGGAGA GCGAGGAATA 
94951 AAGCCGAGCA GGTAAGAAAA TTTCGAAGTA TGGTCATGGT TTCCTCGTTA 
95001 AATCAATAAG GTTGAAGCAA CTTTAATAAA CAAGAAAAAA AGAAGTCAAT 
95051 AAGAATAGAT TATTGTCTAT TAATTATTTA ACTGTTTTTA AAATAAAATT 
95101 ATAACTAGAA ATTATTAAAA GAAATCTTTT TTGAAGAGGG ACAAATGTTA 
95151 TTTTTTACAG TTTGCAAGGA AAGCATTCCC TATAGCAAAT ATTTCCCTAA 
95201 AAGTATGAGA AAACTCCCTA GAAGAACTAG GGAGTTTTAG CAATCTAGAA 
95251 TCGGAGTTTG GTACCAACAT CTACATTGTA GTTCCTTGAA GATCCACGGA 
95301 GTTCCATAGC GTAATGTCCG AAGAGCTCAC AATTGGAGTT GTAGACGTAG 
95351 TTGTTGCTAC CCCTCAGTAA AAATGCCTGT CTTGAAAGAT TGCCACCGCG 
95401 AATTTTCCAA GAGTCTGGGC TCATCACAAG AGTCGCTGTA GATTGGGGAT 
95451 TGTTACGATA GACATCGGAA ACAAAGAATC CTGAGAGATC ATAGGTGTAG 
95501 GAATCTCCGA TATCCCCCTG CACGAATTTC GCACCCACAG GAATCGAGAG 
95551 GTTAAGCAGC CTTpCAATAC TAAAACCACG GCCATCACTA GAGCTTTCGA 
95601 AGAAGCTATT TTGTGATACA TAAACCATTT CGACTTTCAT CTGTGGAATG 
95651 AAGGTCTTGA AAAGAGGATG TGGGTTGGAA AGAACAAAAG GAAGGTCTAG 
95701 GCCGATACCA CCAGCTATAC ACTCGTXGCT CCAAGAACCT TCGGATTCTG 
95751 GCAATGAGGT ATAGTGCGTT TCCATACGGT TGTCTGAATG GCTGAACGAA 
95801 ACTTGGACAT CCAAGGCTAG GGGAATTTCC CTAGGGAATT TTTCTATAGC
95851 TGATTCAGAA AACTTTGCTC TTCCTAATCT CAAATAGTTT TGGGGTTGTA 
95901 GGGTATGAGA GTGCTTGAAG AATAAAGTTC CACCGTAGGT TCTAGAGTTG 
95951 TTGTGAGCGA TAAAACAATC TTTGTCTCTA GCAAAGAGAT GGCAGAACGC 
96001 AAAGGTAAAT AGGTCGTCTT TAGGAGTGTG AGCACTTCCA CCGATGACGT 
96051 AGCCTCCAGA GGTATGACGG AAGCCTTTGC GATTTTCATC TCCAGTCTTA 
96101 TGCAGGAAGT TCGTCATGGA GGAAACCCAG AAACCTTGTT TGTGTTCCAT 
96151 ACCAGTTGCG CCGATCTCTA CAAGCTGTTG CAGAGAGCGA ATGTCAGTAA 
96201 AGACTCCCCA TAGGGTATTG CATACTAACG CAGATTTTCT TTCGGGGCTG 
96251 GGAACAAATC CTGTTTTGGT CCAAGTTGCC GTGGCCTCTT TTGTATTTGT
96301 AGCTGTATCC GTAGTCCAAT TAACATTCCA TTGTCCTTGG AATCCGTATT
96351 CTGAATTAGG ATCCTCAGCA GGAACAGGGA TAAGGCTGCT GATGTCAACG 
96401 TTAGTATCAA CATCAGCATC AACCGTGATT TTTAATAGAG AGAAGAGCTG 
96451 GTCATGGCTG AACATATGAC TTTCATAAAT GTTCCCTTCA ATATCAATCA 
96501 GGTTGAGCTT CCCAGATACG ATCACTTTAT TTGAAGCACC TTTTGCTGTT 
96551 AGGCTGACGG GCTGCTTAAG ACCTAAGGAG TCAACATTGA TTCCTAGGTT 
96601 CGTGATTGTA ATACTCCCAG CTGTAGTTGA TAATGTCGTT CCTGAATCCA 
96651 TGCCGAGGAG AGAACCGGCC TCTTGAGAGA AGCTCGTGCT CTCTAAAGTG 
96701 ACTCCCTTTT GTAGCAATAA CTTTCCTCCG GATAGGGAGA CTGGCTGCGT 
96751 GAATGAAGAT TTTAAATTGT CAGCAACTTT AAGTTCATCT GCTGTTAGGG 
96801 TTTCTCCAGA AAATAGAATC GTTCCTTGAT ATGGATTGAG AGCTCCCGCA 
96851 GAGCCGTTAT TTATCTTCAA TACGTCTGAT GAGGTTCCTT CTGAAGTGAT 
96901 GGGATCATAG AAGAAAATTG TATGATTTTT AGCAGCCCGT AATTCCGTGA 
96951 ATTTCCCGTT ACTTCCTATG TTGATCGCAT TACGTTTAGG AGTATCGGTA 
97001 CTTCCGGTTG TTGTAAGGGT ATTTCTTACA AAGGTAATGT TTCCTGTCTC 
97051 TGCAGAAAGA CTGAGCTCTC CTGAGGCATC GATGCTGATA GCACCCCCCT 
97101 TAGGAGTTGC TGATGAGACA TTATTTCGTA GAAACTCTGT AAAGCCTCCA 
97151 GAGGAAAGGG CTAGCTTTTT AGCATGGATG GCGCCACCGC TTGTTTCTGC

97201 TACGTTTGAA GCAAAGATCA GAGTCTTATT GTTAGAGATT ATCAGTTCAG 

97251 GAGATCCACT CGCCTTGGTG TTGCAGATCG CACCGCCAGT AGTTTTCGCT 

97301 GCATTCCCTT CAAAATATAG AAATTTGTTG TTCGATAGTA TCGACGTGCC 

97351 TTCATCATCG ATAGCGCCTC CTGACGTAGA CGCTATGTTA GATAGGAATC 

97401 TAACATAACC TGTGTTATTT GCTATGCGAG CGCCTGCTGT AGTAGCAATT 

97451 GCTCCTCCCT TTGTTGATGA AGAGTTGTTA CTAAAAAGAG CATCTCCAGA 

97501 AGTGCCAGTT AAAAGGAAAG ACGCTCCTTT GATAGCTCCA CCATCTGCAG 

97551 TAGAAAAATT CCCAGCAACT ACAAGTTTAC GAATATTTTC TAAATTTACG

97601 CCTCCTGCTG AGGAAAGCGT TCCCTGACCT GTAGTAACCG TTGTGCTAGG

97651 AGAGGAATCA AAACTCAGTA AGGAAAACCC TGAGAAGGTA AGATTCTTAT 

97701 TTGCTGTTGT AGATGCAGCA GCACCTGCAT GAGTGCCAGC ATCTATAAAG 

97751 CCAAACGTTA AGCTATGACC GTTCCCCAAG AAGGTAAGAT TGTCCGTGGT 

97801 TTGCTTAAAA CAACTGTCAG ATAAGGGAGT GCCTTTTCCA GGCTCGTAAA 

97851 AGAAGACATC TCCTGTTAGA GAATATGTTG TGGCTGAAGT TTTTGGAGTA 

97901 AACGTTCCTG AATCGATATT TCCATTAAAG CTATCATCAG GTGATAAAAG 

97951 TTCCTCGTTA GCTAGTGACT GTAGGTGACA TGAGAAAGCT AACACGGAGG 

98001 AAACTAAAAC CCAAGGAATC GAAGTCTTCA TGGTAATGCT TTTGTTTTTT 

98051 AGAGAACTAT TCGCATCAAT ATAGAAACAA AATAAGTAAA TCAAGTTAAA 

98101 GATGACAAAA CAGCTGTCAA GAATTTTTAT CTTGACTCTC TGAGTTTTCT 

98151 ATTTTATATG ACGCAAGTAA GAATTTAATA ATAAAGTGGG TTTATGAAAT 

98201 CGCAATTTTC CTGGTTAGTG CTCTCTTCGA CATTGGCATG TTTTACTAGT 

98251 TGTTCCACTG TTTTTGCTGC AACTGCTGAA AATATAGGCC CCTCTGATAG 

98301 CTTTGACGGA AGTACTAACA CAGGCACCTA TACTCCTAAA AATACGACTA 

98351 CTGGAATAGA CTATACTCTG ACAGGAGATA TAACTCTGCA AAACCTTGGG

98401 GATTCGGCAG CTTTAACGAA GGGTTGTTTT TCTGACACTA CGGAATCTTT 

98451 AAGCTTTGCC GGTAAGGGGT ACTCACTTTC TTTTTTAAAT ATTAAGTCTA 
98501 GTGCTGAAGG CGCAGCACTT TCTGTTACAA CTGATAAAAA TCTUTL'UCTA

98551 ACAGGATTTT CGAGTCTTAC TTTCTTAGCG GCCCCATCAT CGGTAATCAC 

98601 AACCCCCTCA GGAAAAGGTG CAGTTAAATG TGGAGGGGAT CTTACATTTG 

98651 ATAACAATGG AACTATTTTA TTTAAACAAG ATTACTGTGA GGAAAATGGC 

98701 GGAGCCATTT CTACCAAGAA TCTTTCTTTG AAAAACAGCA CGGGATCGAT 

98751 TTCTTTTGAA GGGAATAAAT CGAGCGCAAC AGGGAAAAAA GGTGGGGCTA 

98801 TTTGTGCTAC TGGTACTGTA GATATTACAA ATAATACGGC TCCTACCCTC 

98851 TTCTCGAACA ATATTGCTGA AGCTGCAGGT GGAGCTATAA ATAGCACAGG 

98901 AAACTGTACA ATTACAGGGA ATACGTCTCT TGTATTTTCT GAAAATAGTG 

98951 TGACAGCGAC CGCAGGAAAT GGAGGAGCTC TTTCTGGAGA TGCCGATGTT 

99001 ACCATATCTG GGAATCAGAG TGTAACTTTC TCAGGAAACC AAGCTGTAGC 

99051 TAATGGCGGA GCCATTTATG CTAAGAAGCT TACACTGGCT TCCGGGGGGG 

99101 GGGGGGTATC TCCTTTTCTA ACAATATAGT CCAAGGTACC ACTGCAGGTA 

99151 ATGGTGGAGC CATTTCTATA CTGGCAGCTG GAGAGTGTAG TCTTTCAGCA 

99201 GAAGCAGGGG ACATTACCTT CAATGGGAAT GCCATTGTTG CAACTACACC 

99251 ACAAACTACA AAAAGAAATT CTATTGACAT AGGATCTACT GCAAAGATCA 

99301 CGAATTTACG TGCAATATCT GGGCATAGCA TCTTTTTCTA CGATCCGATT 

99351 ACTGCTAATA CGGCTGCGGA TTCTACAGAT ACTTTAAATC TCAATAAGGC 

99401 TGATGCAGGT AATAGTACAG ATTATAGTGG GTCGATTGTT TTTTCTGGTG 

99451 AAAAGCTCTC TGAAGATGAA GCAAAAGTTG CAGACAACCT CACTTCTACG 

99501 CTGAAGCAGC CTGTAACTCT AACTGCAGGA AATTTAGTAC TTAAACGTGG 

99551 TGTCACTCTC GATACGAAAG GCTTTACTCA GACCGCGGGT TCCTCTGTTA 

99601 TTATGGATGC GGGCACAACG TTAAAAGCAA GTACAGAGGA GGTCACTTTA 

99651 ACAGGTCTTT CCATTCCTGT AGACTCTTTA GGCGAGGGTA AGAAAGTTGT

99701 AATTGCTGCT TCTGCAGCAA GTAAAAATGT AGCCCTTAGT GGTCCGATTC 

99751 TTCTTTTGGA TAACCAAGGG AATGCTTATG AAAATCACGA CTTAGGAAAA 

99801 ACTCAAGACT TTTCATTTGT GCAGCTCTCT GCTCTGGGTA CTGCAACAAC 
99851 TACAGATGTT CCAUUiUTiv UTAU'AUTAue aacivutac^ umchwwoi

99901 ATCAAGGTAC TTGGGGAATG ACTTGGGTTG ATGATACCGC AAGCACTCCA 

99951 AAGACTAAGA CAGCGACATT AGCTTGGACC AATACAGGCT ACCTTCCGAA 

100001 TCCTGAGCGT CAAGGACCTT TAGTTCCTAA TAGCCTTTGG GGATCTTTTT 

100051 CAGACATCCA AGCGATTCAA GGTGTCATAG AGAGAAGTGC TTTGACTCTT 

100101 TGTTCAGATC GAGGCTTCTG GGCTGCGGGA GTCGCCAATT TCTTAGATAA 

100151 AGATAAGAAA GGGGAAAAAC GCAAATACCG TCATAAATCT GGTGGATATG 

100201 CTATCGGAGG TGCAGCGCAA ACTTGTTCTG AAAACTTAAT TAGCTTTGCC 

100251 TTTTGCCAAC TCTTTGGTAG CGATAAAGAT TTCTTAGTCG CTAAAAATCA 

100301 TACTGATACC TATGCAGGAG CCTTCTATAT CCAACACATT ACAGAATGTA 

100351 GTGGGTTCAT AGGTTGTCTC TTAGATAAAC TTCCTGGCTC TTGGAGTCAT 

100401 AAACCCCTCG TTTTAGAAGG GCAGCTCGCT TATAGCCACG TCAGTAATGA 

100451 TCTGAAGACA AAGTATACTG CGTATCCTGA GGTGAAAGGT TCTTGGGGGA 

100501 ATAATGCTTT TAACATGATG TTGGGAGCTT CTTCTCATTC TTATCCTGAA 

100551 TACCTGCATT GTTTTGATAC CTATGCTCCA TACATCAAAC TGAATCTGAC 

100601 CTATATACGT CAGGACAGCT TCTCGGAGAA AGGTACAGAA GGAAGATCTT 

100651 TTGATGACAG CAACCTCTTC AATTTATCTT TGCCTATAGG GGTGAAGTTT 

100701 GAGAAGTTCT CTGATTGTAA TGACTTTTCT TATGATCTGA CTTTATCCTA 

100751 TGTTCCTGAT CTTATCCGCA ATGATCCCAA ATGCACTACA GCACTTGTAA 

100801 TCAGCGGAGC CTCTTGGGAA ACTTATGCCA ATAACTTAGC ACGACAGGCC 

100851 TTGCAAGTGC GTGCAGGCAG TCACTACGCC TTCTCTCCTA TGTTTGAAGT 

100901 GCTCGGCCAG TTTGTCTTTG AAGTTCGTGG ATCCTCACGG ATTTATAATG 

100951 TAGATCTTGG GGGTAAGTTC CAATTCTAGG AGCGTCTCTC ATGTCTCAGA 

101001 AATTCTGAGA GAGATCGCAT TTAGGATTTT CTTAAACACG ACTCACCTTG 

101051 TTTTTGAACC AGGAGAGATC GGGGATTAAA AAGGCAAGAG GGCAGAGTTC 

101101 GTGAGGTCAC GTACTCTGCC TTTCTTGTTA CAAACACGTT TTAAAATTAA 

101151 GGAAATTTTT TAATAGAAAC CCGTTCTTTA AAATACGTTT CTTTAATTCT 
101201 TATTGAATAA GATAATTCAC TATTTTTAGA TCCTAAATTT TAAGTGGTTT
101251 TTGTTATGCT TCTTATAGAG AATAGCTGCA AAGATTAGAG TTGCAGAGAC 
101301 GGTACGTCTC TTTCTTTTTT AAGGGAAGGG GTGTTGTTAC ACCCATCCTA 
101351 AGATTTGTGA GATTCCCCTC AGGCAGTAAC TTTTACAATC GTACTTTATG 
101401 TTTTGATCTA GCTGTTTTCT TGTCTTTAAT TTATTCAACC ATCGAGAAGA 
101451 GAGATCCATG AGTGGAAATG TATTTTATTA GGATCATCTC TAAGGATGGA 
101501 AATGATGAGC CCATTCCAAC AACCTGAGCA ATGTCATTTT GATGTTGTGG 
101551 GAAGTTTCTT ACGTCCTGAA AGTCTTACAC GAGCACGCTC TGATTTTGAA 
101601 GAAGGAAGAA TTGTCTATGA GCAGATGCGA GTTGTCGAAG ATGCTGCTAT 
101651 TCGTAATCTC ATAAAAAAGC AAACAGAAGC AGGTCTTATC TTTTTTACTG 
101701 ATGGGGAATT CCGTAGGTAT AGTTGGGATT TCGACTTTAT GTGGGGATTC 
101751 CATGGCGTGG ATCGTCGCAG GGACTCTAAT GACCCTGAAA TTGGAGTGTA 
101801 TCTTAAAGAT AAAATCTCCG TATCAAAACA TCCGTTTATA GAACATTTCG 
101851 AGTTTGTCAA AACTTTTGAG AAGGGAAATG CAAAAGCAAA ACAAACGATT 
101901 CCTTCTCCAT CACAATTTTT CCATGAGATG ATTTTTGCTC CTAATCTGAA 
101951 AAATACTCGG AAGTTTTATC CTACGAATCA AGAGCTAATT GATGATATTG 
102001 TCTTTTATTA TCGCCAAGTC ATCCAAGATC TTTATGCTGC AGGTTGTCGT 
102051 AATTTGCAGT TGGACGATTG TGCTTGGTGT CGCCTCTTGG ATATACGAGC 
102101 GCCTTCTTGG TATGGTGTTG ATTCTCATGA CAGGTTGCAG GAAATTTTAG 
102151 AACAGTTTTT ATGGATCCAT AATTTAGTGA TGAAGGATAG ACCCGAGGAT 
102201 CTTTTTGTAA GTCTGCATGT CTGTCGTGGT GATTATCAGG CCGAGTTTTT 
102251 CTCTAGACGA GCTTATGATT CTATAGAGGA GCCTTTATTT GCTAAGACCG 
102301 ATGTGGATAG TTATCACTAT TATTGGGCTC TTGATGATAA GTATTCAGGA 
102351 GGTGCTGAGC CTTTAGCTTA CGTCTCTGGA GAGAAACACG TCTGCTTGGG 
102401 ATTGATCTCC AQCAACCATT CTTGTATTGA AGATCGAGAT GCTGTGGTTT
102451 CTCGTATTTA TGAAGCTGCG AGCTACATTC CCTTAGAGAG ACTTTCTTTG 
102501 AGCCCGCAAT GTGGGTTTGC TTCTTGTGAG GGAGACCATA GAATGACTGA 
102551 AGAAGAACAG TGGAAGAAGA TCGCCTTTGT GAAAGAGATT GCTAAAGAGA

102601 TCTGGGGATA AAGAATCCGG AGTTTTTATC GACTCTAAGA GTTTTCGGAT 

102651 CATAGAAAAC ATTTAAATAT TCAAGAGTCT TTGGCTATTG GATCATAGAC 

102701 AGTCTTAGTA TACTAAAAAG TCTTTGGATT CTAAGACGGG CAGAGTTCGT

102751 GAGATCACGT ACTCTGCCCA TTCTTTCTTG TGATCTAGCG ACTTCTTTGA 

102801 ATCTTCGACC TCTTGTAATC TGGGATTTTT TCTAGTTCTT AGATTCCTCT 

102851 GATCTTTCGA CTTCTCCTCG TCTAAACAAG GCGCATTGTC TTTGAGAAGT 

102901 CCCTAGATAC ACTCAGGATC TCTTAGAATT TCTAAGGGAT CAGGAACGCT 

102951 TTTAGAACTG GAACTTACCT CCAAGATCTG CATTGTAGCT GCGTGAAGAT 

103001 CCACGAATTT CCATAGATAG GTTACTTGTG ACCTCAAGAT TTGGAGAGAA 

103051 GGCATAAAAG ATCCCTGCTC TTCCGATACC AGCTTGTCTT GAGAGATTCG

103101 TTCCTGTAGT TTTCCACGAG GTATTGTTGA TTAGGAGAGC TGTCGTGCAG 

103151 TCAGGATTCT TACGATAGAC ATCGGGAACG TAGATGACAG TAGCTTCGTA 

103201 AGACGCACGC TCGTTTCTCG AGAATCTCTC GAAGGTAATT CCAATAGGCA 

103251 CAGAGACGTT AATTAAATCA CCGCTATCGA AAGATCGTAC CAAGGTAGTA 

103301 TTACGTTCTT TGAAGCTATC TTGGTGTATG TACGAAGCTT CTACTTTGAT

103351 GAAAGGAAAA TACGCGTGGA AGAGACCCTC ATGGCTTAAA GCAGTGTGTG

103401 GTAGGGAGCT CGCAAGTTCC AGAGCGCAAC CGTCATTATA CCACGAGCTC
103451 TCTCCCTTTG GTGCTTGGGT GTAATAGGTT TTCATAGTAT TTTTACTATA
103501 GATATAGCTG ATCTGAGCAT CAAAGAGGAC AGGCTGCTCA CTTTCAGATC
103551 CAGGAAGGTA GCGTAACAAG CTTGGAGAAG ACAAGGTCGC TAGATGCTGG
103601 AGATGGAGAG AAGCTGCATA GGCAGAAGCT CTATTTTTAT TTATAAAGTG
103651 ATCTCTATCT TTCCCGAATA ATTGGCAGAA GGCTGCAGTG ATAAGATTAT 
103701 CAGAAGCTAA TGTTGTAGTC GCTCCTACAA CATAACCTGC ACTTATGTGG 
103751 CGAAAACCTT TATTTATCTT CGTGCTATCT TTATGGAAGA AGTTCGAGAT 
103801 CCCTTCACAC CAGATGCCGC GAGTTTCTTG AGATTGGCGT ACTTTAGTGG 
103851 CTACAAGCTG TTGTATGGAG CGCACATCAA CAAAGGATCC CCATAGCGTG 
103901 TTAGCAACTA AGGTTCCACG ACGCTCAGGA TTCGGATTGT ATCCTGTTTT

103951 TGTCCAGGTA AGAGTCGCTG CTTTGGATTT AGTCGCAGTA TCCTCTTGCC 

104001 AAGATAATGC CCAATTCCCT TGGTATCCCC AATGGATAGG ATTTTTTTCT 

104051 AGGGGATCAG CAGCTAAGTC TGTGATGTGA ATATTCGCGG GGTCGTCAGC 

104101 AGTAAGAGTG AGACAAGAAA AGACTTGAGG GTTATTCCAA GAGACATCTT 

104151 CGTAGACATT TCCAGAAGGA TCTACAAGAG AGAGCGATCC AGATAAAGTG 

104201 ACTGTCTGAC TTGCTTGTGT TGCTTTTAGC GTAGCCTTCT TGGTCTCTTT 

104251 TAAGGAATCT ACATTGAGAA CAAGATTATT GATAGTGATC CCATCAGCGG 

104301 TTTCTAATGT GGTCCCTGCA TCCATGAGGA GGGTAGAGCC CGGAGATTGC 

104351 GAAAAGGACT TAGCAACTAG AGTGACTCCT GATTTAAGAG AGAGTTGCCC

104401 TCCCGCAAGA GTTAGAGGTT GCTGAATTGT AGATTTGAGA TTATCAGCTT
104451 CTGCAGCTTC TGCTTCCGAG AGCTTCTCTC CAGAAAATAC GATGGTTCCT
104501 TGATATGCAG GATTCCCTGC AAGGTCAGGA CCATTTAAGT TTAGAGCATC
104551 TGAGAGAGCT GCAGTGATGC TAGTTGTTAT AGGATCATAG AAGTAGATAG 
104601 TATTGCCTTG AGAGGCTCGC AGCTGTACAA TCTTAGCATT GGTGTTTCCG 
104651 ATGTTAATAG AATTTCTGGT AGTGGTCTGA CTCGAAGAAG CTCCTTTGAC
104701 TACTGTGTTT CCTTCAAAAG TGATGTCTCC ACCAAGAGCC GAAAGACTCA 
104751 AAGATCCAGA GTCAGCAATC GCAATTGCTC CTCCTAAGGG AGCTGCAGTA
104801 TCTATAGCAG AGTTGTTTTT AAAAAGCGTA GGTCCTCCAG AAGAAAGAAC
104851 TAGATTGTCA GTATAAATCG CCCCACCACT AGTAATTGCT GTATTTCCTA 
104901 TAAAGTTCAG TTCCCCGTTG TCTGATAGAG TTAAGACTGG TTTGGGGGCT 
104951 GATGTACTAC TACAGTAAAT GGCTCCCCCT GTAGCTGAGG TTGCGGTCAC 
105001 ACTATTGTTT ATAAAGCTAA TTGCTTTGTT GCTGCTAATA AAACTGCTAG 
105051 CTTCCGTGTA AATGGCTCCG CCATTGTTCG CCGCGGTATT TTCAGAAAAT 
105101 GATGCTGAGT TTAACGTATT GTTAATTGTA ATCCCTCCCG TGGAATAGAG 
105151 GGCACCCCCT TTTTGCGTTG CTTTGTTTTT GGCAAACGTT AGGTTGGGGT 
105201 TTAGCGATAG ACTGATAGAG CTGCCTTGGA GGGCGCCTCC ATTGTCATTA 
105251 GAAAAGTTTT GGCCAAAGTA GCAACTATAG TTCGACTGAA TAGAACAAGC
105301 TCCTGTGGAC TTGATCGCTC CTGTTCCTGT GGTAGCATTC GTGGTTTGTA
105351 TTAGTGACAA ATAGGAGAAT CCTGAAAAGG AGAGAAGCTT ATTTGCAGCT 
105401 GTATTGGTAA AGGTACAGTT CGCTCCCGCA TCGATATTTT GTAGGAGAAA 
105451 TTGGTAGCCG TGGCCTTGGA AAGAAAGATT CCCAGTAGTT TCTTTAAAGC 
105501 AGGAAGCGGT TAGAGCTGTC GGAGATCCTG CATTGGTGAT TGAGACATCC 
105551 CCTGTTAGAT TATAGATAGT TCCATCTGCA TTTGTTGTTT GGGCTGGAGG 
105601 AGTGTAGGTT CCTGGTCCAG AGAAGCTATT GGTAGGTCCT AGATTGATTT 
105651 CAACAACAGC AGCAAACGCA GAGAAATTTA GTGACAAGGG AAGTGCTAAA 
105701 GATGACGAGA TTAAAAACCA ATGAAGAGAG GATTTCATGT AGAGGGCTAT 
105751 AGGTGGTTTA ACAAATTATT TCACCACATA CTGCAATAAA TTAAAGAAAG 
105801 CAAGAGGAAA GGAGAGACTA GTAAGTTAAG AATCTACAGG GTTTTTATAA 
105851 GAATTCCTCC CTAAAAGTTT AGGGAGGAAA GTAGGAACTA GAATGAGTAT 
105901 CTTAGCCCAC AATCTACATT GTAGATGTGT GCTGAGCCAC GAAGCTCATA 
105951 AGCAGCTTCC CCAGAGAGTT CTACATGAGG GGAGAGAGTC AGATGGCTTC 
106001 CAGCACTTGC TAAGAAGGCT TGTCGTGCGA GGTTTTTACA TAGCGAAGTC 
106051 CAAGAGGCTC CACTGACCAT TAGAGAAGTA CGCGAACGGG GATTTTTACG 
106101 ATACACATCA CCAATGTAGG CTAGAGAAAT CTCGAAATTA TTTTTTTCAT 
106151 CTTCGGAGAT TTTTTCTAAC CGAATGCCGA CAGGGATAGA GCAGTTCACT 
106201 AGGTCTCCAT CATCAAAAGC ACGGGCTTCA GCGCCACTCT CTTTAAAGTT 
106251 TTGTTGGCGG CTGTAGACTG CCTGGAACTT TAAGAAGGGG AAATATCCCT 
106301 GGAAGAACGG TGCTTCTTTA GGGAGATATA GAGCCAGAGA TCCTCCGAGC 
106351 TCTAGAGCCC CAGAGTTATT GGTCCAAGAG CCTTGAGCTT CAGGATAGGA 
106401 AGTATAGCGA GTATCCATAT CATTTTTAGT GTAGCTGTAG CTTAGCTGGG 
106451 CATTCAAAAT GAGAGGAATA TCTTTCAGCA TGTCGGTGAT ACTTCCAAAT 
106501 GAGGGCATGG GAAGTCCTCC TAGGAATGCT CGATGTTGCA GGTATAGCGA 
106551 CGCTAAATAG TTATGAGAGG TATTTTCAAC TATAAACAGG TCTTTATCTT 
106601 TACCGAAGAG CTGGCAGAAA GCTACACTGA AGATATTTTC AGAAAAATCT

106651 TCAGCACTTC CTCCAACAAT ATAGCCGTAG CTTTTATGTC GGAATGCTTG 

106701 GTTAGTTCCT GATTTATCCT TATGGAAGAA ATTCGCAGTT CCTGATGCCC 

106751 AGAGTCCTCG TTGCTGATAG ATACTATTCG CTTGAGATGT CATGATCTGC 

106801 TGTAGAGTGC GAATGTCAGT AAAGGATGCC CATAATGAAT CGGGAACTAC 

106851 GGAAGCTCTA CGCTCAGGAT TAGGGTTGTA GCCCGTAGTT ACCCAAGTCA 

106901 TAGTTCCTGA TTTTGCAGTT GATGTGTCTG CCCAAGTGGC TTCCCAATGT 

106951 CCCTGATACC CGTAATGAGG TTCTGGAGTT TGTACTGGAG AAGTGAGAAG 

107001 CGCATCGATA TAAATATCGC TAGCAGCAGT AGCAGCAGTG AATACCACCA

107051 AAGGCTGCGT GAAGGCTTGG TTTATCGTAT GGCTTTCATA AAAATTGCCG 

107101 CTACTATCTT GGAAAACAAG AGGAGAGGTT AGAGTTATAG TTTTGTTGGC 

107151 TCCTGCTGTT TCAATGGACA CACTCTTATT TCCCTCTAAG GCAGAAAGAT 

107201 CAACGACAAG TTTGGTAAGA CTGATAGCTT CAGTATCTGC TTTGAGCTTT 

107251 GTTCCTGGTT GCATGAGGAG TGTAGAGCCT TCAGTCTGTG TGAAACCATT 

107301 GACATCTAAC TCGACATTTC CTTTGAGTGC TAAGGTTCCA GAGGCTAGAG 

107351 CCAATGGTTG CTTTAATATA GATGTGAAGT TATCAGCAGC TTTCGCTTCA 

107401 TCTGCAGAGA GCTTTTCCCC AGAAAATACA ATCGTTCCTG AATAATCTAA 

107451 AGGCGAGTTG CTATCCGGTT GGTTGATGGT CAGAACGTCT GAAGCTCCTG 

107501 TGGTGTTAGA TGCAATCGGA TCATAGAAAT AGATAGATTG GCCTTGGGCT 

107551 GCCCTTAAGT TCGTAATTTT TGCTGACGAT CCCAGGTAGA TAGCATTCCG 

107601 TGTCGATGTT GGCGCGGAGG TTGAGGTTAG AGTGTTGCCA AGGAACGTGA 

107651 TGTCTCCTTG ATTTGCAGAG AGACTTAAAG ATCCAGAGTC GGCAATTGCA 

107701 ATAGCGCCGC CCTTGCCTGC AGCTGTGTTC CCGCATCTAT TATTTGAAAA 

107751 TAGGGTAGGG CCAGCAGCGG AAAGATCTAG ACCATGGGCA CAGATTGCTC 

107801 CGCCTTGAGT TACTGAAGAG TTCTCGGCGA AGGTCAGACT TTTATTTCCA 
107851 GAGATAGTAA GAGTAGGAGT CTCTCCTGTT TTTTCACAAT AAATGGCCCC 
107901 GCCCTTGCCT GCAGCATCTG TTGCAGTGTT TCCAGAGAAG AAAAGGGAGC 
107951 TATTTTGAGT AATCGAGGAG CTGGCTTCAA AGCCCAGAGC CCCACCCCCA 
108001 GTTTCTCCTT TATTATTCAT AAAGACTAAC TGGCCGGTGT TTCCTGAAAT 
108051 ACTTGCAGCC GCAGAGCTAT AGATCGCTCC ACCTAATTTT TTTGCGCTAT 
108101 TAfTAGTGAA GGTTATAGAA GAGGTATTCC CAGAAATAGA AAGAGTTTTT 
108151 fiTGGTGATCG CTCCGCCATT GTTATTAGCT TCATTGGAGA CGTTTTGGCT 
108201 AAAGAGAATC GTTCCATTAT CGGTAAGATT TAAGGCTCCT GCAGAACTTA 
108251 AAGTACTTTT TCCTGAAGCA ACTGTAGTTC CAGGAGCTGC AATGAAGGAA 
108301 AGGTTAGAAA ATCCTGTGAA TGTTAGGGCT TTATCAGCAG TTGTGCTTGC 
108351 CGCAGCTCCT GCATTCGAAC CCGCATCTAC CGTGTTGAAT GAAAATGAGT 
108401 ATCCCTTTCC AGTAAATGTC AGATCACCCG TAGTTTCTGT AAAGCAGCAG 
108451 CCTGTTAATG CTGTGCCTTT CCCAGCATCG TTTATATAGA CATTTCCTGA 
108501 TAAGACATAG TTCGTTCCAT TGGCATCTGC TGTAGATTTT GGAGTAAATG 
108551 TAGAGCCGCC CGCTCCATCA AAGCTATCTG TAGGGGATAA AGAAGCATCT 
108601 GCTCCGTAAG TTGCAATGCT CAATAGAATG GGAGTGACAA GAGTCGAAGA 
108651 GATCAGGAGT TTGTGCAAGG GTATTTTCAT AGAAAGATGC TTGGGTTCAA 
108701 TTAATTAACA CGTTTTCGAT AATCTAGAAA CAAAACTTAG AGCCTAGGTT 
108751 TGTATTATAA TTTCGTGAAG AACTTCGTAC TTCAAAAGCG AATTGACCGA 
108801 AGATTTCCAT GTGGGGGTTC ACTTGGAAAT GGTTCGCAGC ACGAACAGAA 
108851 AAACCTTGTC GTGCGAGGTT GGTACCATAG GCCATCCAGT TAGCATCGCT 
108901 AGCTATTAGG GAAGTTTGAC ATTTAGGATT GCGTCGGTAA GCATCGAGTA 
108951 TATACATAAG AGTAAGATCG TAAGTTCCCT TTTCTGATTT TGAGTCTCTT 
109001 TCGAAGGTGA CGCCTATAGG AATCTCTACG TTGATAAGCT CGCTTTTATT 
109051 GAAAGCGCGT CCTTCAGCAT GACGCTCGTA GAAGTCTTGC 
109101 AGATATACTG TACTTTGACA AAAGGTTCGA CTTCTTTCAG AAGATACGGA 
109151 ACGGAAATAA CAAAAGGCAG GCTAGCTCCA AGATCTGCAC AGAAGGCATC 
109201 GTTTCTCCAA GAACCCTTGA TGATAGAGTT ATCGGTATAA TATGTCTTCA 
109251 TGTGGTTGTC TGTATGGAGA TAACTGAATT TAGCATCGAA CGATAAAGGA 
109301 ATGATCTGGG AGATCTCAGA GAGCACCCAG GGAGCTCGGG TTGCTTTTCC 

109351 CCAGAGGAAA TTGGCGATGT CGAAGAGCCC TTCTGTATGG TGGAAATACA 

109401 AAGAGGCACC GTAAGTATCT CCGTGGTTCT TACCTGTAAT ATGATTGCGA 

109451 TCTCTAGCAA AGAGCTGGCA GAAGGCAAAA GTAAGCTGAT CCTCGGCAGG 

109501 AGTTGTTGCT GTGATCCCTA GTGCATAACC CCCGCTGATA TGGCGGAAAC 

109551 CATGGCGGGT GGGCATAGAA TCTCTATAGA AGAAATTCGC AATTCCTGAA 

109601 AGCCATAGCT CACGCTCAAA AGGCTCCCCA CTGGACTTGG TTTCTATAAG 

109651 CTGATTGATC GAGCGTATAT CTATAAAGTT TCCCCATAAG CTATTTAGAG 

109701 GGAGATTACT TTTTCTCTCA GGACTAGGAA TGTATCCTGT ACGGGTCCAG 

109751 TTGATGCTTC CTATTTTTGA GGATGTTGCA TTTGCCCAAG ACAACTGCCA 

109801 GTTTCCTTGA TACCCGTAGT GGGTTTCAGG TTCTTGAAGA GTCAGGGTAG 

109851 AAAGAGCTCC CAGAGTAATC GTTCCGTTGG CTCCTGCGGT GGTAAGTTCA 

109901 AGAAGAGGAT AGGTACTAGC ACTTTTTAAG TTATGATTCT CATAGAATGA 

109951 CCCTTCCGTG TCAATAAGCG CAATCGTTCC CGATAGGCTG ATATTTTTAT 

110001 CTGCAGCTTC TGTTTTTAAA GCTGCCTTGT TGGTTCCATC TAAAGAGGAG 

110051 AGATTTACTG CTAAGCCATT AAGCGAAAGA TTTGCCTCTT TAGCACTAAG 

110101 TGTAGTCCCC CCATCCATTA AGATGCGGGA TCCTGGACTT TGAGTCAGAT 

110151 CCTTGAAAGT TACGGTGACT CCATCACGAA GTACAAGATC TCCCCGCGCT 

110201 AATACTGCAG GTTGTCGGAT AGTAGAGGTG ACGTTTGCAG CGATTGCTTT 

110251 TTCTGTAGGG GAAAGCTTTT CTCCAGAAAA GACAATCGCA CCCCCATACT 

110301 CGATCTCACT GTTCGCATCT GCTAAGTTTA AGTTCAATGT GTCGGTAGAA 

110351 GCTGCGGTTC CTGGATTTGT GATGGGATCA TAGAAATAGA TAGATTGCCC 

110401 CGTAGCAGCT CGTATCGATG TGACTTTAGC GGTATCAATG ATATTTATTG 

110451 CGTTTCTTGT ACTTGTGCTT CCGTTGGTGA CTTGGTTGTT ATTGAAGGTA 

110501 ATATCTCCAG AAGTAGCAGA GAGAGCGAGT TCCCCAGCAG ATGCTATATT 

110551 GATCGCTCCT CCTCCTCCCT GACCGGCGCT ACTTCCTGAG ATATTACTTT 

110601 GAAATAGAGT AGGACCTCCA GCGGAAATAC TGACCTTGAG TCCAGAGATG 
110651 GCTCCGCCAT ATGTCAATGC TGTATTATTT GTGAAAGAGA GGTTTTTGTT 

110701 CCCAGTAAGA GTCACTGTTT TATCTGTCGT AGTGCAACAA ATAGCCCCGC 

110751 CCTGAGCTTG AGCGGCTTCC CAAGCACTAT TGCCGTCAAA GATCACTTGA 

110801 AAGTTATCTG TAATCGAACA GTTGTCAGTG CTGTACAGAG CACCGCCAGA 

110851 TCCTTTCGCT AGGTTTTGAG AGAAGGAAAC TATCCCAGGG CTGTTCTCGA 

110901 TAGTTATAGT TCCTGTAGCG TAAACTACAC CGCCTTGCTT CCCTGTGAAG 

110951 GCTTGGTTTC TCGAAAAGCT CGCAAACTGA GATGTCCCTG ATAATAAGAA 

111001 GTTTTTCGTA TTGATAACAC CGCCGTTATC TGACGAGAAG TTCTGAGTAA 

111051 ATATAATTTG GGAATTGCCA GTTAGAGATA GATTCCCCAC AGATTTTAAA 

111101 GCACATTGTC CAGTAGGAGA GAGAAGAAGA GAGGGACAAG AGATAATAGA 

111151 GAGTCTAGAA AAATCATTAA AGAGAAGATT CTTATCTGCT GCTGAGGTAC 

111201 TGGCTACAGT TCCAGCGCTA GAGCCCGCAT TGATAAATGC AAACTTCAGT 

111251 GCATGTTGAT TTCCTTGGAA AGTAAGATCG CCGCCCGCTT CTAGGAAGCA 

111301 TCCTGAGGCT AAGGGAATTC CTAAAGCCCC TGCATTTTGA AAGGATACGT 

111351 CGGAAAGTAA GGAATAGGTA GTTCCTGCAG CAGCGTCCGT AGTGGAAAAG 
111401 ACCGTGAAGG TAGTTCCGTT AGATCCATCA TAGCTATTAT TGCTGCTATC 

111451 TAAGGTCACC TCTGCCGCGA CTATAGAGAG CGATGAAAAG AGCGGGATTG 
111501 AAGAAAAGAA CAACCAAGAG ACAGAGGACT TCATTTGTAA GCACTTTTTT 
111551 GAAACAAGGA AATTAAATTA GCAAATACTG TAAAGAAAAA AAGAAATCAA 
111601 GGGAAACGCA AGGAATTGAT TGATGCGGAG AATCAGAACC CCAAGGATGG 
111651 CGGATCTTTT ACTTCTCTTC ATACGGATCC TAAGAATCTC TTTGATGAAG 
111701 AGGGGATGCC CTCCCCCTCT GATACCCTAC AGTGCGATCT CAATAACGTA 
111751 TTCATCTTTA TAAAAAGTAT GTTTTTCTAA GATTCTCGGA GAATCTTAGA 
111801 AAGAATAACG AGTTCCACAG TTTGCATTAT AGCTTCTTGA GGAGCTGCGC 
111851 AGTTCACAAC TTCCAGAAGC GAAGCAGTCA AGACCATGAA GTAACTTCAG 
111901 ATGTCCAGAA GCCTCAGCAA AGAAAGCTTG TCGTGATAAG TTTGTAGCAA 
111951 ACGTAGACCA CGAGGTGCCA TTTGTTAAGG AGGTCAGGCA GTGAGGGTGA 
112001 TCCCGGTAAG CATCTACAGC GTAACCTAAA GTAAGAAGCA AAGCACTGGG 

112051 GGGCTTTGCT GATTCGTGTT TGAAGGTGAG TCCCATAGGG ATAGACACGT 

112101 TGACCAGATG GCTAGCGTCA AAGATACGTG GATCAGCAGC AACCTCTTGG 

112151 AATCCTTTTT GATTTACACT CACAACTTGG AGTTTCACAT AGGGAGAGTA 

112201 GCTGGTAAGG TATCTGTAGT TTAGATCTAC AGGAAGAGAA CCACCGACTT 

112251 CAACAGCGAA GCTATGGCTG TCCCAGTCTG ATTTCCCTTG TGTGTTGTTC 

112301 GCAAGCTTTG TCGTCATATT ATGGTGGTTT CTTCCATAGG AAACTTGACC 
112351 ATGGAGAACA AGGGGAGTTT CTCCTGGGAG CTCTGGAAGG ACCTTAGAGA 

112401 GGACGTGGCG ACGTAATGAG CTATGCAGGG GAATGACATA AGAGCTCTGA 
112451 GCACAGAGAG ATCCTGCATA GACTTGAGAT TTAATATCCG AGACTACGTA 

112501 ATCCTTAGAT TTGCCAAAGA GTTGGCTGAA TGCAACAGCA AAGGTATATT 
112551 CTTGAGGGGT GGTCATGCTG CCACCAACAA TATAACCTCT GGAAATCAAA 
112601 CGGAATCCTG CATTTTCCTT TTGCTTGTCT TGATGGAAGG CGTTGCCAAT 
112651 ACCTCCAATC CAAATCCCTG GATGTGAGGG AGCGTCCGAC ATCGCAGTGG 
112701 CGATCTCCTG CTGTATAGAA TGGATGTTTA CATAAGCATT CCAAAGGCTA 
112751 TTAGGAACTA AAGTCGCACG AAGCTCTGGT TTAGGAGTGT ATCCTAACGC 
112801 TTGCCATTCC GCGACCAAAG TCACCTTCCC TCCAGCTCCT ACTTTAGGAA 
112851 CCAGAGTCCA ACTCCCTTGA TACCCATAAT CCGGAGCAGC CATGCTAGAA 
112901 GGAATCGGAT TGAAGTCGTC TAAATTTACA GTTCCTGAAG TAGAAGAAAG 
112951 ATCTAAGAAA GGAAGATTTA AGTTTGCTTT CAACCCAGGA TTGTCATAGA 

113001 AACTTCCTTC ATTGTTATGG AATTTCAGAT CCCCTGAGAT TTTTAATCCC 
113051 CCACTTGTGC TGTTTACGGC AATCGTTATC ATACGCTTGC CATCTAAAGC 
113101 ATCCAGATTT ACAGAGAGAT TCTTTAGATC GATGCTGCCA TCTGTATTGT 
113151 TAGTTGTCGT GGTCTCTAAG GTCGTTCCTG CATCCATGAA TACTGTAGAA 
113201 TCAGGCTGCT GTGTGAAGGA ATATACTTGT AGGGTGGCTC CTTCTTTTAA 
113251 AACGACATTT CCTCCTGCTA AGTTGATCTT CTGGTTCAGT ATGGTGGTAG 
113301 TATTTGCAGG AATCGAGGCA TCTTGACTGG GGAGTTTTCC AGAAGAAAAT 
113351 ACTATAGTTC CCGTGTTTGG GTTTGCAGGT GCTACAGGGA CTACAGGCAC 
113401 TGAAGCTATA GGACCATTTT TTGGTTGGGG AGGAGGAACA ATAGCTTTGA 
113451 CAACAGGATT GATGACTAAC TCCTCTATTG TTCCTCCAGA TGCAGGAGCT 
113501 TCCATCGTAA TAGGATCATA AAAATAAATC GTATGACCAG GAGCTGCTGC 
113551 AAGCTTAGTG ATCTTAGCCC CTGCACCTAA ATGGATCGAG TTGGGAGTTG 
113601 AAGTTCCCTC AGTCGCTCGG TTCCCTGAGA AAGTAATATC CCCATCAATA 
113651 GCCTCTAAGG AAAGTTCTCC GCTATCGGCT ATATAAATGG CGCCTCCCTT 
113701 GCCTCCAGAA TTATTGGTAA AGGAGACAGG ACCGTTAGCT GTAATCGAAA 
113751 GGTTTTTCGA ATAAATCGCT CCTCCCGAAG TTTCAGCAGT ATTGCCATCA 
113801 AAGTTTATGG ATTCACTGCC TGAGATTACA CACTTAGGAG CATAAATACC 
113851 ACCACCACTT CTTTTTGCCG TATTGTTAAT GAAACTTAAA CTCTCATTTT 
113901 CAGTAAGAGT TAAGCTTTTT GTAGCTATGT CAGACTCTGA GATATTACAG 
113951 AGGATCGCTC CACCACAACC TTCTTGATCT GTAGTTGTTG TTGCTGTTGC 
114001 TGTTGCTGAA TTTCCAGAAA ATACAAGAGC CTTATTTTTG GTAAAGGAAG 
114051 TATTTCCTTT AGTATGTAGA GCCCCTGCTG TCTTTGCTGT ATTTGTGCTG 
114101 AAGGTCACGG TTCCCGTACT TCCCGTAAGA GTAAAATCTT CGGTTTCTGT 
114151 ATAGATCGCT CCTCCTGCAG TTTCGGCAGT ATTGCCATCA AAGGTAAGAG 
114201 TCGTGTTTCC ATGCAGAGCA CACTTGGTCG CATAGATCGC ACCGCCACTT 
114251 ACTGTTGCAG TATTACCAGA GAGACTCACG TTTTCGTTAT CTTCAATCCA 

114301 GAGTCCTTTT TTAGTACTTA CAGATGCTGA CTCAAGAAAC GATAGGATTG 
114351 CCCCACCGCA ACCCTCTTGA TTTGCTGAAG AATTACTCGG GCCCGTAGCT 

114401 TTGTTCCCTG AAAAGAGCAG GTTGGTATTA CCAGACAGAG AGTTGTTGCC 
114451 TTTAGAATAT AAGGCGCCGC CTGTCTTTGC TGTATTTGTG CTGAAGGTCA 
114501 CGGTTCCTGT ACTTCCTGTA AGAGTAAAAT CTTCAGTTTC TGTATAGATC 
114551 GCCCCTCCTG AAGTTCCAGC AGTATTGCCG TCAAAGGTCA GGGAGCCGTT 
114601 TCCAGTTAGA GTACATTTGG TAGCATAGAT CGCACCACCA CTTACTGTTG 
114651 CAGCATTACT AGTGAGGCTG ACTTCTTGGT TGTTTGCAAT CGATAGTCCT 
116051 TACTCCCATA GCATGTGGCT TTAGGAGCAT GGTGCACCAA TAGAGAATAC 

116101 AGTTAAATAA ATCAAGTAAA TGCTCTGGAG AAGACTCTCA GTTATAGAAG 

116151 TTTCAATCTT GGGAGAGAAG CATTTAAGGT ATTTTTCTAT ATTTAAGAGT 

116201 CCCTTAAAAC ATAAGGGAAA TGCTTAAGGG TAGGGGAGAA GGTGTACAAG 

116251 CGGTTTTGCT TTTAGACCTT CTTGAATTTT AGAAGGAGAG AGTAAGGAAG 

116301 ATGGTTATCT AAACCACGGA TCCTTATTTG TTCTTTCTGT GTTTCCGTTT 

116351 TTCTCTTTCG TCATAACCGT CAGAGAATGG ATTGGAGGGG CTGAGGTTAG 

116401 CCGTGCTTTT TCCGTCTGGA GGCGGAGTTT TAAGAGATTG ATTCGATTTC 

116451 CCGCTTTTCT TCTTAGATTG CTGTTTCTGT TCTTCATCTT GATTTTGCTG 

116501 TTGCTGCTGT TTATCTTGGC GATCACGAGG GGAACGGCGG GAAGATGAAG 

116551 AAACTCGAGG AATTGAGGGT TCTTTTCCTC CCTTAGGGTA GACCGGCTCA 

116601 GGGTGTAGAA CCGTCCCCGT GGAGAAATTA GGAGAGCGGG AACTTGCAGG 

116651 CATTATGGGT GTAAACGCAC TGCTCGCTCC ACTACCGAAT GAACTGCTTC 

116701 TTAAATCTTT GAAGTGATAA GGCTGAAAAT CGTCCTTAAA TGGAGGACTT 

116751 ATAGATGCGG GTCGTGTAGA AGCCGCATCT GAGGGTTTTG TTTTAAAACG 

116801 TGAGAGGCTG CCGAAGGAGC GTCTATGCTC AGGATTTCGT GACGAATAGA 

116851 AGAAAGATCC TTGAGCCTCC CGACCGATCC TACGATGACG GGCATCTTGG 

116901 CGTTGTGAAG CTTCGTGTTC CTCCATGCGG GCAGATGCAG AAAGATCTTT 

116951 TTTCTTTTCC ACTGCGATTT TTTTTGCATC GTCGGCGGAT GGAGTCACTA 

117001 GAATAGGCTC CGCGGTCTCT TGGGTTTTCT TTTCCATCCT CAAGAGGTGT 

117051 TCCCTACGGA AATAATCTAA GGTGAGCTTA TTCAAGGACA TGAAGTATAG 

117101 CCCCACCGTA ATCAATCCCA TGACAAACAT AGGGTTGGCA AAGACTAACA 

117151 TCGTCCCACT AGAGGCAACG AAAGCCCCTG CAATAAGACC CGCAGCAATC 

117201 GCTAATACAA TGATAGGAAC GACGATAGCA GTGATTGCCT CACCGATTTT 

117251 CTTGGCCTTC GGACTGTCCA GAATATCAGA AATAAGTAGA GTAACTCCCA 

117301 AAGCTCCCAG GGCAAGTGCC GGAGCGAGAG CATAAAGCAT AAGGCTGTTT 

117351 CCTGAAGCAG TCAGGAGAAT CGAAAGGATC GCAATTGCCG CAAGGGCAAT 
117401 AATGCCTGTG TCATAGACAT AGCGCATTTT AGGATATTTA TCAGGAATAG 

117451 TAATAAATCT TTTGAATAAC CCTGTGGATG ACGTTTTTAA ACGTTTTACC 

117501 ACGCTTGGCT GCCCTGATGG GGATGCTGCT GCAGGAACTT GAGGTTGACC 

117551 TAAAGGGTTT ATAGGGGGTT GGCTCATACT ATCAACTTAC TGTAATTATC 

117601 ATTAGGCCCA TGAATTTTCA TTCATAGGAT ATATTTCATA CTATTATAAG 

117651 ATTTAATAGG ATTTAGTTAG TTCTCTTTTC TTCTGAGTCT TAACTTTTTT 

117701 ATTAAATAAA GTTTATTTGT TAAAATCTTA ACAGATTTTT AACTAAAACT 

117751 TTAAGTTATT TTTATTTGGA ACTTTTAGTC GAAATAAGAC TCGCTTATGA 

117801 GAGGGACATA CTCATCAGCA AATGGAGGGG GCGTGTGAGG TCGTGAGGGT 

117851 GGAGGCTCAT ACGGGGGGAA AAGATTTTTA TGCACTTGGA AGAGGGTAAC 

117901 GCTAGTAGTT ACAGACATGA GAAGCCCCAA GGAGATAAAG ACAACTGGAG 

117951 GGGGGATAAA GACCAGACTA AAGACCAGAC CGATAATCAC AGCAATCGTA 

118001 AGAATGTGGA GGAGCCAAGC CATAGCATAC GAAGCAAGAT GTTGGCAGCT 

118051 TTTTGTTTTT ATAGCGTTAA AGAGAGCCCG TAGGGGTAGG GTCAAGGCAG 

118101 CATATGTAGA AGCGCAAGGG ATCAGGATAA CCTTAAGAGC TCCTAAGACC 

118151 GAGGATACCA GTACTTCTAT GGTAAGAGCA GTTCTGGGAT AGCTCTTCGC 

118201 ACAGTTTTTT AATTTTCGAG CTAGTATTCG TTCCGGAAAA ATGCCATTCA 

118251 GGTATAGCTG AGAGCCTTGT TTGCAGATAT TTTTGAATCC CATATCCGTT 

118301 TGAAAGAGAA TATTTTATGA AAAATTATGT AAAAATTCTA AGAGGATAGT 

1183 51 GGTTTTTAGA CAATCGAAAT TCCTGAAAAG GCAGGAAAAT GAGAGACACA 

118401 AGTAGACAAA ATCTCCTGAA GTTTTTGTAT GGGCCTGTAA AAAAATCTTT 

118451 CTGGAAACTG GAAATTAGAA GTTCATTACA GCGGAACCAC CGAAAAATGC 

118501 GGAGGTTTTA AAAGTATGGG ACGTTTTATT ATTGTTGTGA TCATCAGCTA 

118551 TCGAGATATC ATTCCCAATA GACCAACCGT AAAATCCCTT AATAAAACTT 

118601 CCCGGCCAAG GGGTGAGCTT CACGTTAGCT TCQATTTCAC GTCCTTGATA 

118651 TTCAAAGATG CCGCGAGAAG AAATTAGGTG ATTTTTGTGA AGTTTTTTTC 

118701 GGAATCTTGT AAGGCGGTAA GCAAGCCCTA AGTTACAGAC AGATGTCGAG 
PCT/US99/26923 

118751 CGGTAATCAA TAGAAAAATT CACAGGATAG ATGCAATTGA GAGTTAGTTG 
118801 GTCGGTAGCC TTGTAACTAA CACCTACTAA AGGCCAAGCC TTCTCTTGAT 
118851 GGAGGCCTGT TTCATTAATG ACGCCAAAAA TAGCAGAAAG CTTCTCAGTG 
118901 GCCTGGTATT TTCCAGAAAG AACTCCTTGA TAGAGTCCAT AACCCATCTC 
118951 AATATTTTTA GGATCCACAA GCCCAGAAAG AATGATAGAC CACTGCCAAT 
119001 TTTTTAAGGA GAGTGTATAA GCTCCTAAAG AGAGGAGAAC ATAGTTATAA 
119051 AAAGAAGTAT CTTGGAAAGT CGCCCACCCA AGTCCATTAG GATCTGTCTC 
119101 CGAAATAGGA AGTGAGCTTT TCCATTGAAT ATCCGCACCT ATATAGCCAG 
119151 TAGAAAACAG TAGCCCAGAA TGCTCTGTAA TCGGAAGTGT GCAGAGAAAC 
119201 GTTCCATCGT ATTGACGATA GCCTATAGTT TGATGAGGCA GCTTTTTAAA 
119251 TTTAGCATCG TTCACCTTTA GGTATTGTAC CTGAGCAGAG AAAGGACGTG 
119301 GAGGAGGATT TTTACATGCT TCTTCATCAA TTCCACAAGC ATCTTGAACA 
119351 ATAAAAATAG GAGTCGAGAG TACGTGTCCC GCAAATGCAG CGATGTGGAA 
119401 GAGCAGTTTG AACATGTTCT GTAAGATTCT CCAACGTTAC TAGAGATTGA 
119451 AATGGAATAT ACGTAATTTT TAAATTACTA TCTAATAATT TTCCTACTCA 
119501 GGAGGGACTT TCAAGAAGAT TTCCCTAGAT TTAGGGGGAT GAAAGGACTA 
119551 GAATTTTTCT AATCAGCAAT CGAACAAAAT TACAGGGTTC TTGGAGAGGT 
119601 GCTTAATACT ACGCATAAAT GATTTCTGAG ATGATTTCCG TTTCGGCTGT 
119651 TTCTGTGCTA ACTTGTTTTG GCACTCAGTA ATTTTAAGCT CAAGATTTTG 
119701 GATCGATTCC TTTTTCGCTT CAATTTCATT TAAAATTCTT GTTTTTTCTT 
119751 GATTTTGTGT TGTCAGCTCT GGAAGATGCA CGCTTTTAAT TGTAGAGAAC 
119801 ATCAACCCGT ATCTCTGAAC AAATCCTGAT CCTATAGAGG AAATGTCTTG 
119851 AGATATCTCA TCAATCAGTT TTGTTCCTGT CTGCTTATTT TTAAGAGGGA 
119901 TTGCTAGAAG AAGAGCAAGA ATCAGTGTGA GTACGATAAT TCCCAAGCCG 
119951 ATGCCACAGA TGATCCAGTT TCCAGTATAT CCTGCGTAAC CTGCTGAGAT 
120001 CGTTCCCCCT AAGGTTAACA GTAGTGTTAA GGCAAAACAG AGCGTATGCT 
120051 TTGTGGAAAG TTTGGAGCGA AAGGTCTCCC AAGGAGCCGC TGGAATCGGA 
120101 TGAGGCTGCG TAATATAAGC ATTAGGAAGA TTGAGAACTT CTGTAGCATG 

120151 AGAGAGAGGG CTGCTCTCGG GGACTGGTGA TGGGGCTACG GAAGTTGCCA 

120201 TGTTGTTTCC TCGAATCGTT GCTAACTAGT TTTGATTTGT CTTTTCATTC 

120251 TTGTGAGAAA GCTCAAAACG TTTTTGATAA AGAGTTGATT GCAGTTTGAA 

120301 CTCTTTTTGT GAGAGGTCGC AGAGTTCTTC GTATAACTTT AGCGTATCTT 

120351 GAGTGGTTTT TCTCTGTGCA TTGGTTACAA GCTGTAGAGA GGATTCAGGT 

120401 CGAATCTTCT CCCTGATAAA TTGTACTGTC GTATGGAGCT GCTGAGGGAG 
120451 CTGCCGGATG AATTTAATAA ATCCTACCAA GGCTTGTAGG CAGAGAAGAG 
120501 TCAAAATAGT AAGAACAATG CCAACGGCAA TCAACAGAAT GCTTTGGCTA 
120551 TAGCAACCCA AACAGATAAT AGCGATTCCA GCAAGAGCTA CAATCACTAA 
120601 GATCGTAATC GCAGCAATAT GCATAGGAAT GGAGTGGGTG AGAAAAACAT 
120651 TGCCCTTCTC CCCCAAAGCT ACGATCTCCT TATTCGTAAT GAATAAATCA 
120701 GCAGACTCTT CCGGAAGGGA TGAGGGAAAT ACCCCGTTTA AAGTACTAGA 
120751 CACAAAGAGA ACTCTATTAT TTGAGGAAAT AATTTAAGAA AAATGGTATT 
120801 TTTAGTCAAT TAGTAAGCGA GTCATGCCTC TTAGTTATTC AAATTTTTAA 
120851 AACCTTACCC TTCCTATGAG GAGACAAGTA AGAGAAATTA TGCAACAAAC 
120901 TGTAATTGTA GCAATGTCAG GAGGCGTGGA TTCTTCTGTC GTTGCCTATT 
120951 TATTCAAAAA ATTTACCAAT TATAAGGTTA TTGGCATCTT CATGAAGAAT 
121001 TGGGAAGAGG ATCGCGACGG CGGTCTCAGC TCGACTACTA AAGATTATGA 
121051 TGATGTCGAG AGGGTCTGTC TTCAGCTCGA TATACAGTAT TACACCGTAT 
121101 CTTTTGCTAA AGAATATAGA GAAAGAGTGT TCGCTCGTTT CCTCAAGGAA 
121151 TACTCTTTAG GCTACACTCC TAACCCCGAC ATTCTTTGTA ACCGAGAAAT 
121201 CAAATTTGAC CTTCTACAAA AGAAAGTCCA GGAACTTGGC GGAGATTACC 
121251 TCGCTACAGG GCACTACTGC CGATTAAATA CCGAGCTCCA AGAAACCCAA 
121301 CTCCTTAGAG GTTGCGATCC TCAAAAAGAT CAGAGCTATT TTTTATCAGG 
121351 AACTCCTAAA AGTGCTCTTC ACAATGTGCT CTTTCCTCTT GGGGAAATGA 
121401 ATAAGACTGA AGTTCGTGCG ATTGCAGCTC AAGCAGCTCT TCCCACAGCA 
121451 GAAAAAAAAG ATAGTACAGG CATTTGCTTT ATAGGGAAGC GCCCTTTTAA 

121501 AGAGTTCCTA GAGAAGTTTC TTCCCAATAA AACAGGCAAC GTTATCGATT 

121551 GGGATACCAA GGAAATTGTA GGGCAACATC AGGGAGCTCA CTATTATACT 

121601 ATAGGGCAGC GGCGAGGACT TGATCTTGGA GGATCCGAGA AACCCTGTTA 

121651 TGTTGTGGGA AAAAATATAG AGGAAAATAG CATTTATATT GTGAGGGGGG 

121701 AAGACCATCC CCAGCTCTAC CTACGGGAAT TAACAGCTAG AGAGCTCAAT 

121751 TGGTTTACCC CTCCTAAATC CGGATGTCAC TGTAGCGCTA AAGTCCGCTA 

121801 CCGTTCTCCT GATGAAGCTT GCACGATAGA TTATAGCTCA GGTGACGAGG 

121851 TCAAGGTGCG ATTTTCACAA CCCGTCAAGG CGGTAACTCC AGGACAAACA 

121901 ATAGCGTTTT ATCAAGGAGA TACCTGCCTT GGTAGTGGAG TTATCGACGT 

121951 TCCTATGATT CCAAGTGAGG GCTAGGGAGA GCAGCTTCCT GCTCCTCTTC 

122001 TTCCCTTTCA AAGGCAACGC GATTTTCAAC CAAGGTTGCT CGTAGCTTGC 

122051 GAGCTTCTTG ACGGCAGGAC TCTTTAAGCA AGAGCTCCGC TAGAGGATCT 

122101 TCAAGGTACT GCTCAATGAC ACGGCGTAGA GGACGTGCTC CCATTTCTGG 

122151 AGAATGCCCC TTCGTTACTA GGAAGGAAAT CACAGAGTCT GGGATGTTCA 

122201 AAGCCATTTG GTAGTTTTTC AGTCTCGAGT CCAGTTTGTT GATCTCTAAA 

122251 TGGATGATCT CCGATAGAGA TTCTTTCTCG AGGGGACGGA AAATCACACT 

122301 TTCATCCAAA CGGTTAATGA ACTCAGGCTT TAAGTGTTTC TTCATAGCAT 

122351 GTTCGATTTT CTCTTGGATG ACCTTATAGT CCATATGGGA CTTCAAGCCA 

122401 AAACCAATTT CTCCGCTTTT ACGAATGAGA TCAGCTCCCA AATTGGAGGT 
122451 CATGATAATA ATGGCATGAC GGAAATCCAC TTTGCGACCA AAAGAATCAG 
122501 TAAGACGTCC TTGCTCTAAA ATTTGCAACA TCAGGTCCAT AATGTCTGGG 
122551 TGTGCCTTTT CTATCTCATC AAAGAGAACA ACGCAGTAAG GACGGCGACG 
122601 TACCTGTTCC GTAAGGTGGC CCCCTTCTTC ATGACCTACA TATCCTGGAG 
122651 GTGATCCCAT CATCTTGGTA GCAGCAAATT TCTCCATGTA CTCTGACATG 
122701 TCTACCTGAA TCAGAGCGTC TTCACCACCG AACATCTCTA TAGCAATTTG 
122751 TTGGGCGAGC AGGCTTTTCC CTACACCGGT AGGCCCAAGG AATAGGAAGG 
124151 GGATCCCTAA GAGTAAATGC TCCGTCCCGA CATAATTGTG CTCTAAAAGG 

124201 CTGGCCTCTT CATTTGCTGA TTCAAAAGAT TTTTTTACTC TTCCTGTAAG 

124251 GGCAGGGTCT CCGTAGACTT GAATTTCTGG ACCATAACCA ATCAGGCGTT 

124301 CCACCTCTTG CCGTGCCGTA TCAAAATCTA TACCGAGGTT GCGTAATACA 

124351 TTAACAGCTA CCCCTTGACC AAGTTTGAGA AGACCAAGCA GGATGTGCTC 

124401 AGTACCCAGG TAGTTATGAT TTAAACGCTG AGCCTCCTTT TTCGCCAGTT 

124451 TAATGACTTG TTTTGCTCTA TTAGTGAACT TCTCAAACAT AAAAACCTAA 

124501 AAGACAGGGG TAGAACTTTC CTTAAGCATA TACGAAATTT AAAATAATGA 

124551 TGCAACTCTT CGCTCTAAAC CAGCAAATTT GGTAAAATTC CTCTGAGTTT 

124601 AAGGGAAAGT TATGCACAAA CCTTTTGTAT ATGATACAAT AGTTCAGCTT 

124651 CTTTTGAAAC AGTCTTAATT AGTTTTATGT TTGTTATATG AAAGTTCGTA 

124701 TCGTAGATTC AGGAAAATCT TCAGCGGCCT CCCACATGGC TAAGGACAGA 

124751 GATTTATTAG AATCTCTGCA AGATGGGGAG CTCATTTTAC ACCTTTATGA 

124801 GTGGGAGAAT CCTTGTTCTC TGACGTACGG TCACTTTATG CGTCCAGAAA 

124851 AATTTTTACT TTCCAACTAT GCGGATCTAG GATTGGACGC CGCAGTGCGG 

124901 CCTACGGGAG GGGGATTTGT CTTCCATAAG GGAGATTATG CTTTTTCTGT 

124951 TCTTATGTCT GCGACACATC CTTCCTATTC TTCTTCGGTA CTTGAGAACT 

125001 ACCATACTGT AAACTCTTTT GTAGCGAAGG TTCTAGAGAA AGTATTTCGG 

125051 ATCCAGGGAA TGTTAGCTCC AGAAGACGAA AACTCTTCTT CCAGAGATTC 

125101 AGGAAATTTT TGTATGGCAA AAACTTCGAA GTATGACGTT CTTTTTGGGG 

125151 ACAAGAAGAT AGGGGGCGCT GCCCAACGCA AGGTGCAACA GGGATTTTTA 

125201 CATCAAGGAT CCTTATTCTT ATCGGGAAGT TCTTCTGAGT TTTACCAGAG 

125251 ATTTTTAAAA CCCGAGGTTC TTGAAGAAAT TATTGAACAA ATCCAGATTC 

125301 ACGCGTTTTT CCCTTTAGGT TTGGAAGCTG CTGATGAAGT GCTGCAGGAG 

125351 GCGCGTCAGC AAGTCAAAGA GGCGTTTATT AAATTGTTTT GTGGTGAGGG 

125401 GTTATGATGA GTCGGTTGCG TTTTCGCTTG GCAGCTCTTG GAATATTTTT 
125451 TATTTTGCTG GTTCCTAATT CTGTTTCAGC AAAGACAATC GTAGCTTCAG 
126851 TCTACCAACA GGATCTCTTC TGCAGGCTCT GCAGGGATCC TGCCTGGTTT 
126901 TTTTCTCTGC TATCGTTTAC ACTACGCTTT TATTGTTTGG GTAGAGGGTG 
126951 GACCTTGTTA TCGTTCTTCT ATAAGCATCA AAAAAAATTT ATCGGCATTG 
127001 TCATTGCTGT AGTTTGTGTT TCTTGGTATT GGAGTGGGTT GGGGACGATT 
127051 CTCTAGAAAA GGTTCTGCAG AGTCCACCTC ACGTCGGACT GTTTTTACTA 
127101 CCGCTTCAGG GAAGCGGTAT GTAGAGAAAG ATTTCATGGC TATGAAGAAG 
127151 TTCTTTGCTC ACGAAGCGTA TCCATTTACA GGGAACCCTA GAGCTTGGAA 
127201 TTTTATCAAT GAGGGGCTAC TTACTGATTA TTTTCTAACG ACAAGGGTGG 
127251 GAGAAAAACT CTTTTTAAAA GTGTACCATC CGGGAGAGAA AATTTTTAGT 
127301 AAGGAGAAAG CTTACCAGCC GTATCGTCGT TTTGACGCTC CTTTTATTTC 
127351 CTCTGAAGAA GTTTGGAAAT CTTCAGCTCC CCAGCTTTTA GAGATCCTGA 
127401 AGGTCTTTCA ACAAATCGAG AACCCCATAT CAAAAGAAGG ATTTCTTGCT 
127451 AGAGCCAAGC TCTTTTTAGA AGAGAGAAGG TTCCCTCATT ATGTGCTTCG 
127501 ACAAATGTTG GAGTACCGCA GGCAAATGTT TGCTCTTCCC CCAGATGAAG 
127551 CCTTATCTCG CGGGAAAGAC TTGCGGTTAT TTGGCTACCA GACGATTCAA 
127601 GACTGGTTTG GGGATGCCTA CCTTTCTGCT GCTGTTGAGC TCTTGATCCG 
127651 CTTTATTGAC GAGCAGAAAA AAGTACTTCC CAGGCCCTCA AAACAAGAAG 
127701 CTCGTGACGA CTTTTATGAT AAGGCGAAGC ATGCCTATAC TAAGATCAGT 
127751 AAGAATAAGG AATTTTCCTT AGGATTTGAA GAATTTGTAA ACTCGTATTT 
127801 TCAGTTTTTA GAGATCTCTG AGTCCGAATT TTTCAATATG TATCGAGACA 
127851 TATTGTTGTG CAAAAGAGCT CTTCTCCTAT TGCAGGGAGG CGTTTCTTTT 
127901 GACTTCCAAC CTCTAACTAC ATTTTTCGTT CAAGGAAAAG ATTCCATACA 
127951 AGTAGAGTTC TTTAGACTCC CTAAGGAGTA TAGCTTTAAA ACAAAACAAG 
128001 AGTTAAAAGC TTTCGAAGTC TATTTAAAGT TAGTGAGTTT ACCTAAATCG 
128051 GATAGTTTGG ATGTTCCTAA TGAGATCCTT CCTATAGCGA CCATAAAAGC 
128101 TAAAGAGCCT CGGTTAGTAG GCAGACGGTT TTCTATAGAC TATAAGAGAG 
128151 TCGCTTTGCA AGACTTAGCA GCTACTGTAC CTATGGTTGA AGTGCTGCAC 
130901 CGCAAGAAGG ATATTCTGAA TAAAGGCTGC TTGTAGAAGA ATACCAAAGA 

130951 CATTAAGCCA AGTATACGCA CCTAACCACA TAAACTACCT TTTTCTCTTT 

131001 TTAGAGTCTC GAATGTTAAC AAGCCAAATC ATAATACCAA GTAGGAAAAA 

131051 AGCCGACGGT GCTAGCACCA TAAGACTTAA ATTTTGGTAT CCATCGGGGT 

131101 GGGTTTCGGA AGCATAAACA AATTGAGGGA TGATGCGAAA CCCCATAAGA 

131151 GTTCCAAAAC CAAAGAGTTC TCTGATGACT CCAATGACAA GTAAGACCCA 

131201 GCCGTATCCT AAGCCAGAGG CAAACCCATC TAAGAACGCT GGAATAGGAG 

131251 TCACATGCCT AGCTAGACTT TCAGACCTTC CCATCACGAT GCAATTGGTG 

131301 ATGATAAGAC CCACAAAAAC AGAAAGTGTT TTGGAAATAT CAAAGAAAAA 

131351 AGCTTTTAAA AACTGGTCGA TAACAATCAC AAACAAGCTA ATGATAATTA 

131401 GCTGAGTAAT CATTCTCACA CTGTCAGGAG TGAACTTACG TAATAAGGAA 

131451 ACAAAGAAAG ACGAGCATCC TGTAACAATG CTGACAGCAA TTCCCATAGT 

131501 AATTGCCGTT TGTACTGTTG TTGTCACTGC CAGAGCCGAG CAAATCCCCA 

131551 AAATCGCAAT GAGAATTTGG TTGTTGCTCC ATAGAGGATC AAAGAAATAG 

131601 CTTTTATAGG ACTTTTTACT TGTCATTCGC CTGTTTTCTT TTCATGGGTT 

131651 AAATTAGAAA AATTTATAAG GAGCTGACGA TAGCAAGCCA GAGATTGTAC 

131701 ATAAGCTTCA GTGACACCGT TGCATGTTAA GGTGGCTCCA GAAATCCCAT 

131751 CAATAGCAGA AAGAGCTTTT GGAGAATCTC CCAAAGTAGT ACGCACGGAA 

131801 CCTTTAACTA CCTCAAGCCC TAGGTCTGTT GTTGCAAAAT TTGTAGTTCC 

131851 AGAAGAATCT TGTAGGAAGA TTTTCTTCCC ATAGAATTGC TCTTGCCATT 

131901 CGGGATTTGT AATATTTGCT CCTAAACCTG GAGTTTCTCC TTGTTGGTAC 

131951 CATGCGGTTC CCAATACAGT GTCACCGTCG TTTTTCACTC CTAGATAGCC 

132001 ATGGATGGGG CCCCAAAGGC CGAATCCTGA TATAGGGAAG ATCAAAGCTT 

132051 GAACTGTAGA AAGGTCTTTC GCAACGTCGG CTCCTGACAT ATTTTCTGTG 

132101 CGAGAGGTAT TCTCTAAAAT GACATAAAAG GGGAGGGGGG ATTGCTGACA 

132151 CGGAGGGGTT TCTTGATATT TCTCAAttAAA TTCAATGGGA TTCAGATTTT 

132201 TTTCTTCAAA AGAAAATACC TTGCCTTGGG CATCTGTAAG TAGAGGACGG 
132251 ACAAAGCGCT CGGCATACAG CTCTAATTCA GGATAGGAAA CCTCAGAGAC 

132301 TTTTTTTGTA GCAACTTCAA GAAGTTGTGT TTTTTTATCG AAAGTCGCAG 

132351 GCACCCACTC TTTTTTTTCC TGAATTTGAA ATCTTCCTTT AAAATCTAAA 

132401 ATATGAGCAG CTAAAAGCAT TTGCTTATTG CGATCGAAAG TAGCAGCTTG 

132451 TTCCTGTATT GGGGAGAGCA CATAGTAGAT TGTGGATAAC AGCACTCCTG 

132501 CAAATAAGCT GAGGCCCAGG ATAAAGGAAA CGATGTACCA GGTTTGGTTT 

132551 ATGCGGACGG TATGTTTTGA AGAGCCTTTA GACATATTCT AGACTCCCCT 
132601 TTTTCTATAC TTTCTAACAG CAAAATAGTC GATAAGAGGG GCAAATACAT 
132651 TGCCCAGAAG GATCGCTAAC ATCACTCCCT CAGGATACGC AGGATTGATA 
132701 AGACGAATCA CAATAGTCAT AAATCCTATA AAGAATCCGT AAATCCATTT 
132751 CCCTAATTTC ATAGTCGGCG ATGATACGGG ATCCGTAGCC ATAAAGACTA 
132801 AACCAAAAGC AAGTCCTCCG AGGAAAAGCT GCCGATAGGC GGGAATGAAG 
132851 AATCGAGCAG GTGCCCAAGC TCCGTTTTGT CCCACGATGA GTACGCTGAT 
132901 AAACTTAAAG AGCCAGCCTG TGAGAAAGGC TCCTATCCCA AAGGCTGCCA 
132951 TGGTTCTCCA AGAGGCAATG CCTGTAACAA TAAGGAATAT TGCACCCAAC 
133001 AGACAGGCGA AAGTGGAGGT CTCCCCCAGA GAACCTATAA TGTTTCCCCA 
133051 AAAGAGATTC CCAGCTGAGA ACTTCCCAAT CCCATAGATC ACATCGGTAA 
133101 TAGCATAGGC AGAATCGAAC TGTGTGGGAA GCAGCCCCAA TCCTCCCTCA 
133151 GCAACAGGAG CTGTAACAAA CGTTTGAAGT TGTGTAAGAG TGAGATTATC 
133201 TAAAACCCAA CCAGGATGCG TCTCTGTCCA AAGAGAAAAT TGTGAGTGAA 
133251 TGACATCTTG AGTAGGGACG TGAGGAATGT GAAGCATATT TGCAGCAATC 

133301 GCATCGACAT GCAGACGCTT TACAGAGGGA GGTGTCGAAT TTAGAGTTTG 
133351 TAGGCAGGTA GACTGTGAAA ATCCATCAAT GAGTACTTTT CCTGTCGAGG 
133401 AGTTCATCTT CATGAGGCTA TCTTTAATCA CTCCGGGGTT GCTTCCTACC 

133451 CAAACGTCAC CACTCATCTT TGCTGGAAAC GTAAAAAATA AGAATGCCCT 

133501 TCCTGATAGA GCAGGATTGA GGATGTTCAT CCCTGTGCCT CCGAAGAGCT 
133551 CTTTACTGAC AACAATACCA AAGGCGATCC CTAAGGCTGC CATCCAGTAA 
133601 GGAATTGTCG GAGGGAGAGT AAGGGGATAG AGGATTCCGG TTACTAGCAG 

133651 TCCTTCTGCG ATTTTATGCC CACGAACTAC AGCAAATAGG ACCTCACAAG 
133701 TACCCCCGAC AACATAGCTA ATCGTAAGTA GAGGAATAAA GATCTTAAGT 
133751 CCTTCCCAAA GGATAGGAAC TATATGGATC TCTTTGTAAA CAAAGGATAA 
133801 ATAACTACCA AATCCAGAAA TATGTAAGAA TTGCTCCATC AGCACAGGAT 
133851 TGCCTGAGGT ATAAACGATA GATTGAAGTC CTGAATTCCA GATCGCAACA 
133901 AAGGTCGCGG GAAACAAAGC GATAACAACA AGCATCATCC AACGCTTAAC 
133951 ATCTACAGAA TCGCGGATGA AAGGAGGCTT GGAAGGGGTT TCAATAGGTT 
134001 CGTAACAAAA TGTATCTATC GCATCGACAA TGGGAGTAAA GCGCTGATAC 

134051 TTGTCTTGTT GACATAGTTT CCAAAGAGAA TTTATGAATT TTTTGAGCAT 
134101 TGTGATTGAG AGAAAAGTTG AAGCTTCTAC ACGTTCAAAA ACGTAGCAAA 
134151 CTGCTTAAAA TTTTAGGAAT AAAAATTTTC ATGATTCAAA AATAGATGGT 
134201 ACTTTTTTCG TTGCTGTTTC CAAAGTTATG TTATGGCTGT CAAGCTCCAG 
134251 GAGCCTACTT TTGTTCCAAC TGCTTGGAAA AACTTCTCGT AGAAGATAGA 
134301 GAAGGGCGTT GTCTACATTG TTTTCGTTAT CTTGGTTCTT CCGAAACACG 
134351 TCTATGTAGC CAGTGTTCAC CCTCTTCACA ACTTCAAGCT TTCAGCTTGT 
134401 ACCTTCCTTC GCAAACGGCC CTCTCGGTAT ATGCTCGTGC TTGTGAAGGT 
134451 AAGCGACCCG CTCTGCAGTT TTTTTCTAAG AGTATCGCCT TTGAGCTAGC 
134501 TTCACTGGAT GAGACTCCGA GTTGTATTGC CTATATAACA TCGACAATTT 
134551 CTAGGAAAAT CGTAGTAGAA GTTGCTAAAC TAGAAAAGCT TTTACGCATT 
134601 CCCTTGTGGC CGTGGCTTCC TAAGAAAAGA CAAATAGAAA AACTTCCTAA 
134651 AGGGGAAGGT ATCTGCTTTT TGTCGGCCTA TCCTTTATCA CAAAAATGGA 
134701 TGCAAACTAT CGTTGGAGGG AGTGCATCAC CTCTAGTATC TATAAGTCTC 
134751 TTTCTCTCTC AGAATGATCA GTAATTCCTG CAATTGCAAG GTAACCAAGA 
134801 ATACGTACGC CCTCATCAAC ACACAACGTG TTCGCTTGGA TGTCTCCTTT 
134851 AATGATTGCG CCTCCACGGA GTTCGACTTT TCCAGATACT GTGATATTTC 
134901 CTTCTACAAC CCCTTCAATA ATGGCTTCTT GTAGCTGAAT ATCTGCCTTT 
134951 ACCACTCCTT TAGGACCGAT AATAATTTTT CCTTTTGAGA CTAAAATGCC 
135001 TTCAAAAGTT CCGTCAATAC GTAGGAGACG TTCAAAAGCA AGTTCTCCTT 
135051 TAAAGGTGAC GCCTTCTCCT AAGGTAGTTT CAGGTTCTTC AAGAGGGAGT 
135101 AGAGATTCTG TTCTTGGAGT TGAGGACCAT TGAGGAAGAG AAGATTCTTC 
135151 AGTTAAATTG TGATTCAAAG GGCGAGCTTC CGAAGCTTTA GGGTTGTCAA 
135201 AAAGACTTGG AGGGGTCTCT GGGCGCTCGG ATCTTGAATA TGGCGAGTAG 
135251 CTGGAAGGTG AAGAAGTTTC TTCTTCGTAA AGTGTTTGCA CATCTTCAAA 
135301 AGGACCTTTT CCTGTTCTAC GGAACATGGG ACACCCCCTA AATTAACCAA 
135351 CAATATGATT TTTACAGAGA TTTACTTGAC GCTTAAGAGT TTCGTCATTT 
135401 TGTAATTTAT GTTCTATAGT TTTACAGGCA TAAAGTACTG TCGAATGAGT 
135451 TTTACCAAAA GCAGCTCCTA TTGCAACTAA AGAATCTGTA ATAAGAGTTT 
135501 TTGCTAAATA CATAGCAATT TGCCGAGCTA ACACAAGATC TTTAGAGCGT 
135551 GAGTTTCCCT TAAGATCATT CAGCTTTACT TGGAATACTG TAGCAACACT 
135601 TTTTAAGATC GTTTCTACAG AAATTTTTTG TTTTGTTGGA GAACGGAAGA 
135651 GCTCTTTTAG AGTTTCTCGG ACTGTAGTTT CTGTAAGAGA CTTGCCGAAA 
135701 AGACGACAAT AGGCAGTCAG CTTGTTGATA GCTCCTTCCA ATTGACGGAC 
135751 ATTGCCATAG ATGTGATCCG CAATATAAAA TGCCATTTCA TTAGGAATGA 
135801 GCAATCCTTT TTGCTCCGCC TTGTGCTGTA AAATCGCAAC CCGAGTTTCT 
135851 AAATCAGGGA TGCCGACGTG AGCAACCAGT CCCCATTCCA TTCTAGCAAT 
135901 GATACGCTCG GAAAGTTTGA GCTGACTTGG AGGTTTATCA CTGGTAATTA 
135951 CAATTTGCTT ACTCAGGTTG ATCAAAGTCT CAAAGGTATT GCAAAACTCT 
136001 TCTTCAAAAT TTTGGCGATT CTGTAAAAAT TGAATATCAT CAACAAGAAG 
136051 TAAATCTAGG GAACGATAAA AATTTTTCAT TTTATCAACA GACTTGGATT 
136101 TGAGATGGTA GACAAGATCG TTGATAAACG CTTCTGTAGT GATGCAATGG 
136151 ATGCGTAGAT TTTTATGATG TTCTCTTACG TAGTGACCTA CGGCATGAAG 
136201 TAAATGCGTT TTGCCTAATC CCACACCCCC ATGGATGAAT AAAGGGTTGT 
136251 AGGAGCGGCC AGGTTTCCCA GCAATACCTA CAGCTGCAGA CTTCACAAAT 
136301 TGATTTGAGG GACCTTCAAT GAAATTATCA AAGCGATAGG AGAGATTCAG 

136351 CTTTAATTCA AAATCTTTAG TTTCTTCAAA GACCTCAGAA ATTCCTTCGT 

136401 TTGATTCTTT TTGAGAAGCC ACGGGGGCTG AAGGTTTCTT GTGTTCTGCA 

136451 ACTACAAATT CTAAAGCAGG CTCTCCATGA ACATCTAAGG GGACAAAAGA 

136501 ACAGAGGTCT CTTTTGTAGT TATCAAGAAG ATAATTTTGT ACAAAAATGT 

136551 TGGGGACTTC TAAGCGAATT TTCTCTTGAG TTTCTTCAAG AACTTGAATA 

136601 GGAGAAATCC AATTTTCAAA AGCCGTTTTC GAGCAACGTG TCTTAACATA 

136651 ATTTAAAAAC TGTTCCCAAG TAGTGCACTC GTTACAGGTT AACATGCCGC 

136701 TCTCTTTATT TATAAAGCTT TCCCAAATAC AATCGACCCA TCCCATGAGT 

136751 GATGGCGAGA AAATCTCATT GCATCTTGAC TATTCATTGC ACTGAGTGCA 

136801 ATGACTCTTC GCGTTCAGAT TGGATCCTGC ATTCCCTGGG TGAAGTTCCT 

136851 GTTAGGAATG GATCCTATAC ACCCTTCCTC CGCAGGTAAA CGCGGTACGC 

136901 TCTGCCGATA AAATAACTCT ACCGATTACT AGGTTTTAAG GCAAATTGGA 

136951 TCGTTGGTTT CGTTAGGCAA TAAGGAACCA CAAATTCAGG AAAAAATAAT 

137001 TATGAAATTT TGTAATAAAA ATGGAAAAAG AACTAAAGAA ATCCGGAGTT 

137051 CTTCAATCAC GAAATACGTT TTCTATAGGA GAAAAAATTA ACGAACTAAC 

137101 GCAGCATTTT TTTTGGTTGC TTTACTATAA CTCATCAATA GAGCTTCAGC 

137151 ATCATTAGCA ATTGCTGGTA TGGGACAGGA TGAAAGGTAG GTGGCAATGG 

137201 CAGTAGCTTC TTCAATTCGT CCCAAACAGA AGAGAGCTTT TGTTTTATTT 

137251 AAGAGTGTAG GCAGATGATC TCCTTGCATG CGGAGTGCCT GATCTAAAAC 

137301 AGCAAGCGCC TGACTATTTT CACCAATTTG GAGATAAAGA CCTCCAAGAG 

137351 TTTGATGATC ATAGATACTT AAAGGATCTA AGATCACTAG AGCTTCAAAA 

137401 AAAAGAATCG CTTTTGAATA ATGCCCTTGG CGTAGAAAAG AATATCCTGA 

137451 GATTCTGAGT TCTTCTAACT CATCATCTCC CCAGCCTAAG ATTGCTTTCC 

137501 ATTCATTATC CAACATACAT TATCCTTGAA CAAATTGAAA GATACGAGAG 

137551 ATCACATAGT CTATCATCAC ATTTGTAGAC TTGACTCCAA GATCTAGAGC 

137601 TTTTAAGAGC ACTTTCCCTT GTTGTACTCT AGGATCGTCA TCGTCTTCTT 
137651 CAGGATCTTC GTCTTCCTCT TCTTCTTTAT CTTTGTAAGA CAAAAAGGGA 

137701 GTAATTTGAC TGCGATAGGA AAACTTCCCT CGAGTGAGAA CTTTTAAAAA 

137751 TGAGGAGATT TTTTCTATGT CTTCATCTTG TTGGTCTGGA GATCCTAAAG 

137801 AAGGTGCCAG GTAGGGTGTG GAAAAACGCT GTTTGTAAAA ATTATTTGGA 

137851 GGGGAAAAAC ATGCCCAGTG CGACTTTTGA TTTGTCTGTA AAAGAGACGT 

137901 CAAAGCCGAA GGCTTGGGGT TCATATCCAA AATTTGCGCA TGCTTGGCAA 

137951 CATCACGAAT GGAGATGCCT TCCATCTGGA TTTCTTTGCG AAAGTCGCTG 

138001 ACTATCCTAT TATTGGAAGC ATGTTGCTCA TATATAGACG TGCTATAATT 

138051 AAAAATTTCT ACCATGGCGA GGCCTTAAAA AACCGTCTCT ATCTCTAGAT 

138101 GATAGTGCGG TCTAGGAAAA AGTCAATAGT CTTGATCAGA AGCCTGAACC 

138151 TTTTCTCCTT ATCGAAGGCA CTTAAGAAAA AAGCTCACCC CTATCAAAAA 

138201 ATTTAGAGTG GGGAACTAGC GAGAATTTAA GATAAGGGAA GGCTTTCTTC 

138251 TTTCTATGGA ATCCTGTATC TTTGTCCTTA TTTGAAGCCA AGAGCTTAGG 

138301 AAATTCTTAT GTCCGAACGT GCGCATATTC CCGTATTAGT TGAAGAATGT 

138351 TTAGCTTTAT TTGCTCAACG TCCTCCACAG ACTTTTCGAG ATGTCACCTT 

138401 AGGAGCTGGA GGACATGCGT ATGCTTTTCT TGAGGCGTAT CCCTCTCTAA 

138451 CTTGTTATGA TGGCTCCGAT CGAGATCTTC AGGCTTTGGC AATTGCAGAA 

138501 AAACGTTTGG AGACCTTTCA AGATAGAGTC TCCTTTTCCC ACGCCTCTTT 

138551 TGAAGATCTT GCGAACCAAC CCACTCCACG TCTTTATGAC GGAGTTCTTG 

138601 CAGATTTAGG AGTCTCTTCT ATGCAGCTGG ATACTCTATC CCGAGGGTTT 

138651 AGCTTTCAAG GGGAAAAAGA AGAGTTGGAT ATGCGTATGG ATCAAACGCA 

138701 AGAGCTTTCC GCTAGCGATG TCCTGAACTC CCTAAAAGAA GAAGAACTAG 

138751 GGAGAATTTT TCGTGAATAT GGAGAGGAAC CACAATGGAA ATCTGCAGCT 

138801 AAAGCTGTTG TCCATTTTCG TAAGCATAAA AAAATTCTTT CGATCCAGGA 

138851 TGTAAAAGAA GCTCTTCTTG GCGTTTTCCC TCACTATCGT TTTCATAGAA 

138901 AAATACATCC ACTCACCTTG ATTTTTCAAG CTCTACGTGT TTATGTGAAT 

138951 GGAGAGGATA GACAATTGAA AAGTTTACTA ACATCTGCTA TATCTTGGCT 
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139001 GGCTCCTCAG GGACGGCTTG TCATTATTTC TTTTTGTAGC TCTGAGGATC 

139051 GTCCTGTGAA GTGGTTTTTT AAAGAGGCGG AAGCTTCTGG CCTGGGGAAG 

139101 GTAATCACAA AGAAAGTGAT CCAACCTACC TACCAAGAAG TACGAAGAAA 

139151 TCCTAGATCG AGATCAGCAA AACTACGGTG TTTTGAAAAA GCTTCCCAAT 

139201 GAACAAAAGT CGTTTTTTAC GTTTATGCTG CTGTCTATGC TTTTGTGGAA 

139251 GTCTCTTTTA TTTCTATATT AATAAGCAGA ACTCGCTGAC GAAATTACGC 

139301 CTCGAAATTC CTTGTTTATC TGTACGCTTG CGTCAGCTTG AGCAGCAAAA 
139351 TATTTCTTTA CGTTTTTTAA TTGATAAAAT AGAAAGACCT GATCATTTGA 
139401 TGGAAATAGC AGCTCTTCCC GAATACCAAT ATTTGGAATA TCCCTCAGAA 
139451 GAAAGTATCA GTCTTTTATC CTATGAGCTA CCGTAAACGT TCGACTCTAA 
139501 TTGTTCTAGG AGTGTTTGCT CTTTATGCTC TTCTAGTATT GCGTTATTAT 
139551 AAAATTCAAA TTTGTGAAGG AGACCACTGG GCCGCAGAAG CTCTCGGGCA 
139601 ACACGAATTT TGTGTCCGTG ATCCTTTTCG AAGGGGCACC TTTTTTGCTA 
139651 ACACGACAGT ACGTAAGGGA GACAAAGACC TTCAGCAGCC TTTCGCTGTC 
139701 GATATTACAA AATTTCACCT TTGTGCAGAT CCTTTAGCTA TTCCCGAATG 
139751 TCATCGTGAT GAGATCATCC AAGGGATTCT CCAATTTATT GAGGGGCAGA 
139801 CCTACGACGA CCTCTCCCTA AAGTTAGATA AGAAATCTCG GTATTGTAAG 
139851 CTGTATCCTT TATTAGATGT TTCTGTCCAT GACCGGCTAT CCCTTTGGTG 
139901 GAAAGGATAT GCAACAAAGC ATCGCTTACC AACAAACGCC CTATTTTTTA 
139951 TTACGGACTA CCAACGCTCG TATCCTTTTG GGAAGCTCCT TGGACAAGTT 
140001 CTCCATACCT TAAGAGAAAT TAAGGATGAG AAAACAGGAA AAGCCTTTCC 
140051 CACAGGCGGG ATGGAGGCGT ACTTTAATCA TATTCTGGAA GGGGACGTTG 
140101 GAGAGAGAAA <3CTGTTGCGT TCTCCTTTGA ACCGTTTAGA TACGAATCGT 
140151 GTTATCAAAC TGCCTAAAGA TGGCTCTGAT ATCTACCTTA CGATCAATCC 
140201 TGTGATCCAG ACCATTGCAG AGGAAGAACT CGAACGGGGC GTGCTAGAAG 
140251 CTAAAGCGCA GGGGGGTAGG CTCATTCTAA TGAACTCCCA AACAGGAGAG 
140301 ATTCTTGCAC TGGCTCAATA TCCGTTTTTC GATCCCACAA ATTATAAGGA 
140351 ATACTTCAAT AACAAAGAGC GCATCGAACA TACGAAGGTA 

140401 GCGATGTTTT TGAACCCGGG TCGATCATGA AACCTTTGAC 

140451 GCTTTACAAG CTAACGAAGA GGCTAGCTTA AAATCGCAGA 

140501 TGATCCTGAA GAACCTATCG ATGTGACCAG GACACTCTTC 

140551 AAGGATCTCC GCTTAAGGAT ATTTCTAGAA ACTCTCAATT 

140601 ATGGCTATCC AGAAATCTTC GAATGTCTAT GTAGCTCAGC 

140651 CATCATACAA TCTTTAGGAG TGGCCTGGTA CCAACAGAAG 

140701 TGGGATTTGG AAGAAAAACA GGGATCGAGC TTCCCAGTGA 

140751 TTGGTGCCTT CTCCCCATCG TTTCCATATT AATGGTTCCC 

140801 CTTATCTACT CCATATTCTT TGGCTATGGG ATATAATATT 

140851 GGATACAAAT GGTTCAAGCC TACGCTATCC TTGCAAACGG 

140901 GTCCGGCCCA CTTTAGTAAA AAAGATCGTC TCTGCTTCAG 

140951 TCATCTTCCT ACTAAAGAGA AGACACGACT CTTTTCAGAA 

141001 GAGAAGTTGT TCGTGCCATG CGTTTTACAA CGTTACCCGG 

141051 TTTCGAGCCT CTCCTAAGCA TCACTCTAGT GCTGGGAAAA 

141101 AGAAAAGATG ATTCATGGAA AATATGATAA ACGCCGTCAT 

141151 TTATAGGTTT TACTCCCGTA GAGAGCTCGG AGGGAAATTT 

141201 GTGATGCTCG TCTCCATAGA TGATCCTGAA TATGGTTTGC 

141251 CACGAAAAAT TATATGGGGG GGCGTTGTGC GGCACCCATT 

141301 TTGCTGACCG CACACTCCTC TATTTAGGGA TTCTTCCAGA 

141351 AGAAATTGCG ACGAAGAAGC TGCTGCATTA AAGCGTCTCT 

141401 GAATCGTTCT CCGAAACAAG GGGGAACGAG GTGAGGATCT 

141451 CTTGCTATAG ACTTTTACCG TTGAGCAAAG ACTCTCTATC 

141501 TCTCCTCTTT ATCCTCTATG AGTAGTTTAT GTTATGGCTA 

141551 TAAACTATAG AAATAACTTT AGCTTTCTTC CCCTAAATAA 

141601 TCTTGATGAG ACGGTCTATT GAAGTTTATG GAAGGGGGAG 

141651 GTGTTTGGGG ATTTAGATTT GGGATAAAGG AGGCTTCTGT 
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141701 AGGAGAGCGA AATTTTATAT TTCAGAGAAG AGTAAGAACT TTATGGACAG 

141751 TTTTTTGTGA TTGCTTGTAT ACTATCTTGA TTGAATTTTT TGTCGACCTA 

141801 CGAGTAAAGA AATCCTTTAA GCATTTTTTA AAAATCAGAG TGAGAGCATG 

141851 CCCCTAGAGG GCTTTTTATG AAAAAAGTTG TTTTTCAATA GTCCCTGGAG 

141901 CGTAAATGGA TTTAAAAGAG TTACTCCATG GGGTTCAAGC TAAAATCTAT 

141951 GGGAAAGTTC GCCCTCTTGA AGTGCGCAAC TTGACACGTG ATTCCCGTTG 

142001 TGTGAGTGTT GGCGACATTT TTATAGCCCA TAAGGGACAG CGCTACGACG 

142051 GAAATGATTT TGCTGTCGAT GCTTTAGCTA ATGGAGCAAT TGCCATTGCT 

142101 TCTTCACTAT ACAATCCGTT TCTTTCCGTT GTTCAGATCA TCACTCCTAA 

142151 TCTCGAAGAA TTAGAGGCTG AGCTTTCTGC AAAGTATTAC GAATACCCTT 

142201 CAAGTAAGCT CCATACCATT GGGGTGACTG GAACCAATGG GAAAACTACA 

142251 GTTACATGTT TGATTAAAGC TTTATTGGAT AGCTATCAAA AACCTTCAGG 

142301 GCTTTTAGGA ACCATAGAGC ATATCTTAGG AGAGGGGGTG ATTAAAGATG 

142351 GGTTTACTAC ACCTACACCC GCTCTTTTAC AGAAGTATTT AGCCACTATG 

142401 GTACGTCAAA ATAGAGACGC TGTTGTTATG GAAGTCTCTT CTATAGGACT 

142451 TGCCTCTGGA AGAGTAGCCT ATACCAATTT TGATACAGCA GTTCTGACTA 

142501 ATATTACCTT AGATCATCTC GATTTTCATG GCACATTTGA AACCTATGTT 

142551 GCGGCGAAAG CCAAGCTTTT CTCTCTCGTG CCCCCTTCGG GAATGGTTGT 

142601 TATCAACACA GACTCTCCCT ACGCTTCTCA GTGTATTGAG AGTGCAAAGG 

142651 CACCGGTCAT CACTTATGGT ATAGAGAGTG CTGCTGACTA CCGAGCCACC 

142701 GATATCCAAC TTTCTTCCTC GGGAACAAAG TATACCTTGG TGTACGGGGA 

142751 CCAAAAAATT GCGTGCTCTT CCTCATTTAT TGGAAAGTAC AACGTCTATA 

142801 ACCTACTTGC TGCGATCTCT ACAGTACATG CAAGTTTGCG TTGCGATCTT 

142851 GAAGATTTGC TAGAAAAGAT AGGCTTGTGT CAACCTCCTC CAGGTCGTTT 

142901 GGATCCTGTA CTTATGGGTC CCTGCCCTGT ATATATTGAT TATGCACACA 

142951 CCCCCGATGC TTTAGACAAT GTCTTAACAG GATTGCATGA GTTACTTCCT 

143001 GAGGGGGGAA GACTGATTGT TGTTTTTGGT TGCGGTGGAG ATAGAGATCG 
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143051 CAGTAAACGG AAGTTGATGG CCCAGGTGGT AGAGCGTTAT GGTTTTGCTG 

143101 TTGTAACTTC AGATAACCCT AGGAGCGAGC CTCCTGAAGA TATTGTGAAT 

143151 GAAATTTGTG ATGGGTTTTA TTCAAAAAAC TATTTCATCG AAATCGACAG 

143201 AAAACAAGCA ATTACATATG CTCTGTCTAT TGCCTCAGAT AGAGATATAG 

143251 TGTTAATAGC GGGAAAAGGG CATGAAGCTT ACCAAATATT TAAACACCAA 

143301 ACAGTTGCGT TCGATGATAA GCAGACTGTT TGTGAGGTAC TCGCTTCCTA 

143351 TGTCTAAGCA ACTGTCGTTT TTTGCTTTAT GTGTGTTAGG AAGTCACCCG 

143401 ATTTTTGCTC AAACACCGAA TCCTCCTCAG CGTGTACGAC GCAGTGAGGT 

143451 TATATTTATA GATCCTGGAC ACGGGGGAAA AGATCAAGGC ACGGCAAGTA 

143501 AGGAACTTCA TTATGAAGAG AAGTCCCTGA CCCTGTCTCT TGCTTTGACG 

143551 GTTCAAAGTT ACTTAAAGCG GATGGGTTAT AAACCTCAGC TAACCCGATC 

143601 TTCTGATGTA TACGTTGACT TAGGGAAACG CGTTGCTTTG TCGAACCGTG 

143651 GGCAGGGGGA TGTCTTTATC AGCATCCACT GTAATCATTC TTCAAACGCA 

143701 GCAGCCTTTG GCACCGAAGT ATATTTTTAT AATGGTAAGG TCGGATCTCC 

143751 GACTAGGAAT CGCATGTCAG AAGTACTGGG AAAAAACATT TTAGCTGCTA 

143801 TGGAAAAAAA TGGCATTTTG AAGTCTCGAG GTTTGAAAAC TGCGAACTTT 

143851 GTTGTGATTA GAGATACTTC TATGCCTGCA GTTTTGGTGG AAACCGGGTT 

143901 TTTATCCAAT AGTCGTGAAC GTGCGGCCCT GCAAGATGCT CGCTATCGTA 

143951 TGCATGTAGC GAAAGGCATC GCCGAGGGAG TTCATAATTT TCTTTCTGGA 

144001 CCTAGTTTTC AGAAACCAAA ACAGAATATC GCTAAAATAC GTAAACCACA 
144051 GATACAAGCA AATTAGTACT TTAGGAGTTA AAGGCAAAAA ATCGTCCTCG 
144101 ATGCGATTCG AACGCATGGC CTGCTGCTTA GGAGGCAACC GCTCTATCCT 
144151 GCTGAGCTAC GAGGACGCAA AGACCAGCAC TTTACCAAGT TAGATTAAAG 
144201 AAGTCACGTG TTAGATGCAC GCTAGCAATT TAGGGGAAGT TTTTCTCAAG 
144251 ATGTGGGAAT GATTTTTCTA GGTTCTAGAA ATATAGTTAT TCGCATTAAT 
144301 CGATATGGTT TATAGTGATT GCGCATTTTT TAAAAAATGT CTTGAATCCA 
144351 AAGGATGAAT AGATATGATG AGCTTAGACT TCAAAAGTTT ATTATTTACG 
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144401 GTTACTCAAC GAGCTACTTT AGGGCACTTT AATAGGAGGC ATTGTCTAAT 

144451 ATGGCTACCA TGACAAAGAA GAAACTAATC AGCACGATCT CACAAGATCA 

144501 CAAAATTCAT CCTAATCACG TACGTACCGT GATTCAGAAT TTTCTAGATA 

144551 AAATGACCGA CGCCTTGGTT AAAGGTGACA GGCTTGAGTT TAGAGATTTT 

144601 GGTGTGTTGC AAGTAGTAGA AAGAAAACCA AAGGTAGGAC GTAATCCTAA 

144651 GAATGCAGCA GTCCCCATTC ATATTCCTGC TAGACGCGCT GTAAAGTTTA 

144701 CTCCAGGGAA AAGAATGAAG CGCTTGATAG AAACTCCGAA TAAGCATTCT 

144751 TAATTCTTGT AGTCTTCTTT GTCTCAGTTG TTAGAGTCAG ACCGGTTTTT 

144801 TACCGGGCTT GACTCTAATT TTTGTTATTA TTATCGTTTG GTGCAATGCT 

144851 TTTCTGATCA AATTGTGCGT GATAATGGGG CTGCAATCCA GGTTACAACA 

144901 TTGTATAGAA GTGTCCCAGA ATTCGAACTT TGATTCACAA GTAAAACAGT 

144951 TTATCTATGC GTGCCAAGAT AAGACATTAA GGCAGTCTGT ACTCAAGATT 

145001 TTCCGCTACC ATCCTTTACT AAAAATTCAT GATATTGCTC GGGCCGTCTA 

145051 TCTTTTGATG GCCTTAGAAG AAGGCGAGGA TTTAGGCTTA AGCTTTTTAA 

145101 ATGTACAGCA GTACCCTTCA GGTGCTGTAG AACTGTTTTC TTGTGGGGGA 

145151 TTTCCTTGGA AAGGATTACC TTATCCTGCA GAACATGCGG AATTTGGCCT 

145201 ACTCCTGTTA CAGATCGCAG AGTTTTATGA AGAGAGTCAG GCATACGTCT 

145251 CTAAAATGAG TCATTTTCAA CAGGCACTCT TTGATCACCA AGGGAGCGTC 

145301 TTTCCCTCTC TCTGGAGCCA GGAGAACTCT CGACTCCTAA AAGAAAAGAC 

145351 AACTCTTAGC CAATCGTTTC TCTTCCAATT AGGAATGCAA ATTCACCCAG 

145401 AATACAGTCT TGAGGATCCT GCACTAGGGT TCTGGATGCA AAGAACGCGT 

145451 TCTTCATCCG CTTTTGTAGC CGCTTCAGGA TGTCAAAGTA GCTTGGGAGC 

145501 GTATTCCTCA GGGGATGTCG GTGTTATCGC TTATGGACCT TGCTCTGGAG 

145551 ACATTAGTGA TTGTTATTAT TTTGGATGTT GTGGAATCGC TAAAGAGTTC 

145601 GTGTGCCAAA AATCTCACCA AACTACAGAG ATTTCTTTTC TCACCTCTAC 

145651 AGGAAAGCCT CATCCCAGAA ATACGGGATT TTCCTACCTT CGAGATTCCT 

145701 ATGTACATCT GCCGATCCGC TGTAAGATCA CTATTTCCGA CAAGCAATAT 
145751 CGCGTGCACG CTGCGTTGGC TGAGGCCACC TCTGCCATGA CGTTTTCTAT 

145801 TTTCTGTAAG GGGAAGAATT GTCAGGTTGT TGACGGCCCT CGCTTGCGCT 

145851 CCTGTTCCCT AGATTCTTAT AAAGGTCCCG GAAACGACAT TATGATTCTT 

145901 GGGGAAAATG ACGCAATCAA CATTGTTTCT GCAAGTCCCT ATATGGAAAT 

145951 TTTTGCTTTG CAAGGCAAAG AAAAATTTTG GAATGCAGAC TTTTTGATTA 

146001 ATATTCCTTA CAAAGAAGAG GGCGTCATGT TAATTTTTGA AAAAAAAGTG 

146051 ACCTCTGAGA AAGGAAGATT CTTTACGAAG ATGAATTAAT TTTGGGTCTG 

146101 TAATTGTGTT TAAGAATTGT TTGTATTAAA ATGATTCTTT TTATACGAGG 

146151 AGAGCACATT CTAATGGAAC TTCTTCCACA CGAAAAACAA GTAGTTGAAT 

146201 ATGAAAAGGC TATAGCCGAA TTTAAAGAAA AAAATAAGAA AAATTCTCTC 

146251 TTATCTTCTT CAGAGATTCA GAAATTGGAA AAGCGTTTAG ATAAATTAAA 

146301 AGAAAAGATC TATTCGGATT TGACTCCTTG GGAGCGTGTA CAAATATGTC 

146351 GCCACCCTTC GCGTCCCCGT ACTGTCAACT ATATTGAAGG GATGTGTGAG 

146401 GAGTTTGTCG AGCTTTGTGG AGATCGCACC TTCCGAGATG ATCCCGCAGT 

146451 TGTTGGTGGC TTTGTAAAAA TCCAGGGTCA GCGTTTTGTC CTTATTGGCC 

146501 AAGAAAAGGG ATGCGATACA GCGTCACGCC TTCATAGGAA CTTCGGTATG 

146551 TTATGTCCGG AGGGTTTCAG AAAAGCCCTT CGCTTAGGAA AACTCGCTGA 

146601 AAAGTTTGGC TTGCCTGTGG TCTTTCTTGT CGATACCCCA GGAGCATATC 

146651 CTGGATTGAC TGCTGAAGAG AGAGGACAAG GATGGGCAAT TGCCAAAAAT 

146701 CTTTTTGAGC TCTCAAGACT TGCCACTCCC GTGATTATTG TCGTTATCGG 

146751 TGAGGGATGT TCAGGTGGAG CTTTGGGCAT GGCTGTAGGT GATTCTGTAG 

146801 CTATGTTAGA GCATTCCTAT TATTCTGTAA TTTCCCCAGA AGGATGCGCC 

146851 TCCATTCTTT GGAAAGATCC TAAGAAAAAT AGCGAAGCAG CTTCCATGTT 
146901 GAAAATGCAT GGAGAAAACT TAAAACAATT TGGCATTATC GATACTGTTA 
146951 TCAAAGAGCC CATTGGGGGA GCTCACCACG ATCCTGCATT GGTATATAGC 
147001 AATGTTCGAG AGTTTATCAT CCAAGAGTGG TTACGATTAA AAGATCTAGC 
147051 TATAGAAGAG CTGTTGGAGA AACGGTACGA AAAATTTCGC TCTATAGGTC 
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TGACAGTAAA 


AGTCTTTCGT 


ACAGAAAAAT 


x y Jl 


TTfiPPTTPAP 


AAAAT ATTG T 


GAGCATAACA 


ATAAGATTTC 


TGCTTTAGAG 


148001 PAfiAAAAGTG CTGCTTACGG TTTGCTTCCA CGACCCCTCC TGCATACCAT 
148051 AGPTTCTTTA TTTTTTGCTT TTGTCGTCGT TATCGGAATT TATAAATTTG 
148101 PTATTPPTCP CGAAGAACTT ATCGTATTTT GTGGTTTGCT CTACCTAATC 
148151 TAPfiAPPPTA TTAAGAAGTT CGGGGATGAA AATACCTCCA TCATGAGGGG 
148201 ATGTGCTGCT GCGGAGAGAT TTTATGAAGT CTTGAATCAC CCCGATCTTC 
148251 ATAGTCAAAA AGAAAGAGAA ATCGAGTTCC TTGGACTTTC TAATACAATC 
148301 ACATTCGAGA ATGTTTCCTT CGGCTATCAG GAAGATAAGC ACATCCTCAA 
148351 AAATCTAAGC TTTACCTTAC ATAAAGGCGA AGCTCTAGGC ATTGTAGGAC 
148401 CTACAGGATC TGGAAAAACA ACACTTGTTA AATTACTTCC TAGGCTCTAC 
148451 GAAGTCTCCC AAGGAAAGAT TCTTATCGAC TCTCTTCCTA TTACGGAATA 

148501 TAACAAAGGG TCCTTAAGGA ATCACATCGC CTGTGTATTA CAGAATCCTT 

148551 TCTTATTCTA TGATACTGTA TGGAATAACC TTACCTGTGG TAAGGATATG 

148601 GAGGAGGAGG CTGTTTTAGA AGCTCTAAAA CGTGCCTACG CTGATGAGTT 

148651 TATTTTAAAG CTCCCTAAAG GAGTCCATAG CGTGCTCGAA GAATCTGGGA 

148701 AGAATCTCTC AGGAGGACAG CAGCAACGTT TGGCAATAGC ACGTGCTCTG 

148751 TTGAAAAACG CCTCCATCTT AATTTTAGAT GAGGCAACGT CAGCTCTAGA 

148801 TGCCATTAGT GAAAATTACA TTAAGAATAT CATTGGAGAG CTTAAAGGAC 

148851 AGTGCACACA AATCATTATT GCGCACAAGC TGACCACTCT TGAACATGTA 

148901 GATCGCGTGC TCTACATAGA AAATGGTCAA AAAATTGCCG AAGGCACAAA 

148951 AGAAGAACTC TTACAGACGT GTCCTGAATT TTTAAAAATG TGGGAGCTCT 

149001 CAGGGACTAA AGAATATAAC AGGGTCTTTG TTCCTGATCA CAAATTAGTC 

149051 GCAAATCCTA CGGACATGGC AATAACAACT TAGGTGGGAT CGCTCTCTCC 

149101 ATGAGCTCAG GCAACAACTC TACAAGTGTC TGAGTTAGCT TTTGTGATAC 

149151 CTCCTCCAAT CTGCTGAAGG GACAGTCTCC TGGAACAGTA TAATCAGAAG 

149201 TGATCTTGAG AAAAGAACAG GGGATGTGAT GTTCTGCTGC TTGTGAGGCT 

149251 ATAGCATAGC CTTCCATATC TAGAAGTTTA AACGTCTTAT GAAACCCATA 

149301 ATGGTACAAT ACTGGAGAGG TAACCAGAGA GCTTTTAGGT AGAGAATCCG 

149351 GTAGAGCGTC AAAGATATAA GGGGGATCTT CAGAGAGAAC AGGAGGTGTA 
149401 TCCGTAGTGA GGTTTGCAAT TTTCTCAATA GTGTAACATT GACCTAAAGG 

149451 AATCTCGGGA GAACATGCCC CCACAAAACC TGGATTGATC CACAGATCGT 
149501 AATCTGTATA TGCTTGGCAA TAGCTTTGAA GAGCATTTAA AACGGCTGTA 
149551 CTTCCCCAAA CATGGACAAT ATAGAGATQT AGATGGTAGT CAGTACAACG 
149601 ATAACTATAG AGATGCTCGT TGATCTGTGT AAAATCAAGT TGTTCAATTA 
149651 GAGGAGAAAT TTCTCTATAG TCTGCAACAA TGCAAAGGAT TTTTTTAGGT 
149701 GTATTGACAG CATTCATTGG CCTTCCAGAG CATATGTAAA GCTTTTTTCC 
149751 CAGTTTTAGA TAGTTGAAAG GTTTCTTTGT TGATATAGGT TCCTATGAAT 
149801 CTATGAATCA CGGTCACGTT TTTATTTTTA GAGTATTCTA CTGCTTTTGC 
149851 TCCCGCAGTT ATAGGATCTT TCAGGGAGCA AATTAAAGAC TTTCTTAATG 
149901 CTGCTGTTAG AGCATCCACT GTAGCCATAG GAACATATTT CGCAATGGCT 
149951 AAACATCCTA AAGGAAGGGG AAAGATGGTC TTACGGCGCC ATAGCTCTCC 
150001 AAAGTCTGCC CGCAATGTCA ATTGGAGATC GTAGCTGAAG CGCTCTTCAT 
150051 GAATCAGAGC GCCTCCATCG ACTTTCCCTT GCAGTATCGC GGATAGAATT 
150101 TTGTCATAAG GCATGGGAAT GAGTTTTGCC TTGGGATAGT AAAGTTTACA 
150151 GAGAGCATGA GCGGTTGTCA TCTCTCCAGG AGTTGCCAAG GTATCTAGAG 
150201 AACATTCAGG ATCTAAGGAG AGGACGATAG GACCGCTGTT GTATCCTAAG 
150251 GTATTTCCTA CGTCCATAAG ATTATAATAA TCAGAAACTA GAGGGAAGAG 
150301 CGCTGCTGAC ATTTTCATTA GGGAGAGCCG TCGCTGCAGA GCTAGGGTAT 
150351 TCAAAGTTTC AATATCCGCA ATTGTTACCT GGTTAAGAAG AGGCCTGAAT 
150401 TGGGGGTCTT TTAAGAAAGA ACGAAAAAGG AAAATATCAT TCGGGCAAGG 
150451 AGAAAAGGCA GCAGTCAGTA TCATGTCGGT TGATGTAATA GAGCTATAGC 
150501 GGCTTTGATG TCTTTATTTT CAGGCTTATC CAAAGCTCCT TGGTTTTCCA 
150551 GCCATTCGAA GTAAGACTTA GGAATATCCA CAAGAGGCTG CCCTTTGTAT 
150601 TTGCCAAAAG GCATTTTGAA GACTTTCGGG TGATAGCTCT GTTGCAGCAA 
150651 GTCGAGGACT TGCTGGGGCG GTAAATCACC GATTAAAGAA GTAAATACCT 
150701 TGTGCAATAT CACTACGTCA TCTAGAGCTC GGTGTGCTTG ATTTTCAGCA 
150751 AAACCGTAAA CTTGTCTTAG GTATTGTAAA TTATGTTTTG GTAGATCGGG 
150801 GCGATATTTT TGTGCCCATT TTAGAGAGTC TATTGTACGG TTTGTCAGAG 
150851 GCTCTAAGGA ATGTCTGCGA CATTCCTTAC CGAGTAGGGG GAAATCAAAA 
150901 CCGTCATTAT TATGAGCCAC TAAGATGCTG TCCTCTCCGC AAAATTTCCT 
150951 AAATCCCTCG TAGGCTTCAG GAAATTTGGG AGCAGAAAGT ACCGCATCCG 
151001 TAGTGATTCC ATGAATTTTG GATGCCTCAT CAGGAATGGG AATTTCCGGA 
151051 TTCACATAAG TAAGAAAGGA CTCATCTGTG ACACTATTGT AGGCAGCAAT 
151101 TTCTATAATG CGATCTCTTT CTATTTGTGT TCCTGTGGTC TCCGTATCAT 
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151151 AGAAAATAAG AACATCCATA GTTTGACTAC TCATTACGTT TTTCTTGTTG 
151201 CTCTTGAAGA GCCTGACGTC TTAGTTCATC CAAATTCATA TTCCCAGAAG 
151251 AGATCAACCC AATAGCATGA GAAAAACTAT CACAGACTAG CTTTATTGTA 
151301 TCGATATATA TCCGTAATAG TGTGTCATGA ATTTCTCCGT TTAGGCAGGG 
151351 CAACACAAGC CGATAAAATA TCAATCCCTG TTCTTCATCC ATGCCAAAGC 
151401 CGGGAATATC AATGTCCCTA TTTAAGAGAT GGAGTAAACG AGCTGTTGAT 
151451 GCCTTATGAG ATTCATGCAA TTGGTAGGGA AGGTAACAAA TCAACTGCAG 
151501 TATTTCTCCC TCACTGCGGA TTACAAAAAA TAAAGGGAGT TCATTGCCAT 
151551 TAGCTTGAAT GTTAATGTAA GTAAGACCGC TTTCTCTTTC TAAGAAAGGT 
151601 TCTTCATCCG AACTTTTAAG AAATTTTGTG AGATTATTTT GATTTAATGT 
151651 CCATGTCGTC ATTTAGGAAA TACTCCAAGT TGTTCCTAGA GCCTGCATCA 
151701 TTGCTGGCTG ATAATACTAG ATCTAATCTT GATTCGTCTG TTGTTTTTTT 
151751 TGTATCTTTT GTATTGCTTT ACTGAGGAAA TCAGAGGTTG CAGAGGAAGA 
151801 AGACTGCTTA TTATTTTCTT GAAGTAGATG ATTGATATCA GTGTCGGGAG 
151851 GAAACTCTGT CAGTAGCGTA ATTTTTTCTG GTGTTTCTGC AATCACAAGA 
151901 GTTTTATTCA CAACTCGAAT GAGGTAAATA GAAGTTTTCG GCGTTAGGGA 
151951 ACGTCGTTCT AGGATTTTGA TTTGAGACGA GCCTCCAAAA CCGTGACTTC 
152001 TTGATCTCAC AAACTTTTTA AACGCCCAAA CTCCAAAGCC AAAAATTGTT 
152051 AAAAGTAGAA TCAAAGATCC TAGCATTTTA AACATTTCTA ATTTCATGCT 
152101 TCCTGGGAAC ATTTCATGTA CAGAAATGGG CTCTTGGATC GTTTCTGCAA 
152151 GAGCAAGCTC ATCAGAAAGC TTAAAAACTA AAGAAAAAAG ATTAAAAAAC 
152201 ATGTGTAAGA CCCGCGATCA TCTCTATAAA ATTATAGTGG TAGCCCGATT 

152251 TGTATCCAAC TACATACAAG TAATAATGAA GTATAGTTTT AGTCGATGCT 
152301 ATATAAATTA TAGTACAATG ATTTCCAAGT ACAGATTAAA CCGTAATCAT 

152351 GTATATTCCT GCAAGTACCG TCTAGAGAGC TCCCCTAGAT GATTTGGGTA 

152401 ATTCACAGAC TCCTTTATAC CCTTCTAGGG TGTGCTCGTT CCACAGAGCC 
152451 CAAGCTCTTG TCTTTCATAG ACAAAACGAC AGCAGTCTGT CGGTGGATGC 
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152501 
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152551 
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152901 
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152951 
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153201 
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A PTTT 


ATPATGGAAA 


153251 


GAAACCCTTT 


GTCATGGCTT 
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i IAAIoA 1 


PTPPATPTTA 


153301 


TTGAGAGAGG 


AGATTTTAAA 


ATTGTGATGA 


1 1 V-t-^J\_A^~V■. 


TPAAPAC1ATG 


153351 


CACGTTCATG 


CGGACTTTCT 


AGCTCCCCCA 


t>L AVjA 1 AAOA 


ATPPPATTPT 


153401 


TTCAGCTTGG 


GAAGCTGGTG 


TCCGCTATTA 


rp/ - * » /^>/~< * /"»PTT 


ATP APTPTTT 


153451 


AGGGAACATC 


TCAGGACCAA 


TTCCCATCAC 


% mTr» P PTPP P 


TP ATPTAPPT 


153501 


ATAAGGTCTC 


ACCAGTAATT 


GCTGAAGCTA 




TA APAAAflPT 
1 AAUAnnOV. a 


153551 


GCAACGGCAC 


CCACCTGCTC 


GGCATTCATA 


bLC J. 


TAPPPPPCCA 


153601 CTCTTGGTAA TAGTCTACCA TTCTTTCAAT AAAACCAATT GCTTTTCCAG 
153651 CTCGGCTTGC TAAAGGTCCT GCAGAGATGG TATTGACACG TATGCCCCAA 
153701 CGGCGTCCCG CTTCCCAAGC AAGAGTTTTG GTGTCACTTT CCAAAGCTGC 
153751 TTTTGCCGAA CTCATGCCCC CTCCGTATCC AGGAACAGCG CGCATAGAAG 
153801 CCAAATAGGT GAGCGATATT GTCGATCCAC CACGGTTCAT GATACTTCCA 




153851 AAGTGAGAGA GAAGGCTAAC AAAAGAATAA CTAGAGGCAC TGAGAGCCGC 

153901 TAAGTAACCT TTTCTTGATG TTTCTAATAG AGACTTAGAA ATTTCAGGAC 

153951 TATTTGCCAG CGAGTGGACA AGAATGTCAA TATGACCAAA ATCTTTTTTT 

154001 ACCTGTTCTG CGACTTCTGA TATCGTGAAT CCCGTAATGC CCTTGTAACG 

154051 TTTATTTTCA GCAATATCTT CAGGAACATC TTCAGGGCTA TCAAAACTTG 

154101 CGTCCATGGG ATAGATCTTA GCAATCTCTA AGAGAGTGCC ATTCGATAAT 

154151 TTTCTAGATT CATTGAATTT TCCTAATTCC CAAGACTGAG AGAAAATTTT 

154201 GTAAATCGGT ACCCATGTTC CTACAATAAT CGTAGCTCCT GCTTCTGCAA 

154251 GAAGTTTAGC AATACCCCAG CCATATCCTT GGTCATCACC AATGCCCGCA 

154301 ACAAATGCTA CCTTTCCTGT TAGATCAATC TTTAGCATGA ATCCGCCTTA 

154351 TACTTTTGAA GCTTATTGGA AGGAGAGTAA CAAATCTTTC GATTATTAAG 

154401 AAAACCTTTT GGTGCCTCAA CAGGGGAGAT CCTGCCTCCA ATGTAAATAG 

154451 AAACGTAAAT TCTTTAAATT TTTTTCTTTA CATATTTTAT AGAATATCCA 

154501 AACTTCTCAC TCCCGCGTAC TGCTAAAAAA ATTTTCAAAA GAATTTACGA 

154551 TCCGAACTTA TCGTAGTTTG GGTTTCACTG ATTACTTAGG AGGTTGTTTG 

154601 ACGAATCCTT TAGGGAAATT CCCCTCACCA CAGAATCCAC AGGTTGTTAC 

154651 GATAGCGCCT TCTTCCACAA CACCACAAGC AGTCTCATCT GCAGTTCAAG 

154701 GTTTTCTTCA AACTGGAGGA GCTGCCTCCT CTACAGCGAC AACTACTACC 

154751 GCATCCGGAG CCTCTGCATT AGGACTTTCA CCTGATCAAG TGCAAGCGTT 

154801 GCTTACTAAT TTATTAAATG TGGGACAACC ATCAGTGGGA CAACCATCAA 

154851 CTTCAGCAGG AACTTCGGGA GCCTCCTCTT CCAGTGCAAG TATGCAGCAA 

154901 CAGCTTTTGC AACTTATCTT AGACAAGACA ACAGGAAGTG GCGGATCGTC 

154951 CGTGAGTTCA GAGCAATTAC AGCAACTCCT TAGCTTGGTG AGCCAGATGA 

155001 CTACGTCTCA AGGAGGAAGT GGTGGAACTC AGGCAGGACA GGCCGCTTCG 

155051 GTACTGTTGA ATTTGTTATC GGCAACAGGA TCTGCAGCAG CAAATCCTTT 

155101 AGGGACAGCT GCATCGTTGG CACAGATCAT TTATGCAGCA GTAACAAGTC 

155151 CTGGAGCAAA GAAAACTAGC GAATTTTGTT ATAATTATTG TGGAGAGACC 
155201 TGCCAAGGCA ACTGCGGTTG TCCTACCTGT GGCTGTCCAG ACGGACAGTG 
155251 CGGTTGTGGA GGATTTGGCC GTTTTTTCTG TGGTGTATGG AAAAATTGTT 
155301 GCGGGATAGG AGAGGGATCC CAAGAACCCG CAATCCCTTT ATAAGAACTC 
155351 GAGGCTTTAG AACAGAAATA TGGCAAGGCT GTTCTTTTAA TTGCGTTAAG 
155401 TGAGCTTGGC ATTGATACCA TGAGCTTATT ATCAGGACAT CGACTCGAGG 
155451 GATTTCCTCC AATCGCGGAG GTCATGGCTG CATGTGACCG GTGTTCTATG 
155501 GACTTTTGTG AGATCTTGAA GTCTCAAAGC ATGGATCTGT GGGCGGATGC 
155551 GGCGAGTTGT GTGGATGGTT TATTACAGGA TCCTTTTTGG AGTACAGCAA 
155601 TTGCCTCAGG GATTGCTAAG TCTTCTCTTC AGGAAACGGA ATTCGAGTGT 
155651 GAAAGCAAAG TGATGGTTCT TTCTTCATGG GGAGAGCAAG GAGCACAGGT 
155701 TTGTAGCCCT TTTAACCTAG AGAGGATATG TATGTCTTTC CCATCACTTA 
155751 AGGTCTTCTC CCTTAAAAAG AACGGGTGCG AGAACATGGG AATCCAGTTG 
155801 TCTGCATCCT GCATGAATCT ATTAATGTCT ATTTTCTTTG TAGCTACCAA 
155851 TGGAGGAAGC ACTCCGATTT GGATCACCAA AGAAAATCTG ATGGCGTTAG 
155901 TTGCTTTGGT TTTATCTCAC TATCAATGTT ATTTTGTCCC AGCCACAGGA 
155951 GATCCCCAAC GTGGCAACAT TTTAGGTAAT CCAGAAGTCA ATGCTATTTT 
156001 GGCTCGGGGG ATGGGCATGC GTGTCGATCT GGAAAGGAAG CGAGGGGGAG 
156051 AATCTTCCTC GTCACGCTAT TTAGAATTAG CTGCACGATG TTTTGAGAAT 
156101 TCTCTTACGA AAACAAGTTT GTTAAGCGAT GCTAACAATG TTCAAGAAAG 
156151 AGATAAGTGC CTACTACAGA TGTCAACTTC ATTGATGCAT ACGGCGGGAC 
156201 TAAATTTACA ACGCCCCCCT GTACCCACAC CTTCTGGAGT CACGGCACAT 
156251 CCGCAACCTC AACCAGATCC TGTGGTTACG TCTCAACCTT CTTTATTAGG 
156301 TGCTAGAGAG CGTTCCCCTG TGTCTTCTAG AGGGCGTTTT CCTGTAGTTT 
156351 TACCTTTAAG TGTGATTTCT CCTAGGTCGC ACCCCGGAAG GGTAGAAAGG 
156401 CGGGATTTAG AAGATGAAGA AGAGGAGGTT ATGTTTTGAA GCAGTGTAAA 
156451 CGACTCCAAT TACAGTTTTA TGAATCTCTA ATTGTAAAGT TCTAGGGGTT 
156501 TTTCTTGAAG TAAGTGCCGA GCACATTCTC TAGGATCTTC GGTTGATGAC 
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156551 GCACAAATTT TTAGGGGAAG ATTTGTGAAT GGGGAGATAA ATTCTAGGGA 

156601 GTGAGCATGG AGGAGAGGGC GGAAGATCTG GGGAGGCTGT TCTTTAGGTC 

156651 CGTAGTCGAC ATCTCCGACA ATAGGATGAC CCAGCAATCC CATTTGTAAG 

156701 CGGATTTGAT GGGTTCTCCC TGTGATGGGC CTGCGGCTCC AAAAATCACA 

156751 GCTCCACACC TCCGGTATAC GGGGGCCGTA TAAGATTTTA CGGTTCCAAA 

156801 TTTTTTTTTA GGATGACCAA AAACGAAAGC TATGTATTGT TTATGGATTT 

156851 TTCTTTGCTT GAACAATTTC ATGAGCTCAG TAGCCGCTTG TTTAGACTTT 

156901 CCCATGAGAA GACACCCAGA GGTGCCTTTG TCTAACCTAT GCACAGTAAA 

156951 AAACCGTGTC ATGTGTGCCA TTTGTTCAGT AGTAAGATGG GGAGGTTTTT 

157001 CGTAGATAAT GCTATAGTCA TCCTCCCAGA GGATGCTAGG TTGTTGTTTT 

157051 GTTGAGGGGA TCAGAGATAG GGAAACACGG TCGCCAGGTT GTACCTTGTA 

157101 GGATTCAAAT CTTTCTATGA ACCCGTTCAC TCGACATCGA TGTTGGCGAA 

157151 TAGACGCCAA GATTTCTTGC TTGCTATGAT TAGGCAGTTG AGATCTAAGA 

157201 AAAGAAGATA ATCTTGAGAC TTGTGTGGCA AGCCAGGAAA AATTTTCCAT 

157251 AAAATATTGT AAAGCAGCCC TTTTATCATT TGATAATTGC ATAAAATTTT 
157301 AAGAGATTTT GTATGACAAA GATAGCTTTT TCTGAAAAGG CAAAGAATTT 

157351 TCCTGTAGAG GCATTAAAAA AATGGTTTGA AAAAAATAAA CGATCTCTTC 
157401 CTTGGAGAGA TAACCCGACT CCCTATAGTG TGTGGGTTTC CGAAGTTATG 
157451 CTACAGCAAA CGCGAGCTGA AGTTGTTATA GATTATTTTA ATCAGTGGAT 
157501 GGAGAGATTT CCTACCATAG AGTCTTTAGC TGCAGCAAAA GAAGAAGATG 
157551 TCATTAAGTT ATGGGAGGGA TTGGGTTATT ATTCTCGAGC GCGCCATCTT 
157601 TTAGAGGGAG CTCGCATGGT TATGGAGGAG TTTCATGGAA AGATCCCTGA 
157651 TGATGCCATT TCCTTAGCTC AAATTCGTGG AGTTGGTCCT TATACGGTTC 
157701 ATGCTATTCT AGCCTTTGCT TTTAAGAGGC GTGCTGCTGC TGTGGATGGC 
157751 AATGTCTTGC GTGTTCTTAG CCGGATATTT TTGATAGAAA CTTCTATAGA 
157801 CTTAGAATCA ACTCGTACTT GGGTTTCTAG GATTGCTCAA GCGCTTCTTC 
157851 CTCATAAGAG TCCCGAGGTT ATAGCTGAGG CTCTGATAGA GTTGGGAGCT 
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111 o aaaaaa 


P AP A AT AT AT 
\_ rioriri 1 ri X ri 1 


GCTATCAAAG 


AAGGGTAAGT 


TGGGGGCCTT 


1 C Q O A 1 


1 IflAoAoAAu 


P A APPTPPP A 


ATCGGGTCAG 


GACTGGAAGG 


TAGCAGCCCT 


158951 


AAGGAGAGI 1 


mm P* mmmm PO m 

i ili n i ui i 


A AAAGAATGT 
/\rWAriVJrvri i vj l 


TCTCCAACTT 


ACTCTTTTTA 


159001 CTTTATTCCC AAAAATAGCA ATGAGGTGAG GTTAAACAAC CCGTGCAGTG 
159051 CAATGGGAGA AAGAATGTGC CGATCTTTTT CATATAGAAA CCCTGCAGAT 
159101 AAGGAAAAAA CAAAGAGCAC GGGGACAAAG ACCCAACTTC CTAAAGAGTG 
159151 TTCAATGTGA ATGAAAGAGA AAATAATAGA AGAGCATAGT ACCGCAGCTA 
159201 TGCGCGTCAT TTTGTTTTTC AAGAATGTCT GTAGAATTCC TCTAAAAAAT 
159251 ACCTCTTCTC CAAATGGAGT GAGGACGCCT AAATTTAGAA TCATGCTAAT 
159301 GTAGTGTCCT GTTATAGGCA GAGAGTTCTG AACTTCTTGA GTGACTTCTT 
159351 GTGTGTGAAT CTCTTGCGTA GGAAGAACCA AAGTTAAAAA TTTACTCATC 
159401 ATAATCCCAA TCAGTTGTGT TACTGGGATG ATGATGATCC ACATTCTGAT 
159451 GGCAGATCCT AGAGCACGCC ATGAAGTTTT AACCGGTCTT TCTCCAGAGA 
159501 AAAGTATAGC ACGTGTGATA TCCTTGGGGA GAAAAAGCAG GTAGAACAGA 
159551 AATGCAAAGG CAAGGCTAAT TCCTGTCATG GTGGAAAGTA ATTCCGCAGT 
159601 TTGTGAGCTC ACGCTAAGAG CTACAAGGGA AGAAAAAACA AGAAGAGCAC 
159651 CACCAAATAA AACTTGGCGG AGCTTTAAAG GTGTTTTCCC AGAGGGTGCT 
159701 GGCCAGATAA AGAAGTTTTT GGAAGCTAGA GCAGCGACGC CAAGGGACAA 
159751 GAGAAGAATA AACTTGGACA TTTCCTTAGA CTACGAGTAG TTAGCACAAA 
159801 CATAGCCCTC AACTCTGGCA ACAACTTCGC GGAAAAGACG GCTATGCATT 
159851 AAGCCCATGG GCGTTGAATC AAAATGTTTT GAGTTCCAGC CATAGCGATG 
159901 ATAATCATTG ACTAGGGTAG TTAAAGGCTG GCTGCATTCG ATAATCTCTT 
159951 GATAAATGAG AGCTATTTTA TGATGACGGA TATCAAAAAC GCGAACACGT 
160001 ACAGACGCTG TTACAGAATC GACACCTGCT TCTTTCCCTG TCTTTTGTTC 
160051 TAACAGTTCT GTAGCAACAA TGAATTCTGC AGGAAGAAAT TGCTCAATAA 
160101 TTGTTTCGGG TAGACGATTC GCAATCGGAG CATAGAACTG AGAGACTGTC 
160151 TGAGGTGAAG CATTGTGCTT GATCAGGAAG ACCTTTTCCG AAGCATAAAA 
160201 CCTTTTGCTG ATCTCTTCAG TAAATTCTCC TTGGAGGTTC CAAGGTAAAG 
160251 GTTCAAGACT CTTTCCTGGG CGATGAAATA CAGGAAGCAT CGCAATCACA 
160301 CCTTTAGTTT TGCTCCCTGA AGTGTATAGC TTAGGATGAT AACTTCCTGA 
160351 AGAGCCTAAG TGAGTGCAGC TGGATAGGGT TGGGGATAGA AGTCCTAAAG 
160401 ATGCCAATAA TACCAACATT TTTCGCATAG TCACTGTCCT TAAATTGCTT 
160451 ATTTTGCAAA AGATTCTAGC CCTGGGAAAG TTTTTACTTT TAAGATCAAT 
160501 ACTTTCGCAA TTGAGAGATT TTCCATTTAA AACTCTCATT AGCTTATATC 
160551 AAAGAAAAAA ATAAAAACAA GCAAAGAGAC CGTCTCAGTT TTAGTTTAGA 
160601 AACTCAAGGT TGAGAAAGGG ATTCTGACCA AAGTTGTGAG GGAACi I I Go 
160651 TAACTTTTTC TTTAGGAATC AATGTGCACC CTGGGAGGAA i ACAAC I Go A 
160701 GCTGAGATCA CAGAAAAAAT AAACCACTTC AT GAG TAG C 1 G iGCAAAAMG 
160751 AATACTACTA TTTTCTATTT GCATAGGCAG TTcTTCG AT I 1 A A AGGAA 1 1 
160801 TTTACTTTAT CATCATAAAA ACTATGATGA AAAGGT rc I 1 AG 1 GAAG I 1 G 
160851 TAAGGAACAC CATGAGTTTG TGATTAAAAC TCATGGTGCG TAG I iGAACC 
160901 CTTATTGAGG AGGGACGCAA CCACTAAGAG CTAACAATAC I AAAG AAGA 1 
160951 AATAGTAAAG AGAATGCTTT TTTCATTATT CATCCTTAGG I I AA J 1 GAG G 
161001 TTTCCAGCCT AGCTCTAGGG CGATTATTTA TCAAATTTTT CTT1GI AAA 1 


161051 AATGATCATG CGACCATTAA TTTAGCGATA AATTATGATT TgGTgAGGAA 
161101 AATTCAATTC TTTATAATAA TGATATGAAA TTAGAGAATG TCTATAGGGG 
161151 CGGACTCTAT TTGTGATCCA GGATCTCTTT AGGAGCACTT TGTGGATTTT 
161201 GATTATTTTG GTCTGAGTGA TATTGGTAGG GTGCGCGCTA GAAATGAAGA 


161251 TTTTTGGCAG GTAAACCTCA TGTCTCAAGT GGTTGCTATT GCTGAgGG ig 
161301 TTGGGGGGCG TCTTGGTGGA GACATTGCTT CTCAAGAGGC AGTG AC I AGG 
161351 CTTATGGAGC TGATTGATGA GCAACAGTCA AAATTGATGG GGTA I GGGGA 
161401 TGACCAGTAT AAGGAGACTT TAAAAAAGAT CCTTTTAGAG G i G AA 1 GG i G 
161451 TGGTCTATGA ACACGGCCAA ATGGAAGAGC ATCTCCAGGG TAiGGGAAGG 
161501 ACTCTTAGCT TCATCCAATT CCGGAAGGAT AGGGCATGGC 1 A i 1 1 CA 1G 1 
161551 GGGAGATAGT CGAATTTATC GTATTCGTGA GGGAGAACTG ggcggcc 1 I a 
161601 CCGAAGACCA TTCTTTAGAA AATCAATTAA AAAATCGTTA TGGGGI 1 GC J 
161651 AAACAATCAG ATAAGGTGTA TTCTTATCGC CATATTCTGA CTAATG rl 1 1 


161701 GGGAAGTCGT CCCTATGTCA TGCCTGACAT TCGGAATCTT CCTTGTGAAA 
161751 AGGAAGATTT GTACTGCCTC TGTTCGGATG GATTGACAAA CATGGTTCCA 
161801 GATATCGATA TTCGTGATAT CTTGAACCAG CCCGCCACCC TAGAAGAACG 
161851 GGGGAATGCA TTAATTTCTC TAGCCAATAC TCGTGGAGGC GATGACAACG 
161901 CTACTGTCGT ATTAGTCCGA ATACAATAGT TCCTTTGCTA AGGATAGTAT 
161951 TCCATGATCT ATTTGGATAA CAATGCGATG ACACCCCGAG AVjAVjwj w\U 1 
162001 TTTGGAATTT CTCCAAAAAA CCTTCCTTAT AGAAGGGACG 1 ACGGGAA i G 
162051 CTTCGAGCGT CCATCAATTA GGTAAAAAAT CTCGTCAACT GGTTCTAGAA 
162101 GCTTCACACT GGATGCAAAA GGTCCTTTCG TTTCAGGGCC GTGTCCTCTA 
162151 TACCTCAGGG GCTACTGAGA GTTTAAATTT AGCAATAGCA AGCCTCCCTA 
162201 AAGACAGTCA TGTTATCACC TCAGGTAGCG AACACCCCGC CAU CT. I AGAG 
162251 CCTTTAAAAC ATTCCTCGCT TTCCGTTTCT TATTTAAATC L G G AA G AA GG 
162301 GAGATGTGTT CTTACTATAG AGCAGATTGA AAGAGCTGTG ACTCC I AAAA 
162351 CTTCAGCAAT CATCTTAGGT TGGGTCAATA GTGAGACTGG TGCCAAAGGT 
162401 GATATAGCTG CTATAGCCCA CTTCGCGCAA GAACGACAAT TGC AATTTA I 
162451 TGTGGATGCG ACTGCAAATG TAGGTAAGGA GAGGATAGTT CTTCCCTCTG 
162501 GTGTCACTAT GGCAGCATTC AGTGGACATA AATTTCATGC AC 1 C 1 C 1 GG A 
162551 ATCGGAGCTC TTCTGGTCTC TCCAGGAGTC AAACTACATC CTCAGC 1 G 1 G 
162601 GGGAGGAGGT CAGCAAGGAG GGCTGCGCGC AGGCACAGAA AATCTTTGGG 
162651 GAATCGCCTC TCTGCTTTAT ATTTTCAAAT ACCTAGATCT TCATCAAGAG 
162701 CGTATCTCTC AGGAAATTCT TACCCATAGA AATGGTTTTG AAAAGGG AA 1 
162751 CAAAGCACGC ATTCCTGATG TCCATATTCA TTGTGCGGAT CAACCACGGG 
162801 CAAACAACGT CTCAGCAATT GCTTTCCCTC CGTTGGAAGG TG AGG I A 1 I G 
162851 CAAATCGCCT TAGATATAGA AGGAGTGGCT TGTGGTTATG GA 1 GGGL-A I o 
162901 CTCTTCAGGT GCTACCGCAC CCTTTAAATC TCTTGTCAGC A 1 GGG ibi lo 
162951 ATGAAGAGTT GACCCTGGCA ACACTCAGGT TTTCTTTTAG CCATCTTC I G 
163001 TTGCAAGAAG ATGTTGAAAG AGCCGTTGGA ATTATAGAAA AAGTCG1 AGA 
163051 ACGTTTGAAA AATTCCTAAG TCTTAAAAGA GAACATGTTT CTAAGCTGAA 
163101 AGAACACTCC TGACTCTTAT TGCAGAATCT ATGAGAGTAA GTTTTTAATC 
163151 GATACGGTTT TTATCCCAGA TAAAGACATC TCTTTAACTT CTAAAAGCAA 
163201 GTTGTTGATA ATTACAGAAG TTCCTACTTC TGCAGGACTG TCTAGCAGTT 
163251 GCAATACCAA TTGGGCTAGG GTTTCTACAG GATATTGCGG AAATTGAATA 
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163301 TCGAGTTCTT TTTGCAGATC TTTTATGCGA GAGTTGCCAG GAAACGTTCT 

163351 TTCAATAACA GAGATGGTCT TGGGTTTTAA ATGAGCAATG TTTGTAGTGT 

163401 TGAATAAGAT TTTGAAAATT GCATTTAAAC TAAGAATACC TATAGGTTCA 

163451 CCAGAAGCAT TGAGGACAAC AGCAACACTC GAACGGTTGT CTCGAAACTC

163501 TTTGAGGATA CGAATAAGTT TTGATTTTGC AGTGATAAAC CAAGGCGAGT 

163551 GTAGATTATT GATTAGGGGT TCATCAAGAG CTTTATTGAC AAAGTCTTTA 

163601 GGATGGGCAA TCCCAATAAC GTTTTTTCGG GCCTTGTGAT AGACAGGAAT 

163651 AAAGTTGATA TCTGTATTTT TTATAGTCCG GCAAAAATCT TTAACATTTG 

163701 CAGAAGAAGG AAGCATGGTA ACCTGTTCTA AAGGTTGGCA TACCTGATCT 

163751 GCACAAGTCG CACTTAAAGA GAAAATATTT GTAGCAATTG TATTGAAATC 

163801 TTGTTCTTCA TGGTGAGTCT CTAAAGCTTT TTGGAACTCG TCTCTACTTA 

163851 ATGTAGAGTT CAATTTTTCT TTCCTAATAT TTAGAAGATA GTAAAGACCC 

163901 TCAGTGAGAC TTCCTATGAG CTGAATCAGA GGATAGAAAA TATAGTGGGA 

163951 ATAATAGAGA ATCGGTGCTC CCCAAAGTGC TAATTTTTCA GGAATCTTCC 

164001 GTGATATTGT TAGAGGTAGA AGTTCTGCAA AAATCACAAC TATAAAAATT 

164051 TGAGTGAAAG GAGCGTAATC TGGAGTGATT CCTAAAGCTC GATAGCAATT 

164101 TCTTGAGGAC TCAGACCCGA CTTGTAGAGC GATATTCACT CCTAACATCA 

164151 CCGTTCCAAA TAAACGATAG GGGCGGCGAA TCAGGAAATT AATGTAGCGA 

164201 GCTTTCTTAT GATCTTTAGT CAGATAGTAT TGCAATCGTA CACGGTTAAA 

164251 TGACACGCAG GCCATTTCCA TCATCGAATA GAATCCTTGT AAGACAATAC 

164301 AGATAATGTT GACTCCTATC CAAAAGAGAG CAGAATTAGT CATACAATTT 

164351 CCTTATATAC ACACGGCGAA TGCGATTCGG AGCAGCGTCT AATACCTGGA
164401 AAAGCAAGTT ATTCCAAGAG AGTTTCATTC CTGTTGTCGG AATCGTTCCG

164451 ATTTGCTCTA TTAACCAGCC TCCTATAGTC GCAATATTAT TGTTCGTCGG
164501 TAGGTTGATA TCGAAGATCT CACTAAACTC ACGGAGTTCT AAAGTTCCTG 
164551 AGGCAATAAT AACATCAGCT CCTGAGGTGG TATAGAGTAT TTTATTATCT 
164601 CTCTGGTCTA CAATTTCTCC AGGAACAATT TCAAAGAGGT CTTCTTGAGT

164651 GATCAATCCT TCAATAGATC CGTATTCATC AATGATCATC CCTAGGGTTT

164701 CGTCTTCAGC TGCCATCTGA CATAAAGCCA TTTTTGCAGA GATGGTTTCT

164751 GGCATATAAT ACGGTTTTTT CAGCAAGGGG AGGAGATCAT CCGAAGATTG

164801 CAGTGGCTTG TCATGTAAAA GAAGAGAGCG CGCTGTGCAA ATGCCCAGAA

164851 GGTTTTGGAG GTTATCGTTA CATATAGGAA CTCGTGAGCA ATGCTGTTTA

164901 GAAAATAAAA GATAGAGGTT CTCTAAAGGG GTTTGGATAT CATAAAATAA

164951 AATATCCTGG CGTGGCTGCA TACGCTCTTT AACACTACAA TCACTAAGAG

165001 AAAGATAACC ATAGAGTAAA CGGCTTTCTT CTTGATTGAC TACGCCGAAA

165051 TCCTTACAAC TTTGCAATAC TTCCTTCAGC TCTTGGGGTT GGATGATATC

165101 AATCTGTTGC TTCGATAAAA TCCATTGGAC CACATAATTA ATTCCTACGA

165151 TACCCCAGTG GAGTAGGGGT TTGAAGATTT TAGTAACACA AAGAATAAGA

165201 GGGGCTACGG AACTAGCAAT CTGTGTATTA AAAGGAAGAG CTACTGCTTT

165251 AGGGAGAATC TCACCTAAGA TCAAAGTAAT TGCTAAAGGA AGACCTACAG

165301 TAAACCACCA CGAAGCTGCA TCTCCAAATA GAATGGCAAA ACAGTTTTGA

165351 ATAGCAATAT TCAGTCCGAT ATCACAAAAA ATTAAGGTGA TGAGCAGGTG

165401 GTGGGGATGT AGAAGAAGGG TAGCTACTCG CTGCTGTTTC TTAGATTTAG

165451 AGCGCTTATA GTGCGAGATC AAACTCGTAG GCAAAGAAAA CAAAGCAATT

165501 TGAGATAACG AAATGAATCC CGAGCATAAA GTAAAACAGA TAATGAAGAA

165551 CATTAACATG GTAGGAATCA TGGTCTCTTT TCAGTCCTTA TTTTCTGATT

165601 GTTGCTTTGG GGAGACACAG AGTTTCTTAT AGCCTTTAGG AATGAGACCC

165651 AGACGCTGGA TTAAAGCAGC TTCTATAGCA GCGGAGTCTT GCCAGTGTTG

165701 CAGATGTAAT TGGAGTTGAC GCTGCTTTTC TTGAGCAGAA AGAATGTCTT

165751 GGCATAAAGA AGAGACCTTG CTTTGTAAGC GTAGCTCTTC TGTACGTAAC

165801 TCCTGGATAG CACGATCATA AACAAAGCCT CCAATTAAGA TGCTAAAGAT

165851 CACCCACCAG GATTTGATCA TCACTTCTTC TAGTAATCTA AAACCCCAGT

165901 TTTTTTTTCT TACTGAAACT TTAGACACAA GGTACGGTGA TGCCTTGTTG

165951 CTTTTGATAC TTTCCTTTTC TGTCTGCATA AGAAACCTCG CAGGTCGTAC

166001 TAGACTCAAA GAATAAGACC TGGGCAATCC CTTCATTAGC GTAAATTTTC

166051 GCTGGCAATG GCGTAGTGTT AGAAATTTCT ATAGTCACAT GCCCTTCCCA 

166101 TTCAGGCTCA AAAGGTGTGA CATTTACGAT AATTCCACAG CGTGCATATG 

166151 TAGACTTTCC TATACACATT GTTAAGACAT TTCTAGGAAT TCGGAAATAC 

166201 TCAACGCTAC GAGCTAGAGC AAAAGAATTT GGAGGAACAA TACAGACGTC 

166251 ATCAGTAATA GAGATGAAGA TATCCTCAGT AAAGCATTTT GGATCAACAA 

166301 CAGAGTTATA GACATTGGTG AACACTTTGA ATTCTCGAGA TAGGCGGAGG 

166351 TCGTAACCAT AACTCGATAG GCCGTAACTT ATAAGTTTTT CGCCTGTCTC 

166401 CTCATTTACG TTCACTTGGC CATTAACAAA GGGATGGATC ATATCGGCAT 

166451 TTAGGGCCAT CTCTCGTATC CACTTATCTT CTTTTATGCT CATTTAGAAA 

166501 CCTTAACAGT TTGAAATTGC TTTCTTAATG ATATTCTGTT TTTCAATTTA 

166551 CTGGTTTTTG GGGGGAACTT TTCTAAGTAT AAGATAGACT TTGATTATCT 

166601 CTTGAAAAGA CCAGTTGTAT AAACAAGAAA AGCCTATCCC AAAGGCTACA 

166651 ATTTTATCAC GAAACCTAGA GGTAATGTTA GATAATCCTA AGGGAAAAAG 

166701 GCAAACCTTA TTTTTAGGGA GAACTTCAGG TAGGTCTGCT CTTTACTCTT 

166751 ATAGTAGAAG AATCTTGGTT CTCTTGAATG CATTCATGCG AGGACCTTGA 

166801 TAAGAACTTC TTGGATTCAT AAAAAGATTA ACATCTCCTT ATTGATAAGC 

166851 TAGAGAATTT TTACTACCAA CTTCTCAGTG GAAAATGTTT TTAAAAATAG 

166901 TTCGCCATCT TTAATTTATC TGTTTTAAGA CAAAAGAAAT CTAGATCACC 

166951 ACAGGAAGTT TAAATCATAA AATGAAAATG ATGGAGAGGT TCTAGTGCTC 

167001 GTACTTTGGC CCTGCTCTCC TTGATAGAAA GAAGAGGTCC ATAGTGTACT 

167051 TCTATATAGT ATCTCGTGTA CTATGCCGAG TATAACCGAT CGGCGTTATC 

167101 GATGAGAGTT TCAAAAAAAT ATAAAATCCA CCTAAAGAAA AAGCGATAGA 

167151 GAAGGTTCGT ACATGACGCA TCAAGTAGCT GTCTTGCATC AGGATAAAAA 

167201 ATTTGATGTT TCGTTAAGAC CTAAAGGGTT AGAAGAATTT TATGGACAGC

167251 ATCATTTAAA AGAACGCCTA GATCTATTTC TTTGCGCAGC ATTGCAACGA 

167301 GGAGAAGTTC CAGGACATTG CTTGTTTTTT GGACCCCCAG GCTTAGGGAA

167351 AACCTCACTT GCTCACATCG TTGCCTACAC CGTGGGGAAA GGGCTGGTCT

167401 TGGCATCAGG GCCTCAGTTA ATCAAACCCT CGGACCTGTT AGGACTTTTA

167451 ACTAGTTTGC AAGAAGGGGA CGTGTTTTTC ATCGATGAGA TCCATCGTAT 

167501 GGGGAAAGTT GCTGAGGAAT ACCTGTATTC TGCAATGGAA GATTTCAAAG 

167551 TCGATATTAC TATAGATTCA GGACCCGGAG CTCGCTCGGT CCGTGTCGAT 

167601 CTTGCTCCTT TCACTTTAGT GGGGGCAACG ACTCGATCAG GAATGCTAAG 

167651 CGAACCTTTA AGAGCACGCT TTGCTTTTAG TGCGAGACTT TCCTATTACT 

167701 CGGATCAAGA TCTAAAAGAG ATTTTAGTCC GCTCCTCACA TTTACTCGGA 

167751 ATCGAAGCTG ACAGCTCCGC ATTACTAGAA ATTGCTAAGA GATCCCGAGG

167801 GACGCCACGA CTGGCAAATC ATCTTCTACG TTGGGTCAGA GATTTTGCTC 

167851 AGATCCGAGA AGGAAACTGT ATCAATGGGG ACGTAGCAGA AAAAGCTTTG

167901 GCTATGCTAT TAATAGATGA TTGGGGATTG AATGAAATTG ATATCAAACT 

167951 TCTCACTACA ATCATCGACT ACTACCAAGG TGGTCCCGTT GGAATTAAAA 

168001 CCTTATCGGT AGCTGTGGGA GAAGATATCA AAACTCTTGA AGATGTTTAT 

168051 GAACCGTTTT TAATTTTAAA AGGTTTTATC AAAAAGACTC CCAGAGGCAG 

168101 AATGGTAACA CAACTTGCTT ACGACCATTT AAAAAGACAT GCAAAGAACT 

168151 TATTGAGTTT AGGAGAAGGA CAGTGAAACT ATTGAAAAAC GTACTTTTAG 

168201 GTCTTTTCTT CAGTATGAGT ATCTCAGGAT TCTCAGAAGT AAAGGTATCC 

168251 GATACTTTTG TGAAGCAGGA TACTGTCGTT GAACCTAAAA TTCGTGTCCT 

168301 TTTATCTAAT GAAAGCACCA CAGCTCTCAT AGAAGCCAAA GGTCCTTATC 

168351 GCATTTATGG AGATAATGTC TTATTAGACA CAGCGATTCA AGGCCAGCGT

168401 TGCGTGGTCC ACGCTCTATA CGAAGGGATC CGTTGGGGAG AATTTTATCC 

168451 CGGACTCCAG TGTTTAAAGA TCGAGCCTGT AGATGACACT GCTTCTCTTT 

168501 TTTTTAACGG GATTCAGTAT CAAGGTTCCC TATACGTTCA TCGTAAAGAC 

168551 AACCATTGCA TCATGGTTTC TAACGAAGTT ACAATCGAAG ATTATCTGAA 

168601 ATCTGTACTT TCTATAAAGT ACCTTGAAGA GCTAGATAAA GAAGCTCTAT 

168651 CTGCTTGCAT CATTCTAGAA AGAACCGCTC TATACGAAAA GCTCCTTGCA
168701 AGAAATCCTC AAAACTTTTG GCATGTTAAA GCTGAAGAAG AAGGGTATGC
168751 AGGATTTGGT GTGACCAAGC AGTTCTATGG TGTAGAAGAG GCTATAGACT 
168801 GGACAGCTCG TTTAGTTGTG GATAGCCCTC AAGGATTAAT TATAGATGCA 
168851 CAAGGGCTCT TGCAGTCCAA CGTAGATCGT CTTGCTATAG AAGGATTCAA 
168901 TGCACGTCAG ATTCTTGAGA AGTTCTACAA GGATGTGGAT TTTGTAGTTA 
168951 TAGAATCCTG GAATGAAGAA CTGGACGGAG AGATCAGGTA ACCTCTTTCG 
169001 CATGGCTGAT CGCAATTAGC GTGGTATGGG GCTGTAGCGA CACTGTCGGC 
169051 GTTGCTACAT TTTGAGGGAC AAACCCTTGC TGACTCTCGG CAACTATTTG 
169101 ATAAGGAAGA AAGTTGCTGG AGGCTTTAGG TAAGGTCGCA AGTTGGTCTT 
169151 GAGCTCCCAC GTGAAAAGCA ACATATACAT GCGCTTTTGG CGATTTTATT 
169201 TTAAATGCTA AGAAATTTCC AGGGCGCCAT GTCATGGGAT TTCCCATAGC 
169251 ATCTACCCAA CTGATTTCCT TATTGGAAAG AAAGCCTCGA TTAAAAAGTG 
169301 TTTTATATTT TTTTCGAAAC GCAATGAGAT CACAGAGAAA GTGCATCAGT 
169351 GTAGGCTTTG CGGTAAGCTG ATCCCAAAGG AAGTAATTCG CATTCGAATC 
169401 CAAAGCCCAA CGGTTGTTAT TGCCTTCCGC GGTATGGGCA TACTCATCTC 
169451 CTGATTGAAT CATCGGAATG CCTTGCGAGA CCATCAAAGT AAGGAAAAAA
169501 TTTCGTAACT GTCTTTCACG AACTTCAAGA ATGCCAGGGT CTTCTGTTTT 
169551 CCCTTCCGTT CCGAAATTGT AGCTGTAGTT CGCATCTGTG CCGTCACGAT 
169601 TATCCTCTCC GTTAGCCTCA TTATGTTTGT GGTTATAAGT CACAGTGTCA 
169651 CATAACGTAA AACCATCATG GCAACTGACA TAGTTAATCG AATTTGTAGG 
169701 CGAGCCGTGA GGATAGATGT CTTGAGATCC TGAAATTCTA GAAGCAAAGG 
169751 TTCCTATGAG ATTTTGATCC CCATTAAGAA ATGCTTTCAC GTTATCACGA
169801 TACGGGCCGT TCCATTCACT CCATCTTGGA GACAGTGTGG GGAAATAGCC 
169851 CACCTGATAC AAACCGCCAG CATCCCAAGG CTCAGCTATA ATCTTTGTGC 
169901 TCGCAAGTAA AGGATCAAAA GAAATCGCCT CTAAAACAGG AGCGAATTGT 
169951 AGGGGAGATC CCGAAGGACC ACGAGAAAAG ACAGAAGCAA GATCAAATCG 
170001 GAACCCATCG ACATGCATTT CTTCTACCCA ATAACGTAAG ATGTCGAGAA
170051 TCCATTGGGT CGTGGGGGCG CGGTTTGTAT TGAGAGTGTT TCCACAGCCT
170101 GAATAATTTG TAAAGTGACC TTGTGCATCT AAAATATAAT AGCTCGGAGT 
170151 GTCTATCCAA GGCAAAGAGC AGGTCGTCCC TTGCAAGCCC GTATGATTAA 
170201 AAACAACATC AAGAATGACC TCAATACCTT CTTGATGCAA GGTCTTTACT 
170251 AAAGTTTTAA ACTCTCTACT TGGAGCGCAA GGATCAGAGG CATAAGCATA 
170301 ACGTCGGCAA GGAGAAAAGA AATTTAGGGG AGCATAACCC CAATAATTGC 
170351 ACAGATAAGG GAATTTCGAA TTTCTAAAAG GATGCGCAGT CTCATCGAAC 
170401 TCAAAGATAG GTAAGAGTTC AACAGCGTTG ATTCCCAGCT TATGCAGATG 
170451 GTCGATCTTT TCAATGATTC CTAGGAAGGT TCCCGGAGCA TGAACCCTAG 
170501 ATGAAGAAGA TTGCGTGAAG GAACGTACAT GCATCTCATA GATGATCATC 
170551 TCTTCTTTCG GCAAATGCAG AGGCTGATCA CCATCCCAAG GAAATGGTTC 
170601 TTCCTTTAAA TAACAAAATG CATAATCCCC CTGTTTCTTT CGCGAACCAA 
170651 AACTCTGTGG GGAATGAATA TTCTTCGCAT AGGGATCTGC AAGATATTCT 
170701 TTAAAAGAGT ATTGCATTCC ATGCTTTTTA GGCCCATGAA CACGAAATGC 
170751 ATAAGACGAT TGATCAGAAA TACCCTCGAT CTCTATATGC CAAATCGCAC 
170801 CCGTGCGGTG TGTATCGGGG TAAAGAGGGA CTTCTATGAC TTCTGAATTT 
170851 TCGTCTGTTA AAGCAAGGAT GACTTCGGTA GCTTGTGAAG CATATAAAGC 
170901 AAATCGATAG CGGTTTGGGG AAATTTTAGA AGCCCCAAGA GGTAAAGGAA 
170951 CTGAGGGATA AGAAGAAACT TTTTCCATCG TCGATCAGAT ATCATAACGG 
171001 CGCCAGAAAT ACAAGGGAGT TTTTGATTTT AAGCCTCTGA AATCGTTTAC 
171051 TAACAAATCA CTCTTATGGT AACTTGAAAC AACAACAATC TATTACAAGG 
171101 AGATTCCCTC ATGTCCAGGC AAAATGCTGA GGAAAATCTA AAAAATTTTG 
171151 CTAAAGAATT AAAACTGCCG GATGTAGCTT TTGATCAGAA TAACACGTGC 
171201 ATTTTATTTG TTGATGGTGA GTTCTCTCTT CACCTTACTT ATGAAGAACA 
171251 TTCTGATCGT CXTTATGTCT ACGCTCCTCT GTTAGATGGA CTCCCTGATA 
171301 ATACTCAGAG GAAATTGGCC TTATATGAGA AATTATTAGA AGGATCTATG 
171351 CTTGGCGGTC AGATGGCTGG AGGTGGAGTA GGTGTTGCTA CTAAGGAACA

171401 GCTCATTCTC ATGCACTGCG TTTTGGATAT GAAATATGCT GAAACGAATT

171451 TACTAAAAGC ATTTGCTCAA TTGTTTATTG AAACTGTTGT TAAGTGGCGT

171501 AGCGTGTGTG CTGATATTTG TGCTGGTAGA GAACCTTCTG TAGATACAAT

171551 GCCACAAATG CCTCAAGGCG GTGGTGGAAT GCAACCTCCT CCTACAGGTA

171601 TTCGTGCTTA ATTAAGCATA TAAGTACTAC ATCTTTTTTC TAAATTTAAA

171651 AGATTGATAA AGCTCTTCTT AGAGAAGAAA TGACTTTATT GACTTCTCGT

171701 TTTTGTCCTG AATCTGTTTT GAGAAGTTCT TAAGATGTAT TAATTAAAGA

171751 AGTATTAAAA ATAGAAATCT AAAGGCTATC TTATGATGTT TGGGCATTTT

171801 rcPTGGTTArT TTGGAGCAGA TCCTGAAGAG CGAATGACTT CCAAAGGAAA

171851 A PGTGTG A TP ACTCTGAGAC TGGGAGTGAA GACTCGAGTT GGAATGAAAG

171901 ATf^AAAPTGT TTGGTGCAAA TGCAATATTT GGCACAATCG CTATGATAAG

171951 A T G GTTC*C*TT ACTTGAAGAA AGGCTCAGGA GTCATTGTTG CTGGCGATAT

172001 CTCTGTAGAG AGTTACATGA GCAAAGATGG TTCACCGCAA TCTTCTTTAG

172051 TGATTAGTGT AGATTCTTTG AAATTCAGTC CTTTCGGTCG CAATGAAGGC

172101 AGCCGTTCTC CATCTTTAGA AGACAATCAT CAGCAAGTGG GATATGAATC

172151 TGTATCCGTA GGGTTTGAAG GTGAAGCACT GGACGCAGAA GCTATTAAAG

172201 ATAAAGATAT GTATGCTGGT TATGGTCAAG AACAGCAGTA TGTCTGTGAA

172251 GATGTTCCTT TTTAATTCCT AGTCATTAAA GGAGAGTTTG TGGTTTTATT

172301 TCATGCTCAA GCCTCTGGGC GTAATCGTGT TAAGGCAGAT GCTATAGTCC

172351 TGPCCTTTTG GCATTTTAAG GATGCAAAAA ATGCAGCTTC TTTTGAAGCC

172401 GAGTTTGAAC CCTCGTATCT CCCCGCTTTA GAAAACTTTC AAGGAAAAAC

172451 PGGGGAGATT GAACTCCTTT ATAGTAGTCC TAAAGCTAAG GAAAAACGCA

172501 TTGTCCTCTT AGGCTTAGGG AAAAATGAAG AGCTCACCTC TGATGTTGTT

172551 TTCCAAACCT ATGCGACACT AACTCGTGTC TTACGTAAAG CAAAGTGTTC

172601 CACAGTCAAT ATCATCTTAC CTACAATTTC TGAATTGCGG CTTTCTGCCG

172651 AAGAATTCTT AGTGGGGTTG TCCTCAGGAA TTTTGTCATT AAACTATGAC

172701 TACCCACGTT ATAATAAGGT AGATCGTAAT CTTGAAACTC CTCTTTCTAA


172751 AGTCACGGTT ATCGGTATCG TTCCCAAAAT GGCGGATGCT ATCTTTAGGA 

172801 AAGAAGCAGC CATTTTCGAA GGCGTATATC TCACTCGAGA TCTTGTGAAC 

172851 AGGAATGCTG ATGAAATTAC CCCTAAGAAA TTGGCAGAGG TTGCTCTGAA 

172901 TCTGGGAAAA GAGTTCCCTA GTATTGATAC TAAGGTCTTG GGAAAAGATG 

172951 CCATCGCCAA AGAGAAAATG GGACTCCTAT TGGCTGTTTC CAAGGGTTCT 

173001 TGTGTGGATC CACACTTTAT CGTTGTCCGT TATCAAGGAC GTCCTAAGTC 

173051 TAAAGATCAC ACCGTCTTGA TAGGGAAAGG GGTCACTTTT GACTCTGGAG 

173101 GTTTAGACCT CAAGCCTGGA AAATCCATGC TTACTATGAA AGAAGACATG 

173151 GCAGGTGGGG CTACAGTCCT CGGGATTCTC TCGGCGTTAG CAGTTTTAGA 

173201 GCTTCCTATA AATGTCACGG GGATCATTCC TGCTACAGAG AATGCTATCG

173251 ATGGCGCCTC CTATAAAATG GGAGATGTCT ATGTAGGAAT GTCGGGGCTT 

173301 TCTGTTGAGA TTTGTAGTAC CGATGCTGAG GGACGTCTTA TCCTCGCTGA

173351 TGCGATTACA TATGCTTTAA AATATTGTAA ACCGACACGT ATTATAGATT

173401 TTGCAACTCT AACAGGAGCT ATGGTAGTCT CTCTAGGAGA AGAGGTTGCA 

173451 GGTTTCTTTT CCAATAACGA TGTTTTAGCT GAAGATCTTT TAGAGGCGTC 

173501 AGCCGAAACC TCCGAGCCGT TATGGAGACT TCCTCTAGTT AAGAAGTATG

173551 ATAAAACATT GCATTCTGAT ATTGCTGATA TGAAAAATCT AGGCAGTAAC 

173601 CGTGCAGGGG CTATTACAGC AGCATTATTC TTGCAGAGAT TTTTGGAAGA 

173651 ATCTTCGGTA GCTTGGGCAC ATCTTGATAT TGCAGGTACT GCATATCATG
173701 AAAAAGAAGA AGACCGTTAT CCAAAATATG CTTCAGGTTT TGGTGTTCGT
173751 TCTATTCTTT ATTACTTAGA AAATAGTCTT TCTAAGTAGT TGCTTTCTAT
173801 TTATTTATGT TTTAGTAATG ACTTTTATTT TAGTTTTTTT AAAATAAAAG
173851 TCATTTTTTT TATTAAAGTT TTCAATCGTC CCTGCCGATA GATCAGGTAA
173901 GTAATTACCT GTCTAATTAG GGGAATAAAG ATGATTGGAG CGCAAAAAAA 
173951 GCAAAGCGGT AAAAAGACAG CTTCAAGAGC TGTACGGAAG CCTGCTAAAA 

174001 AAGTTGCGGC TAAACGTACG GTTAAAAAAG CTACTGTTCG CAAAACCGCT
174051 GTAAAAAAAC CTGCAGTTCG TAAGACGGCT GCTAAAAAGA CAGTAGCAAA 




174101 GAAGACTACA GCTAAGAGAA CAGTTCGTAA GACTGTTGCT AAGAAGCCTG

174151 CAGTTAAGAA AGTTGCTGCT AAACGTGTAG TAAAAAAGAC AGTAGCAAAG

174201 AAGACTACAG CTAAGAGAGC GGTTCGCAAG ACTGTTGCTA AGAAGCCTGT

174251 AGCTAGAAAA ACTACAGTGG CTAAAGGTTC TCCTAAGAAA GCTGCAGCCT

174301 GTGCTTTAGC ATGCCACAAA AACCATAAGC ATACATCTAG TTGTAAACGT

174351 GTCTGTTCTT CAACAGCTAC GAGAAAGCAT GGCTCTAAAA GCCGTGTTCG

174401 TACAGCTCAT GGCTGGCGTC ACCAACTGAT CAAAATGATG TCTCGATAAT

174451 TTGTGATTTT CGCATTATTG CTCATGTTAA CGGGAAAGGG AAACATTGGG

174501 TTTCCTTCCC GTTTTTCTTT TTAAGGTTAA AAAGCTTTAT AGAGCGAGAT

174551 CTTCAGGCTT CATGCTGTAC AGTTGGTAGG AAAATACGTA TAGTAGGTTC

174601 AGGATACTAC TTTTTTGACT CTACCTATGC AAAAATCCTT AACGAGTTTT

174651 GATGACTTTT CCCAGGCGTA TGCAGAGAAA GTGCCCGCTA TAGCTCTTAT

174701 AGGGAGTGCT TTGGAAGACG ATAAAGATGC GCTGATTGAA TTATTAGTCT

174751 CTGAGAGCTT CAAAGAGCTC GGTGGTCAGG GACTCATGCC AGCAACCCTC

174801 ATGTCTTGGA CCGAGACGTT TGCACTCTTT CAAGAGCATG AAACTTTGGG

174851 GATTATTCAT GCAGAGAAAT TCCCTCTAGC AACTAAGGAA TTTCTAAGCC

174901 GCTATGCTCG GAATCCTCAA CCTCACCTTA CGATTTTGAT CTTCACCACA

174951 AAACAAGAAT GCTTTCGAGA ACTGTCAAAA GCCTTGCCAT CGGCTCTTTC

175001 TTTGAGTTTA TTTGGTGAGT GGCCCGCAGA TCGTCAGAAA AGGATCATAC

175051 GCCTCCTGTT GCAAAGAGCT GAGCGTGTGG GGATTTCTTG CTCTCAATCA

175101 TTGGCATCTT TGTTTTTGCG TGCACTTGCT TCAACCTCTC TTCCTGATAT

175151 TCTCAGTGAA TTCGATAAGC TACTGTGCTC TGTTGGCAAG AAAACGTCCT

175201 TGGATCACTC TGATATTAAA GAGCTCGTTG TCAAAAAAGA

175251 CTATGGAAAT TTCGAGACTC TCTATTGAAG AGGGATCCGG TAGAAGGTCA

175301 CCAGCAGTTG CATTTTCTAC TCGAGGATGG TGAAGATCCC TTGGGGATTA

175351 TTACTTTCCT TCGTACCCAA TGTCTCTATG GTTTACGTAG TATTGAAGAG

175401 GGATCGAAAG AAAATAAACA CCGAATGTTC GTCCTTTATG GAAAGGAGAG


175451 ACTACACCAA GCCTTAAATT CTTTATTTTA TGCAGAAACC TTAATTAAAA

175501 ATAATGTCCA AGATCCTATA GTAGCTGTGG AAACTTTAGT GATTAGAATG

175551 GTTAACCTGT GACTTTATAT CTTCTTCCCA ATACTCTCGG TACCCGTGCT 

175601 GTAGAGACTC TCCCCTCCGT TATAGGAGAA TTAGTTCATA GACTAGATGG 

175651 GCTGATTGTA GAAAGTGATC GTGGGGGTAG GGCATTTCTA AGTTTATGGA 

175701 AAATTCCCGA AGTTCATAAA TTTCCTCTTG CTATTCTTAG TAAACATGCG

175751 CGCCTCCCTA AGGCTTGGGA TTTTTATCTA GAGCCTATCG TAAAACACGG

175801 GGAGAATTGG GGACTGATCT CTGATGCGGG TCTTCCCTGT ATTGCAGATC

175851 CTGGAGCGAG TTTAGTGCGT CGTGCACGTG CTTTGGGGAT TCCTGTGCAG

175901 GCTTTTTCAG GTCCCTGTTC GATAACGTTA GCGCTCATGC TTTCAGGCTT

175951 GCCTTCCCAG AGCTTTACGT TTTTGGGATA CCTCCCGCAA AGTCCTAAGG

176001 AACGTGTAAA GTCGATAAAA AAGGCAGCGA CCTCCAAAGA GGTATCTACT

176051 TCAGTATGTA TAGAAACTTC TTATCGTAAC GTCTATACTT TTGAGTCTCT

176101 TCTAGATACT TTACCTTCCT ATGCGGAGCT TTGTGTTGCC TCTGACCTTT

176151 CGGGCCCGAG TGAACTTGTT CTCACACGCC AAGTGCAATC ATGGAGAACT

176201 ACTGAGGACT TAGGTTCTGT GAAGCAAAGT ATAACCAAAG TCCCAACAAT

176251 ATTTCTTTTC CACATCCCGA ATTAATTTTT TCTTAGCTTG TTTTATCCCC 

176301 GGTATCATTG TCAGGAGTTT CTTAGCATGT AAGCTCTTCG GGGATATTTT 

176351 TCTTGTGATC ATAAGCAAAA ATTAAAGGTT GCATGATTCT AAATAGCGCA

176401 TAATCATAGG TTTGCGTCTT TTTAAAATAA TCGTAATCCT CTATTGCGTG 

176451 TTATCAATAC TTATTTTATA ATCTTTGCCA AATAAGTTTA GAGCGTTGAT 

176501 GACAGCACCC ACAGAATCTC GATCTTCACC ACCAACACTA CTCGAAGAAA 

176551 CGGAACCTCT ATCCCCAAAT CCTATTCCCG CCGATATCCA AATCCCGAGA

176601 ATTACAATAT CTCCTCCTTC TCTAGACGTA TCAACAGTTG CATCTTCGGC

176651 TGAAGATATC TCGGTCTTCA TTGCAGGAGG CCCTAGAAGT TCTTCATCTG 
176701 CTTCTGTGGC TTCGGATGTG TATGAGTTGG TTTGCCTTTG TGGCGGTGAT
176751 GAGGATCCTG AGCCCCCAGA TTCTGAGGTA CGTACTCTCT ATGTAAATGG
176801 AAGTTGGCAA ACGCATCAAG AAGCTGTACA GGAATTGCTG TATATTAGTG

176851 AAGTACGGGG AGAGGCCGTT CGTCTTCTCT ATAACGATGG AAGCGGCATG 

176901 TCTCCTTGGC CCATCAGTCC TTGTCGTACT CTTCCTACTC TCGATCATCC

176951 TTTATGTCAG GCCCTTTTGA CGGTTTGGGA ACAGTTTTTC TCTGCTCCTG 

177001 AAAATCAGAA TCGTGAGTTT CTAGTGATTT TCTATGGGGA TGCATCGCCT

177051 TATATACAAC AGGCGTTAAC GCAATCTAGG CATAGTCCAC GTATTGTTGT 

177101 TGTAGGGATT TCCCCGACGG TCTTTATTCA AGGAGACTTT AGGGTCCATA 

177151 ATTACCGTGT TTCTGGAGAC TTCTTTAGTT CTCTGGATTG TCGGGGAACT 

177201 AGGGCGGAGA ACACCACGAT ACTGCCGTAT TCTTCGGGTC TTGAGGGTGT 

177251 TTTTCTGCCT TCTATCCGTT GTCCTTCTTT TACTTGGGCG GTGCGTTTTG 

177301 GAGAGCAATG CTTGGTTGCG AATAGGGGTG AGGATGTAGA AGATAGGGGA 

177351 GGTCTTTCTC AAGATGCCGA AAGATCACAG TTACCACACA GTGAAAGAGA 

177401 TCTAGCTGTT GTCATTGATT CTACGGATCC TAGTTCTATG AGTAGGCTTG

177451 TAGAATGGTT GAATCAAGGA TCGCCTTCAT CAGATATGGA AATCAATCCC

177501 TATCCCCAAC GGTGTCCTGA TGTAGCTCTT TCTGCGCTTT ATGCAATTTC 

177551 TAGAGTTTCA GGACTTGCGC AGGAATGGAT CCTAGCCTCT GTTCATGAGG 

177601 GCTTAGACTT GCAGATCTGT TACTCTTTAA TTTTGATGCA CACGACGTTT

177651 GCGGTCCGGT ATTTTTTCTT ACTCTTTACA AATTATCCTC AGTCTAGAGA 

177701 GAGATTCCGT ACTGCACGAA TCGTAGCACA ATCTCTATAT TTACCAAGCA 

177751 TCCTTGTTCT TGTTTTTGAT TGTGGCAACG TCCTGCGTAA ACTATGGATG

177801 CCTCAGGAAA TCTTACGAGC AATTTTTATT TCTGCGTCTA CAATTTCAGG 

177851 GAGTATTGTC TTTGTAGAGT GCACTCGCTG GATGGGGCGA GGTCTTAGAC 

177901 ATCGTGTACA ACAATTTGTG CAGCAACGAG TTATAGGAAG TGGCCTGCCT 

177951 GTAGGAACAG TACGAGCTTC TTATCGCGAT CGTGCAGGCT TTATCATAGG 

178001 CTTTTTACAA ACTGTACATG GAGGACTTTA TTTGCCGGTA TCCATTATGG 

178051 TGCTTAACCA GATTGCAATA CAAGTTCCAC GTATCTTAGT ACGTCCAAAT 

178101 AACACTGCTG TTTATGATCT ACATAATAAA AGTGCTGAAG AAAATTGGAG
178151


CAGTGGTGA I 


Vjlrtl IMuLlu 


TTGGCCAAAC 


ATTAAACTTC 


ATCTTATGTG 


178201 


C TTTCGTC TT 


PTTPPT&. JA *T 


PTATGGTTCT 


TTGTGAAGTC 


CGTATTACGC 


178251 


CATTG I AGGL 


PTPPTPGTPG 


CTAATGTATC 


TTTGGAGACA 


TCTTGGTATT 


178301 


TTTAACATAA 


P ft PP "h CCltZC A. 
GAGCAL.oUC/i 


A A A AAAAGAA 


AGACCGAGAA 


AGAAGCTTTA 


178351 


CCTTTAAGAG 


TTCTCTGATG 


TTTPPPP.P.ZiG 


GATCAGGAGT 


CCGAAGGGCA 


178401 


GAGTCTAGGT 


TTTAGCCTTC 




AGAAGGAATA 


TCCCATAATC 


178451 


TCTTCCGCTA 


TTGTATCGTG 


G AAGAG ri\. 


C P PTG TCT AT 


TTAGGGCAAG 


178501 


ACATTGTCCA 


TGCACACTGA 




TAATTTTACA 


TCTTGCGTAA 


178551 


GCATGGAGAT 


AAGTGTGGAG 


PPPA fiPTPPG 


PGAGGTCTGC 


TCCTTCAAGG 


178601 


AGTCGGAGTC 


GCAGGGCTAA 




ATTCGTTCTT 


TTTTTGGGAG 


178651 


AATTTCTGAG 


om o m o m mo o o 

GTCTCTTGGG 




PTTAGGTACA 


GCACGTAGAT 


178701 


AGTGAGAAAT 


ATGACTATAA 


TTTTTTr r. rr 

11111 1WVL.U 


GPTPTPPGTG 


AAGGTATTGC 


178751 


GAAGCTGAAA 


<^»rr»o 0<T> n ooo 

CTCCTAAGCC 


1 AAunnnWj'j 


PGATPTGTCC 


AGTAATAGAG 


178801 


GTTGTGCTTT 


GCGGGGTAA I 


PTP.P.PTTGGP 


AT ATG AAG C A 


AGTTCATAGC 


178851 


GTTGGAACCC 


mmoop aPAP.T 


AGGAGATTTT 


CAGCAAGGAG 


GCTCATCTCA 


178901 


GCTAGAATTT 




A ATTGTGGGG 


ACTAGAATTT 


TGCGGTGTTT 


178951 


ATAGAAGGAG 


^T/^Trp/v tat 
GTGTGGGGA1 


PTA.TL.GTGAG 
t Inl /\o 1 


GTTG TA TAG A 


GAAATGTGAG 


179001 


TGATAGGGAG 


» omo » o n p/vp 

AGTCAGAGCT 


mo » TPTAPP.T 
1 GA1 ulnbo 1 


PGPTTAGGAA 


TATCTCCAAA 


179051 


GACTGTGTGG 


GCAGTCCGI A 


o a TT A P.GTP T 
GA 1 1 nuo 1^-1 


A TAG AAAG AT 


TAGAGAATCC 


179101 GTGATTCTGG CATTCTTGCA GTGCTGTGAT TGGCGCAGAT GAAGAATGCG

179151 TTCTTCCGAG GAGCTGTAGG ATAGAGTCGT CGAAGGTTTG TACGCCAACG

179201 CTAATTCTAT TTATTGGAGT CTCTTGTAGT TGACGTAGAT AGCTTACGGT

179251 GAGATTTTCG GGGTTGGCCT CTAAAGTAAT TTCCCGGGCA TGGGGGGCTA

179301 GCTCTTTGAG GATGCGCTTA AGATCAAGAG GAGAAACTAA TGAAGGTGTT

179351 CCCCCTCCAA AAAACACAGT CTCTATGAAA TGCGTCTCTT GGATGGGGGC

179401 TAGCTTTCTT AGCCCCTCTT GAATTACAGC ATTACAATAG AGCGATACAG

179451 ATTCACTTTT GTAGGGGATT GTATAAAAAC TGCAATAGCG ACATTTTTTT

179501


GTGCAGAAGG 


G AA I A 1 wnn 1 


ATAAAGAGCT 


AGGGGAGCCT 


TACCATTCAT 


179551 


TAGCA i CGGG 


A 1 tnn 1 wn 1 A 


CCACCGCGTC 


TCCAGCGATT 


GCGATCGCTG 


ivy bui 


AAAGGG 1 L. 1 1 


CA 1 LuiL 1 1L 


GTGGGAATAG 


TCGACTTCTA 


CTGCCCCATC 


1 /yobl 


TTCAuGU CC-T 


TLA J. Gwj 1 VjA 


GTTCCAGTTC 


TACGGTTTCG 


CCTGGGTCGA 


1 / y/oi 


TATCAAIT 1 C 




GGTTCCACAA ATTCAATATC 


AGACATACTA 


179751 


GGACCA I Q.AG 


GG 1 AGAt. A\j 1 


CTTTGCGGGA 


CTGCGTCGTG 


GTGTGACGTA 


179801 


CTCTTCAATC 


Iti l-(_\jl 111 


CTTTGAGGAG 


TTGTAGACGT 


TCTTTTTCTT 


1 *7 ft O C 1 

179 851 


CGTAAA Tl 11 


A rG 1 I G 1 AAtj 


ATTCGGATTT 


CTTCTTGGTG 


CCTGCTAATT 


179901 


TCTTTTTTTG 


GGAG I AAAtX 


CAACTGCATC 


CATTGCGTAA 


GGTCATGAAG 


179951 


TTCTGATTCT 


AATTTTTTAA 


GACGTTCGCT 


TTTCATCCAA 


GGGTTCCCAT 


180001 


CCTGTATTCG 


AGAGCAGG rG 


CATAGCATTT 


TTTTGTTTTT 


TCAACTGCAG 


180051 


AAAAAAAACA 


AAGATTTTTT 


TTTTTTTGAA 


AAATAAAAAT 




180101 


GCAGACATTT 


AGA1 LGl Al 1 


TATTGAGTTT 


AATTATTTTA 


TGGATTCCGA 


180151 


GTTTGTGGGG 


C AAG TAT A 1 1 


CTTCGGATAT 


GGATTGGATC 


GAGTCTATGT 


loOzOl 


ATCAGAGATT 


T. A 1 G AA 1 GAv. 


GAGACTTTGG 


ATCCTTCTTG 


GAAGTATTTT 




TTTGAAGGGT 


ATGAGLTGov? 


TCAAGCAGCA 


TCTCCATCAG 


AAGCTAGTAC 


loU Jul 


T AAG ATTTC, T 


r*r> o j\ jx rp/*» AAA 
(jGuAA 1 uAAA 


CTATTGCTAT 


GCTTCAAGAA 


CAAAAATCTC 


180 J bl 


AGTTTCTATG 


m ji A r r"T kr P jx rp 

TAGGA1 1 1A1 


GG1 1 Al 1 AlVj 




AAGTCAAATT 


1 Oft / A1 


TCAACGC- 1 1 G 


AAV- 1 A\- 


AGATTCTCGA 


TTCATTCAGG 


AAAAGATCGC 


1 Oft / C1 

loU4 Dl 


T AAG ATTG A I 


(_ 1 VjoA 1 \jAvj\- 


AGGTGCCTTC 


TGCGGGTCTA 


CTTCCTAAAG 


1 O ft C ft 1 


CTCAGGTTTC 


Gvj 1 AV-o AwAVj 


CTGATCGAAG 


CTTTAAAAAA 


ATGCTATTGC 


1 O ft C C 1 


GGAAGTCTTA 


GT1 lAGAAAv- 


CCTAACATGT 


ACTCCTGAGT 


TGCAGGAGTT 


180601 TGTTTGGAAT CTTATGGAGA AGCGACAAGT GGAGCGCTTT GCAGAGCAGC

180651 TCCTTCGCTC CTATAAAGAC TTATGTaaAG CAACGTTTTT TGAAGAGTTC

180701 TTACAGATAA AATTTACAGG TCAGAAACGT TTTTCTTTAG AGGGCGGAGA

180751 GACCTTGGTC CCCATGTTGG AGCATCTTGT TCATTATGGA TCGGCATTAG

180801 GAATTTCTAA CTACGTTTTA GGAATGGCCC ATCGAGGTCG TTTGAATGTA

180851 TTAACGAATG TTTTGGGAAA GCCTTACCGT TATGTCTTTA TGGAGTTTGA

180901 AGACGATCCT GCAGCACGTG GTTTAGAGAG TGTTGGGGAT GTAAAGTACC

180951 ATAAAGGGTA TGTGCTAAAG TCCCATCAGA AAGATAGGGA AACTACCTTT

181001 GTGATGTTGC CAAACGCTAG TCATCTCGAA TCTGTAGATC CTATTGTCGA

181051 GGGGGTCGTG GCTGCCTTGC AACACCAAGG TCACGCAGGT AAAGAGCAAA

181101 GCAGCTTAGC AATTTTAGTT CATGGAGATG CAGCATTTTC TGGTCAGGGA

181151 GTGGTTTATG AAACTCTCCA GCTGAGTCGT GTTCCAGGGT ATTCTACTGA

181201 GGGTACGCTT CACATTGTTG TGAATAATTA CATAGGGTTT ACCGCAGTGC

181251 CACGGGAGTC AAGGTCCACC CCTTATTGTA CGGATATTGC TAAAATGCTA

181301 GGGATTCCTG TATTTCGAGT GAATAGCGAG GACGTCGTTG CCTGTATAGA

181351 AGCTATAGAG TACGCTCTGC AAGTTCGTGA GAGATTTAGT TGTGATGTGA

181401 TCATAGATCT CTGCTGTTAT CGCAAGTATG GACATAATGA AAGTGACGAT

181451 CCCTCAGTAA CAGCTCCCTT ACTCTATGAT CAGATTAAGA GAAAGAAGAG

181501 TATTCGCGAG CTGTTTAGGC AATATCTGTT GGAAGGGCAG TTTGCAGATA

181551 TTTCTGAAGA AACTTTGGCA TCTATTGAAA AAGAGATTCA AGAGAGTCTG

181601 AATCGTGAGT TTCAAGTATT GAAAGGGACG GATCCAGAAC CCTTTCCTAA

181651 AAAAGAATGT CATCACTGCG ATCGCTTAAA TAACGGCGAG CTTATTTTGC

181701 ATGATTGTGA TGTTTCTTTG GATCGCGAGA CTCTTTTTCA TATGAGCTCG

181751 CGTCTTTGTG GTTTCCCTGA CAATTTTCAT CCCCATCCTA AAATTAAGAC

181801 TCTTTTAGAA AAAAGAATGA AAATGGCAGA AGGTGGGGTT GGTTATGATT

181851 GGGCGATGGC CGAAGAATTA GCCTTTGCTT CGCTATTAAT CGAAGGGTAC

181901 AACCTGAGAC TCTCAGGTCA AGATTCTATT CGCGGGACAT TCAGCCAACG

181951 ACATTTGGTA TGGAGTGATA CTGTGACTGG AGATACCTAC TCTCCATTGT

182001 ACCATCTTTC TGCAGAGCAG GGCTCTGTAG AAATGTATAA TTCTCCTCTT

182051 TCCGAATATG CAATTTTAGG GTTTGAGTAT GGCTATGCTC AACAGGCATT

182101 AAAGACTTTA GTGTTATGGG AAGCGCAGTT TGGGGATTTT GCTAATGGTG

182151 CACAAATCAT TTTCGATCAG TATATCTCTT CGGGAATTCA GAAGTGGGAT

182201 TTACACTCTG ACATTGTTCT GCTTCTTCCC CATGGGTATG AGGGCCAAGG

182251 ACCCGAGCAT TCTTCATCTC GTATAGAACG TTATTTGCAA TTAGCCGCGA 

182301 ACTGGAATTT TCAAGTGGTC TTGCCTTCCA CTCCTGTGCA ATATTTTCGG 

182351 ATTCTCAGAG AGCATGCTAA GAGAGATCTT TCTTTGCCTT TGGTGATCTT 

182401 TACTCCTAAG TTGCTGCTGA GATATCCACA ATGTGTAAGT AGTATCGAGG 

182451 AGTTCACAGA ACCTGGGGGA TTCCGTGCTA TTCTCGAAGA TGCCGATCCT

182501 AATTATGATG CTTCTATTTT GGTATTGTGT TCGGGAAAGA TCTATTATGA 

182551 TTATGCAGAA ATGCTTCCTC AAGATCGGCG TAAGGACTTT TCTTGCTTGC 

182601 GTATAGAGAG CTTGTATCCT TTAGCTCTTG AGGATTTAGT GAGCCTTATC 

182651 GATAAGTATT CTCATTTGAA ACATTTTGTT TGGCTACAAG AAGAATCCAA 

182701 GAATATGGGG GCCTATGACT ATATGTTTAT GGCGTTGCAA GACATTCTTC 

182751 CTGAGAAACT GCTATATATA GGACGTCCTC GGAGTAGTTC CACAGCTTCT 

182801 GGATCAGCGA AGCTCAGTCG TCAAGAGCTG GTCACGTGTA TGGAAACCCT 

182851 CTTTTCTTTA AGGTAAATTA TGACTACAGA AGTACGCATT CCTAATATTG 

182901 CAGAGTCGAT TAGCGAGGTG ACCGTAGCTT CCTTGTTAGT TACAGAGGGT 

182951 GCTCTGATTC AAGAAAACCA GGGCTTACTA GAAATTGAAA GTGATAAGGT 

183001 AAATCAGCTC ATTTATGCCC CAGTATCGGG AAGAATTTTC TGGGAGGTTT 

183051 CAGAAGGCGA TGTTGTTCCT GTAGGGGGGG TAGTGGGAAA AATAGAGCCC 

183101 GCAGGTGAAG GGGAAGAGCT TGGAGATTCT CAGTCTAAAG AGACTATAGA 

183151 AGCTGAGATC ATTTGCTTTC CTCAGTCTGG GGTGCGTCAG TCTCCTCCAG 

183201 AGAATAAAAC GTTTATTCCT CTTCGTGATC AGATGGACCA AGGATCCCAA 

183251 GGTCTTTCTG CAGGAGATCG AGGAGAAACT CGAGAACGCA TGACCTCGAT 

183301 TCGTAAGACA ATTTCGCGGC GTCTTTTGTC TGCTTTACAT GAGTCTGCGA

183351 TGCTCACGAC ATTCAATGAG GTCTATATGA CACCTCTTTT TCATTTGCGA 

183401 AAGGAAAAAC AAGAAGAGTT TCTATCTCGA TATGGGGTGA AGTTAGGATT 

183451 TATGTCTTTC TTTGTGAAAG CTGTCTTAGA GGCTTTGAAG GCATATCCAC 

183501 GAGTGAACGC CTATATTGAT GGCGAGGAGA TTGTTTACCG TCACTATTAT

183551 GACATTTCTA TTGCTGTAGG TATCGATCGA GGACTTGTGG TTCCTGTGAT

183601 ACGCGATTGC GATAAACTTT CTAACGGGGA GATTGAGCAG AAACTCGCAG

183651 ATCTTGCCCT TCGGGCTCGT GAAGGCCTAC TTGCAATAGC GGAGCTTGAG

183701 GGAGGAGGTT TCACAATTAC CAATGGAGGC GTATATGGAT CGCTACTTTC

183751 GACTCCCATT ATCAATCCCC CGCAAGTGGG GATTTTGGGG ATGCATAAGA

183801 TAGAAAAGCG CCCCGTTGTT CTTGATAATG AAATTGTAAT TGCAGATATG

183851 ATGTATGTCG CTTTAAGCTA TGATCATCGT CTTATTGATG GGAAAGAGGC

183901 TGTTGGGTTT TTAGTCAAAG TGAAAGAAGG CCTAGAGAAT CCTGCCTCAT

183951 TACTCGACTT GTAATTTCTC TGATTCTCAT AAAGGCTCTT TTAGAGCCTT

184001 TTCAGATTTT TTAACCTCTT TTCTTATCAT GAAAAGGATT GCACTCATCT

184051 TGAGTAAGGA ACAATGTAAA TTGAGTATCC GCACTTACAT TGGCTTGTAC

184101 CGCAGCCTCA TTCAAGGCGC CTAACAACTT TCCGGGTAAG TACACATTGT

184151 CTCTTTTAAA GTGAGGTTTC CCGGTATGAT CAAGAGGAGT GAAAGCTTCT

184201 TCTGCGTGAA TATGTACTGA TGTGAAGCCA TTTTCTAAGA GATCTACAAA

184251 CAGATTTTTC AAGAACTCCT TTGAGTCTTT TTCAGAAAGC GAAGAGCTCA

184301 TCGGATCTTT TCTTTCTAAG ATATCTTGTA CATCCTTGTC GTAATAACCG

184351 CAAGGAGTCC ATGCTAAAAC TAAAGTTGAA CTGACACCAG GAGTTGGAGA

184401 ATTGAAAATT GCCCATTGAC CTGCTTTTTT TCTAGAGTTC ATTTTAGTTT

184451 TCATAGCTTC CCACTGTTCC TCTCCTAAGA GATCTTGCAG TTGCTTTAGG

184501 GGAGCCGAGA CTTCCTTATC CCCTTTCCAA TGCGAAGTTT CTGGTGAAAG

184551 GAGGAATAGC GAAGGTACGC GAGCCCCGTG ATCTCTAATT TCTGTGAGAT

184601 TAGGAGCTTT AGCTGTGACA ACCTTAAATT AAGGACCCTT CTTTAGATGG

184651 TGAATTTCAC AATTCGTAGC TTCTGTAATT TTGATTTCTT TAGTQsAVjI I I

184701 TTCTTTGGGA GGAAGAGCGT CGTAAGGTAA GGAACAATGA TGTTAATAGG

184751 GCTAGTTTAG TTAAATAAGA ATAGGGGATC TCAGGTTTAG GAGTGATGGG

184801 TTGTTGCTCG ATTACAATGA TGGTTAATGG AAGAAGAGCG TTTCTAGAAA

184851 GTGCATCTGC TTCCTCGTCT TCTTTAGGCA ACAAGCTCAA GATGAGAAGC

184901 ACG AGCCCGk TGGCTAATCC CACGAGTGTG ATAATACCAG CAATGCTATA

184951 CGCCTGTATT AGAGGGAAGC CTAAAGCTAA GAAGGTAACG AACGTGATTC 

185001 CAGCTGCTAA CATTAGTGCA GCGAAAAGTA TGTAAAGAAC AGCACGCGCA 

185051 ATTTGAAGAG CAAGACTTTT ACTTCTGAGT GTGCCTTCTG TAGCTTGGGT 

185101 TTTGTTAGCT TCAATCTCAG CGAGTTGCGC TTTTTTTGGC TTCACTGGTT 

185151 CGTGAACTTT ATGATTAGTT GATGCGCCTA ATTTCATAGA AAGTCTTTAA 

185201 GCTTAATTTT TTCCTAAAAA TTGTAGTTTA CATGTTGTTT TTCATCAAGA 

185251 AGATTACTGA ATTCTGAGTG GGTTGGAACG AAGCATGTTC TTCTAGAAGC 

185301 TAAGGTCCTT AGGGGAAAGG AAAAGAAGCT TGGCTGGATC TTTTTAATCT 

185351 CTGGGTAGGA GAAGGACGGC GGTTACATTA TTTCTATTTT TTGAATGTTG 

185401 GCCTGTGAGA TTGGGATCGG GATGGTGAGC AAGGTACCTT GAAGAAAAGA 

185451 AAGCGTCGTG TTCCGTGTAG GTACAGAGGT CTGAGATAAA AATGCGGTCT

185501 TTAGAGATTC CTAAATTCGT AAGTTGCTTG CGAGCAATCG CACGCAGGTC 

185551 AAAATGGTTT TTGGGATTCA TAAAGGGAAG AAAGCTACGA GGAAATAACG 

185601 TAGCGTAATC GGGATAGATA GCATAATCTG GACCGATGGA AGGGCCGATA 

185651 GCTACGAAGA GATCTTGTGG TTTTGTATGA AATAATTTTT TCATAGTACC 

185701 TACGGTGACA GCATAGATAT TGCCAAGCAA TCCTCGCCAT CCGCTGTGTA

185751 CATTTGCGAT TGCGTGGTGT TCTCGATCAT AAAAGATAGC TGCTTGGCAA 

185801 TCGGAATGGC GGATATGGAG AGAGAGGAGC GGAGACTGCG TGCACAGTCC 

185851 GTCTGCAGGT TGGTAGGTGG GGGATGTAGG TGTAACACAA CGTACGGAAG 

185901 TGCCGTGGCG TTGATGAAGG TCGCAATACT TCGGAGATTG GAGAGCTGAA 

185951 GCAATCTCAG GATTCTTGGC TGCGAAGACC GTGCCCTCGG CATCCTTTTG 

186001 TTTTGAAAAG ACTCCGTGGC GTATAGGCAA ACCATCGAAT TCTTCGAAAG 

186051 TCCAATAGTT CAGAGGGTGA GTGTGGAAGG ATAGAGTCAT ACTGTCAACT 

186101 TAGTTTCTTC AGGGTCTTTC CATACCCCAT GTTCTTGTAA AAGTCGAATT 

186151 AATTCTTCTT CAGCATCTTC CATGGGTATG TGAGCTTTTA CACAAGTATG 

186201 TTTTACATAA AGATCGATCA TCCCTGTTTT GGAACCTACA AATCCAAAAT 
186251 CTGCATCTGC CATTTCTCCA GGGCCATTCA CAATACAACC CATGATAGCG

186301 ATCTTAAGTC CTGGTAGGTG CTGCGTTCTC TTGCGGATAC GTGTGGTGAC 

186351 TTCTTCAAGA TCAAAGAGGG TCCGACCACA CATAGGACAG GAGATGTACT 

186401 CTGTTTTTAC AAGGCGCACC CCTGCATTTT GTAGAGTGCC AAAGGCAATT 

186451 TTTAGCACGT CCTGTAGGGG AAGGTTCGGT AAGTCAAGAA CCACAGCTTC 

186501 TCCAAGGCCA TCAAGAAGCA GAGCTCCAAA CTCTGTTGCT ATGGAAATAG 

186551 CAGCTTCTTC TTTATTGTCA AAGTCCCTTG AAAATACTAG CTTGGTCGGT 

186601 TTTCCTTGGT GTCCTTGTTT TTCAAAGAAA TCTCGGGAGG TATGAATGAA 

186651 GGGGTCTGAA GCATGAAAAT GCACAAATGG AGCTTGATGA ACAGCAGGGC 

186701 TATCCCAAAT CTCCTCATTG TGTTCATATA GGCAAGGCAC TTGATGGTGG 

186751 TGGAAAACTA AAAAGTGTTC TCGAAGTACA TCTGTAATAG GAGCATCTTT 

186801 TAACTCAGGG GGAACGACGA CCCCTTCAGG AGTTGTGAAT GCTTTTTCTT 

186851 TTGTTACGGG ATTTACCCCC AAGTGTTCTA AGAGTTCTTC AGGAGTAAAG 

186901 TCGGTAAGAT GGTGAGGATA GAGTTTTAAA AAGACTCCGT AGACGTCTCC 

186951 CCAAAGTGTT GTTTTCGCAG GCTTCTCTGC AGCAGAAACA AAGTTTTCGG 

187001 AGTGTTGTAG GGAAAAGGGA TTTTTCTTTT CTGGAAGGTC TAAGTAGATT 

187051 TTCGTATGGC GTAGCAAGCT ATCACAGACA GGAATTTCTG TAGTGGGACA 

187101 CCCTGTGAGA GAGCAGCGTA TGGTATCCCC GAGTCCTTCG GCAAGAAGAG 

187151 TTCCGATTCC TACTGCGGAT TTTATGATCC CGTCCACGCC CATTCCAGCT 

187201 TCAGTAACTC CAAGGTGAAG GGGATAGAGC CAGCCTCTAG CATCTAAGTC 

187251 TTTAGCAAGT TGGCGGTATG CAGTTACCAT GATCTTCGGA TTGCTAGATT 

187301 TCATTGAGAA GACAACATCT CTATAATTCA GCTTTTCACA TACAGCGATA

187351 TATTCAATTG CTGAGGCTAC GATTCCTTCG ATAGTGTCGC CATATTTTTG

187401 CATGATTCTT TCGGAAAGTG ACCCGTGGTT CACTCCAATG CGCATAGCCT
187451 TGCCTAGTCG CTTACATTTC TCTACTAAAG GAGCAAACTT TTCTTCAAGA 
187501 CGCAGGAGAC TTTGGGCATA GCTTGCCTCT GTATAGATCT TCGTCCCCTT 
187551 GAACATGTTC CTCTTATCTA TGTAGTTGCC TGGATTGATG CGAACCTTGT 
187601 CAGCAAAATC AGCAACTAAC ATAGCTGCTT GAGGGAAGAA GTGGATATCT
187651 GCAACCAAAG GGATATTTAA CCCTAGAGCA ATCAGACGTT CTTTAATTTT 
187701 TTCACAGGCT TGTGCTTCCT TGATTCCCTG TACAGTCACT CTGACAATAT 
187751 CACAATTATG TTCCGCTAGA GCGTAGATTT GCTCTACTGT ACTGTCAATG 
187801 TCTGTGGTTA ATGTCGTTGT CATTGATTGG GTTTTTATTG AGTGGTCACT 
187851 GCCTATGTAT AAGTTGCCTA TTCTTACTGT ATGGGTTTTG CGTCGCGAGG 
187901 AATTGATGGC AGGGGTAATG AGTGTCATAA AAATCTCAAA AATTTCAGAG 
187951 TTTTATCAAT TATAAAAGCC GAGAGAATTT TGTTGAAGCT GATAAAGCTT 
188001 CTCTAAGGAT GCCTACATTT ATCAATTTTA CAGAACTTCA CCTCAATAAG 
188051 AGGAATTTTC AGAGAGAAAC GACGACCTTG GATGGCCGAG TGTTTCAGGC 
188101 CATCAAGAGA GTATATGGAG AGATTATTGT GAATTAAAAT AATTATTTCA 
188151 TACTTATCAA GGGGATAAAG ATTTTGCTTT AGAGGACGAT TCGTAAGGAA 
188201 CTCGAATCTG TTTTTGTGTT CGTAGAACGA ATCCCGAAGT TTGACAAAGA 
188251 GAAGGAGTTC CCAAGAGGAG TGCTTGGGGC TCGTACTTTG GGAACGCTTT 
188301 TTTCTAATTT GAAATCGGTG TATTTTGTGA TGGATGCGTG ATTTTATACA 
188351 TCGAGATTAA CATGATAGCC AGAAGGATTA GGGAGAGCAA GAATAAGGGC 
188401 CCTATAACTG TAAAGTAGGC TCCGACAGAT GTTGCTACAA AGATCATGGC 
188451 TATAGCAACG ATTCCAGAAA CAAAGGTATT GAGGAGGAGT AAAGCAGTTG 
188501 AAGCCGCAAA GGCAATTTTT ATGCATCGTC CTGATCCCGA TAACCTTTCT 
188551 AGGAATTCTC CAAGCTTTGT TTGTTCAACT GGAGATGCAC TTGATGTTCC 
188601 TGTGACTACT GGAGAAGACA TGTGATTTCC TATATTTCTA TGACAAGCCC 
188651 TATAATTTTA TAGTTTTCTA TTTTAAATGC AATGTAATTA GTGTTTTACA 
188701 AATTAATCTC CTACTAAAAT AGGAGGAGAC ACTCTCATAT TTTCTATGAC
188751 CTAGGGAATA AAATATTTTA TTTATAATGA GTGAAAAACA CCGCTGTTTG 
188801 TCTAAGAAAT ACTTTTAGAA GAGAGTTTAC GTTTTATCTG CTGTTCAGCA 
188851 GTTGTTCCTT CGGTTTTGTT TACAAGGGCA AGCTTTATAA CAAGAAGAAG 
188901 CACACAGAGT ACTGTTGTAA GAATAAGGCC TCCTAGGATA ATCAAGCAGA 
188951 TGGGAGGAGT CCCAGGAATA AAGAAAGCTG CAATTATTCC TAAAACTTCT
189001 ATTACTAGAA TGGCAACAAG GATTTTAACT AAAAGTTTAA TCGTCTTCGT
189051 CGTTTTATCT TGATGCTTTG CTTTTAGAGC TTTGGAATTT GGTTCCATGA
189101 GGCCTAGATT TCCAGAGGAA GGGCGGTGGC TTGTGGGTAG GGGGGCTGAG
189151 GACACGGGCA TCGTCTTTTA GTTAGAATTT TCTTTATTTC GCAAAAGCTT
189201 TGTCTTATGG AGAGCAGAGG CGTTTCTATT TGCTTGAGAA AGTATAGCGT
189251 GGTTTAGAAA GTTAGAAAAG ATTTTTTAAA AAATCTTCTA TGTTTTGGAT
189301 GTTTTAAAGA GAGAAGTCTT ACCATTTTTT AAGAAAGGAT TTCCTTTTCA
189351 CAAAAAGGCA TCTACTCTCT TTTGCTTAAC CAGGGGGAGA GATTAGGGTG
189401 TGATGAGGGG ATGTCTAGCA AGAGTTGTCA AAATGATAAC CCACGGTTGA
189451 TTTTGATTTT CTGCTTCTGA TCCAAATGTC TGCAGAGCAT CTATCAAGGC
189501 TAACTTCACA CCATCAATCG ATTGCAGACG TAGACTAGTA GTCTCTTCGT
189551 CTTTTGGAGA P C AG GG AAA TGGGAGGAAA GCAAGGGGAG CTGAACTACG
189601 GTTGTTTTAC GGCGCTTGGC CTCATTCAAA CAGTTAAGGT AGGCTTGCCT
189651 ACAAAAGGTA AACGCATCAT TAGGATTGTA GTTATAGTCA GAAGCTTTCG
189701 GCCCTAGAAG TTGTCCTAAG AATTGCGGTA GTCCTGCTTT ACCTGCATTC
189751 GTGGTTCCGT TTAGATTTTC CCATTTTGCT GAGCGGCATT CTCCTAATTG
189801 TAGGGGCTGT TTTGAACGGA GAGGATCCGG AGATTTTTTC GAATTTTCCC
189851 AACAGCGTAC ACTAGTTGCT TTCGCAAGTG CGAGACTTGT TCCTTTTACG
189901 TTGTTTGCCA TGTTTGGGGT GGCTGCATTC ACAATCATGA CGATGCCTTG
189951 AGTTTTATAG CGAGGCTTTT CAATGGGCCC TAACGTAGAG ATCAAAGTCA


190001 




TTTCAAATGC 


CAAGCATAAC 


CGGTTTGATT 


GTCAGCAGCG 


190051 ATGAAAGGCA TATCTGTGTT AGCCTGTAGA TCAGGTATAC GGTCCCAATT
190101 TTCTTGTAAC AGTGTTTGAT GGGTTCTAGG GGAAGGAAGC GGGGGCTCTT
190151 CTAGGACAGC TTCCGTAGGC ACTGGGGTGA GTGCTAGGGG CGGGACAGGA
190201 GGAAGGTCTG TATCTTTTAT GATGGGCGTC GGCTGATGCG TTACGCTTAT
190251 AGGACTTTTA GGTTCTCTTA AGAGAAAATA GAGAAGAATA CTAAAAGCCA
190301 AGACTGCTAG AGCGGTAAGA ATAAATAATA GAGGAATGCC TAAAATTATA

190351 GCAGCGGTTC CAGAACTTAC AGCAATTCCT ATAATTAGAA TAAAAGCGAC

190401 AAGGTCCAGA CCTTTGAGCT TATGAGATGC GAGTTGCGTG GGCATGCTTT
190451 CGCTGCTGAT CAGCATGGTT AGATTTACGT TAGTCAAATT GGGTTCTGTG 
190501 GTGGACATAA AACTTTTTAA ATAAAAAACA AAAAGTTTAA AAAAGATTCT 
190551 TTTTTATGTG AAGTTATTTA TATTTTAAAT AGAAGTTGTT TATTTAAAAT 
190601 AATAAATAGA CAACATTTTC ACTTTAAAAA GTATTGGATG CGGCTTAGAG 
190651 CCAAGAATCT AGGGGGTGCT TGTGATCAGG ATAAATTGGC TGTGTTTTTA 
190701 GAAAGCTGTT CGATTGGAAG CACGGACGCT TTCCGACATA GACTAGGGAG 
190751 TCGTTAGATC GAAACGGGGT GGAATGATGG GAGGCTGGTC CTTGTCTGTA 
190801 AGGATGATAA TTTTTTGCTC TTCCTGGTTG TCTTGTTCCC ATCCAAAATC 
190851 TTGAAGAGAT GTGATGAGTG CCAATTTTAT AGCCTCGATC CATTCTGCTC 
190901 TTGTTTTCCC GAGGTTTAGA AGTCTTGATG GAGCAAATAG ATTACATCCA 
190951 ATGAGGGGGA GTTGAATCAC ATCAACGCCT ATGATTTCAG CTTCTTGGAA 
191001 CAGGTTATGA AACGCTTTCT TGCTTACTTC AAATGCTTGC TTAGGATCGT 
191051 TGTTACACTT AGCAGCTTCG GGACCAAGCA GTTGTGCTAA GAAGTGTGCT 
191101 TTGCCTGGGA CATGGTCGTT TGAGGTGTGA TCACTATTTT CCCATTTTGC 
191151 TGAGCGGCAT TCTCCTGGCT GTAGTTGGGA TCCAGAACGA GAGTGCGCTC 
191201 TAGGGAGCCT AGATGCGTTC CAACACTGTA GACTTGTAGC CAGGGATAGA 
191251 GCTTTATTCG TTCCCCCTCC TTCTCGGGAG ATGTTCTCGT TTGCTGCGTT 
191301 AACAATCATC ACCCTGCCTT GAGTTTTGAT TCTTGGAACT GCAATATCTC 
191351 CTGAGGTAGA TATAAAGATA AGCTTCGAGT CTTTAAGCTT CCAAATAAAA 
191401 CAGGGCTGCT GAGGTGTTTC TGTAGTAAAC GATGGGTCTA CAGCGGCTAA
191451 GCTCGGAAGA AGATCCCAGT TTTGTTCAAG AAGAGCCTCA TAGCCAGGAG 
191501 TGACCGTGAT AGCGGTAAGT TCTTCTGGGG GTGTGGTTAG ATCAGCAAGA 
191551 GAGACTTCGG GGAGTGATTC GGGATCGGGA AGTGGATCGA TCTGCAAAGG 
191601 GAGGTCCTGT TGCTCAGGAG GCTGGGGAGA GGGAGACAGA GAACTTTGCT 
191651 CAGATTCGGG TTCGATTTGA GGAAGAATCT CGTATAATTT AGGTTTCTTT

191701 AAAAGGGAGT AAAGTGAGAC CGCAGCAATT GCTAATGTTA TGACAACTAC

191751 GGCAGAAAAA ATAGGCATAG CCAAAATGCC AGCAGCAATG CTAAAGCCCA

191801 AAGCGCACGC TAGGGCTAGT GTGAGGGTGA CAATTTTCAG TATTTTCTCG 

191851 ATTCTATCTG TACGGTTGAG AGGGAGTCTA ATCGGATAGG AATGTTTCGC 

191901 AGGAGTTCTG TAGAGACTGG CGTCTGTATA AGAGGGTAGG GGATTAGAAT 

191951 CTGTCATAAT ATTTTATTAA TAGATCAATT TTCAACTATT ATATTACTAA 

192001 TTTGTATTTT TATTAGATTT TTTTATAAAA CAAAATTAGT TTATTAATGC 

192051 ATCCTATTGA AATAGATCTT TTTAGTTAAA AGGGACCATA AGTAGCTGTT 

192101 TGTGGTCTGT AAGGATTATA GTCATGGGAG TTGAGGGGTG CTGCGCAGCA 

192151 AAGGAGCGAA GAGCTTCCAT AAGACCTTTC TTTATATCAT TTACCCACTA 

192201 CGTACGGATG TGGTAAAGCT TATATGCACT GCTATTAGGC TTTGTTTGGT 

192251 TTACGGGTTC TAGTTCCAGC TTTCCTCCAG GTGAGTATAT AGAGGAAGAG 

192301 ATCAGAGGCA CTTGGACCAC AGTGGCTTGG TTATTGAGAG CTTCATCAAA 

192351 ACAGTTCAAA TAGGCTTTCT TAATAACATT GCTTAATTTC TCAGGATGTG

192401 CTTTCAATTC TCCTTCATAT TTAGGACCAA GAAGTTGTGC TAAGAAATGT 

192451 GCTTCTCCTG GGTTCGTATC ATTTATTCGT CCAGTCTCTA TTGATCCAGG 

192501 GTGCTGAGCG GCATTCACCC ACAGATAATC CTTTGCCAGT GTTTATTTTT 

192551 CCCCCAGATG TTCTCGTATT GTTCCAACAA GTAGGGTGTG TGGCTGCTGA 

192601 GAGAGCAGCA TTGGTTCCGG CTCCACCAGA TTGCATGTTC GAATTCGCTG 

192651 CGTTAACAAT CATGACTCTT CCTGAGGTTT TCAGGCGTGG TTTAGCGATG 

192701 TCTCCTGTAG TGGCTACTAA GGTAATTGGG GCTCCTTGAT GTTCCCAGAC 

192751 ATAGTATCTT TGATTAGGAT CTTGGAGAGT CCAGGATATA TTGATTTCTG

192801 AGAGAGTATT GACTAGGGTC CATTCATTTC TCAGCAGCTT TTGGTAATCC 

192851 AGAGGGACTG AGGGTTGCAC TTGAGCTGCA ATATCCTTGG GGACATGTTT 

192901 TTGGGGCCCA AGGAGCTCTT CTGTCGGTTT GGATGGCTCT CTTCTTCTTA 

192951 AAAGAAGACA AGAGAGTACG ACTGCTGCAA GGAGAGCAGC ACCAGTGGCT 
193001


ATAGCCATGA 


GAGGCATACC 


AAGAACTCCA 


GCAACAACTG 


PAP.TPPPTAP 


193051 


AGCTATGGCT 


AAAGCAAGAA 


TAAGAGCTGT 


GAGTTTTATT 


Alii iAoLAA 


193101 


TCACTTCTTT 


CTTAGTAAGC 


ACGGGTGATT 


CTGAGATCAA 


T^T^Tr 1 'h TPT 

TCTC1CA1 C 1 


193151 


ACAGACAAAT 


GTGATTGATT 


CTCGTGTACA 


TCTGGGATTG 


AAG 1 oo 1 a 1 1 


193201 


AGTCATAAAA 


TATCTATTTA 


AAGGAAAAGA TTATATCTTA 


ATT 1 G I 1 vj 1 1 


193251 


TTTTACAAAA AATCTTTTTC TTTGATAATA AAGATTTTTA 


T TV TV TV TTf'TTT 
1 AAA 1 1L1 1 1 


193301 


TTAATTTCAT
i X nf\ A a a a 


TAATTTTAAT 


GAGTTAAGGC 


GTAGCAGGCG 


P Tv p p tt A TTG 
C AC CI 1 A 1 A v? 


193351 


TTT AfSTTP AT 


AAGAGAACTC 


TAAGTGGGTG 


CCACACAGAG 




193401 


A PTTP,GTTP A 


AAAGGGTGGC 


GTTTGTGGTT 


TTATGGGGAA 


Tv rpT^T A PP 
CGAl 1 C IntL 


193451 


PPAGPCATGG 


GATTCAAAGA 


CTTCTTAGAT 


CGACAATACT 


TV /""PUP Tv P Tv TT 
AC 1 A oAun 1 1 


193501 


PT A A TP. GP. GG 


GTGAAGACAT 


GTGTATTTCA 


TGGGCATGGG 


1 AACCnjAooA 


193551 


P A T P A GG TTT 


ACTTTCGAAG 


GCCCATATAT 


CATCTTCAGA 


oo 1 C AA 1 AAA 


193601 


AGGGGGGGGT 


AGGGAGAACG 


ATGTTTTACA 


ACAGTGTAGA 


TV TV TT 7\ TV 7A T 

AA 1 1 AAC 1 A I 


193651 


TTGTGAAATA

A A w A W^TV A *• 


TTTTCATTTA ATAACAATTT 


AATGAGTTTT 


TV J\ *T* r P r P/^ TV TV /"* TV 

AA a i T(jAA(jA 


193701 


p A P A GGP AGT 


GGCTTATCTT 


TTGTCTATAA 


GTCGGTACAG 


n"i/"> j\ O 7\ OTT T\ T 

TCAvjAC a 1 Ai 


193751 


A AGAATTGPP 


GCTTTTTGGG 


TCTTCATAAT 


CTCTATCTTA 


ft.Tv/" , TvTvTVTv?V PP 

AACAAAAAoo 


193801 


AGGTCAATCA 


CAAACTGGGT 


GAGTTGTAGA 


GGTCTCAATA 


TV ^ TV TV TV f*TTT^ 

A(j AAACj I 1 1 


193851 


TAAGGTTAGG 


AGAGCTATAA 


TTGCGCCTGG 


GATTCTGCAT 


Tvf" , TV/" s /" ,, !\TT\r , f" s 

ACj AC (j A 1 AC C 


193901 


TGAAAAGATA 


GTGCCTAGGG 


TCGGTGTTTG 


CTGATGAGAA 


i CjCC AACjAAvj 


193951 


CATAGTCATC 


AACCGTCTTA 


ACATATTTAT 


AAGTTTTCTT 


jv i\ r*r*r* tv tv t 
ACjC AvjCvjAA 1 


194001 


GGCACAGCTG 


CTAGAAAGCA AAACAAGATG ATAACAGCCA 


CC AATAGAA 1 


194051 TGTTCCTGAG ACAGCAAGGA CTGCAATGGC AGCTGGATGT CCAGTTTCTA
194101 GAGTGAATTG CAGAGCAGGG CGGATTGCGA ATAAACTAAG AACTATGAGG
194151 CAGCATCCGA TCACTAGGAG AACTTCCAAT ATGACAGAAG TGATTGTCCG
194201 TGGAGAAGAC TCAGCTTGCT GCTTCCCCTT GAGTTCTATG AGTTTCTGAT
194251 TTTGTGTGTT TGCGCTATTG ATTAAGGGAG CGGCTGGACT TTCAAAAAGA
194301 ACGTCCGTAG CAGATACTGG AAGATATCCC ATTTATTTTA ATATCCTTTA
194351 ATTAAAATTT GCGCAGATAT TATACCTTAA TTTAATTTTT ATTAGAGAAT

194401 AAAATTCTAT ATTTTCTTTT GTGTAATATT TTTTTATAAA AACATGAGTT 

194451 TTTAATTTAA CTTAGAAGAG AAGTGTTGCA ATATCTTTTA TTAAAACAAG 

194501 AGTACCTAAG AATTCGTGCG TAAAATATTA CGTATTTGAT TTGTATAAGT 

194551 CGTGAGAGTG TTTTTTCTAT TATTGAGGGG CTATTTTGAT TTTTCTTGAA 

194601 TTGCGTCATG GAAAGATCTT GGAGGCTTTA GGAGCCTTTT TGTCCCTCAT

194651 CATCTCGAGA TAACCAAACT GAAAAGATGC GTACCAAGAG TTTGTTTTTT 

194701 CGTTCGCGAA TCTACTGTAC CCTTGATTTT AGGATAGCCG TTTTGAGTGA

194751 GATAGAGATC CTAAGAGAGA TCCTCAATAC TTTCTTGAGG TAGTTGGGCT

194801 AAGGATCTTT CAACACAAGC AATAGTTAGC GGATATTCTG GGTGCAGCTC 

194851 GGCCATGTAT TTTACTGCTG TAATCATAGC CAAAGATTGT TGCATTCTTG 

194901 ATTGCGGGGG CAATTCTCTA GAATAGAGAT GCTCCCTCAA GCTGTATAAA 

194951 GGGATTTGGA TAATTTGGGT GTTCGTATTG ATCGCAGTCT GGATATAAGT 

195001 GACATAGAGT GCATAGTAGT ATTGGTAGAT GGCTTCAGGA GTTGTTGGTA 

195051 TTTGTGGGAG GAAGACGCGG CCTCGGAAAT CGGGTTGCCA GTATAAACGT 

195101 TCTCCTGAAT CTCCCGCAGG GAGTTCTGTA TTTATTTCTA CCCTGGTTTC 

195151 CTGAATTGCC TCATTGAGCA AGGATTTTGG GGAGGGAGTG GTATCTACGT 

195201 AGGAGGATAT AGGCGCATTT TGCGAGGTTT CTTCAATTAA CAGAGTCTTT 

195251 CCTAAATTTT CGGAAGCTGA AATTAAGAGA TGTTCTGGAT TGCCTCCCGG 

195301 TAGGAGGGAT ACTAGGGAAA ACAGTGAATG TTTAGCTTTC CATATCTCGT 

195351 AAGGGTTATC TGTGAGTAAT GCAGAGGGTG CGGGGAGATT CTGTAGGTCC 

195401 TTGGGTTGGT GTTCCCAGTT TTCTTTAGAA CGATTGATTG CGATAGAGAT 

195451 AGAAGAATAC TGTAGGTTGG TGGAGACCCT TTTTGGTTTT ATTTCTTTGT 

195501 GTTTGCCACA AGCACGCGGT TTTAAGAGGG TGGTTGCTCG TTTATAAAGG 

195551 ATAAAGTTGC TAAGAACAAC AGCAATAAAT GAAATTCCTG TGAGGAAATA 

195601 GATAACAGGA ATATTTAAAA GCAAACCAAT AAGTAGGGTC CCTATAGTAA 

195651 CTAAGACAGC AAAGATTGCA GTAATCAATA GAATTCGAGA CTGAATCTTA 
195701 TCAAAGTTGC TGCAGGATTT TTTTTTAGTG TCTACTAGGG GATGATCTAC
195751 AGTTAAAGAT ATGGGTGAGA TTGTAGCCAT AATTAAGGAA TTCACCTGAG 
195801 TTCCTTTAAA TTATATCTCT TTTATTGATA TTTAAATAAT TAAAAACATT 
195851 TCTGCATTTA AAAATATTTT TTAATTTGTC ATCTTTCTAA GTAAATATAA 
195901 AAAATAATAA GTTTATTTAG TTAAATTTTT TTTGAAAACA AGAAAGATAG 
195951 GGGAAAGGCT CGCGAGTAAA TCAAGCCTTT CCTGAATGCA AAGAGTCGCA 
196001 TAACTACTTA GAAAAGAGTT CGTCAGAATA GAAGTTGTAT TAATATAAAA 
196051 TTTTTATTTT CTAAGATTAG AAAGTAACTT TGACACAGCC ACCTTTGATG 
196101 CGGCACTGAC AAGCAAGACG TTCGTTAGAG TCTTCGGGTT CTCCTAGAAA 
196151 ATCGTATTCT GGTTCCGTAA ACTCAGAAAG ATTCTCACGT CCTTCTAAGA 
196201 CCTCTATCAC ACAAGTTCCA CAGACACCTT CTGTACAAGC AAAGGGAATG 
196251 CCCATGGATT CACAAGGCTC TGCGATCTCA CTATTGTCTT CTAACTCGAA 
196301 CTCTTGTTGT TCATCATCAG AGGTAATGAC TAGCTTGGCC ATGGAGTTTT 
196351 CCTTCTACGT ATTGTCGGAT AGATTAAAAT GAAGTTCGGA GTAGAGGGAT 
196401 TCGAACCCCC GACCTATTGC TCCCAAAGCA ACCGCGCTAA CCAAGCTGCG 
196451 CTATACTCCG TAAAATAAAA TTTTCGAGTA AAAGAGATCA ACGTTAGCAT
196501 AGAAGGAAAT AGTCGACAAG ATCAAAGATG CTTCATAACG TCTCTTTAGA 
196551 GTTTTACTTG CAGATATATG GAAGAGGGAA GTATGAGAAA AGGCAGGGAT 
196601 TCTCTAGGAG GCTGTTTTTG TGTCTGGGAA GAAAGATGGT GTAAGGGGAA 
196651 TGATCTTTGT CCCTCTTAGC ATCCTAGTAC TAATCTTTTT ACCTCTTCCT 
196701 CAGATCCTTC TTGATTTTGG ATTGTGTATT AGTTTTGCAT TGTCTTTACT 
196751 AACGGTCTGT TGGGTCTTTA CCTTAAATTC AAGCAATTCA GCGAAGCTTT 
196801 TTCCTCCATT TTTCTTATAT CTTTGCCTAT TGCGGTTGGG ATTGAATCTT 
196851 GCATCAACAC GATGGATTGT CTCTTCAGGA ACCGCCTCTT CTCTGATTGT 
196901 TTCTTTAGGC AGTTTCTTCT CTTTAGGAAG TCTATGGGCA GCAACGTTTG 
196951 CGTGCCTCCT TCTTTTCTTT GTGAACTTTT TGATGGTTTC AAAGGGTTCG 
197001 GAAAGAATCG CAGAGGTCCG TTCGCGGTTT TTCTTAGAGG CTCTTCCAGC 
197051 AAAACAGATG GCTTTAGATT CTGATCTTGT TTCTGGAAGA GCTTCTTATA

197101 AGGCTGTCAA AAAACAAAAA AATGCCCTTA TAGAAGAAGG GGATTTCTTC 

197151 TCTGCCATGG AGGGGGTCTT TCGTTTTGTT AAAGGGGATG CAATTATTAG 

197201 TTGTATCCTT TTACTCGTGA ACGTAGTTTC TGTAACTTGT CTTTATTATA 

197251 CTTCGGGTTA TGCTCTTGAG CAGATGTGGT TTACAGTTTT AGGAGATGCT 

197301 TTAGTGAGTC AAGTACCTGC TTTACTTACT TCGTGTGCTG CAGCCACTCT 

197351 TATTAGTAAA ATCGATAAGG AAGAGAGCCT TTTAAATTAC CTGTTCGAAT

197401 ACTACAAACA GTTGCGTCAG CATTTCAGGG TGGTGTCGTT ATTGATCTTT
197451 TCTTTGTGCT GCATTCCCAG TTCTCCAAAA TTCCCTATCG TTTTGCTCGC
197501 GAGTCTTTTA TGGTTGGCGT ATCGAAAAGA AGAGCCTGCA TCAGAAGATT
197551 CTTGTATAGA ACGTGCGTTC TCTTATGTTG AGGGGGCCTG CCCTAAGGAA
197601 CAAGAATCAC AGTTCTATCA AGTATATCGT GCAGCATCCG AAGAAGTATT
197651 TGAAGATTTA GGAGTTAGAT TGCCTGTGCT TACTTCTCTA CGTATTGAAG
197701 AGCGTCCTTG GCTCCGAGTA TTTGGCCAGA ATGTATACTT AGATGAAATG
197751 ACTCCAGAGG CTGTGCTTCC TTTCCTTAGA AACATCGCTC ATGAGGCTCT 
197801 CAATGCCGAG GTAGTTCAAA AGTACCTTGA GGAATCAGAG AGAGTGTTTG 
197851 GCATCGCTGT TGAAGACATC GTTCCTAAGA AAATCTCTTT AAGCTCTCTT 
197901 GTAGTTCTTT CTCGCCTCCT TGTTAGAGAA AGGGTATCGC TTAAGCTTTT 
197951 CCCAAAGATT CTAGAGGCCG TTGCGGTATA CCAAAATTCT GGAGACAGCT 
198001 TGGAGATCCT TGCGGAAAAA GTGCGAAAGT CTCTCGGATA TTGGATTGGG 
198051 AGAAGTCTCT GGGATCAGAA ACAAACCCTT GAGGTAATTA CCATAGATTT 
198101 TCATGTTGAA GAATTGATAA ACAGCTCATA CTCAAAGTCT AATCCTGTAA 
198151 TGCAAGAGAA TGTGATCCGT CGAGTAGACA GTCTTTTAGA ACGGTCGGTA 
198201 TTTAAAGATT TTCGAGCCAT AGTTACGAGC TGTGAAACAC GATTTGAGAT 
198251 GAAAAAAATG CTCGACCCAC ATTTCCCTGA TCTTTTGGTT TTATCTCATG 
198301 ATGAGCTTCC TAAAGAAATC CCTATTTCCT TCTTAGGGAT CGTTTCAGAT 
198351 GAGGTTTTAG TTCCTTAATT TAATTTAATC TTCTGTAAGC ACTATTACTT 
198401 TCGTATTTTT TTTGTTGGTG TAAATATTGT TTAAAAATTT TTTTTGAATT
198451 AATCTAATAA ATAACTAGAT AAAAAAAAAT TTGTGAAAAC ACAGCAAACT
198501 CAAAACATCA TAGAGGTTTG GAACTTCTAC TGGGAGACTC AGGAAATAGA
198551 GTATCGCGAT AGCTTAATTG AGTTCTATTT GCCTTTAGTA AAAAGTGTGG
198601 TTCATCGTTT GATTTCAGGG ATGCCTTCCC ATGTAAAGAC CGAGGATTTG
198651 TATGCTTCGG GTGTTGAAGG TCTCGTCCGT GCGGTGGAAC GTTATAATCC
198701 TGAGAGAAGT CGTCGTTTTG AAGGTTATGC GGTATTTCTG ATTAAGGCTG
198751 CCATTATTGA TGATCTGCGT AAGCAAGACT GGGTTCCTCG TAGTGTCCAT
198801 CAAAAAGCGA ATAAATTGTC AGGAGCTATG GATTCTCTTC GCCAGTCTTT
198851 AGGCAAGGAA CCCACGGATC TTGAACTGTG TGAGTATCTC AATATTTCGC
198901 AACAAGAGCT TTCGGGATGG TTTGTATCTG CCCGTCCTGC ATTAATCGTG
198951 TCTCTGAATG AAGAGTGGCC TTCACAAAGT GATGAAGGAG CCGGAATGGC
199001 TCTTGAAGAG AGAATCCCCG ATGAACGTGC CGAGACAGGG TACGATGTTG
199051 TAGATAAACA AGAATTTTCT TTATGTTTAG CCAATGCGAT TCAGGAACTT
199101 GAGGAAAAGG AACGCAAGGT CATGGCCCTG TACTACTATG AAGAACTTGT
199151 CCTTAAGGAA ATCGGTAAGG TCCTTGGGGT AAGTGAGTCT CGCGTCTCTC
199201 AAATTCACTC TAAAGCATTG CTTAAGCTTC GCGCAGCACT CTCTGCATTT
199251 CGATAAATAC AGTTCTCAGG TTTTAAGAGC AGTCCTAGAG CTAGGAGAAG
199301 CCCTCCTACG ACACAGAGTA ATCCGTAAAG AATTTGTTTA GTCAACGCCA
199351 TAACAATGCA GCCAGCAATT GTGAAGATTG CTCCTACAAT CAAGAGAAGA
199401 ATGCGGGGCA GCCATTCTTT AATACGTTCG GCTAGGGTAG AGGTAGGAAG
199451 GGGTGCCTTG CTAGTTGTGG CATTTAGACT AGTTGCGGCG TTTAGCATCG
199501 ACGGAACCCC AATTGGAGAT GTAGACATAG CGACCTAATC TTTAGAGAAG
199551 ATAGAAGTTA TGGTATCTCA GCAAAGGAAT TATTCTCAGC GTACAATCTT
199601 TTCTTTATTG TTAGATAGGT GGTGCCTTGC ATATAAGCTT TAGGATCGAT
199651 AAGATAGAGC TTGGCAATTC TTAATTATGT ACGATCACTC ATGCAATCCT
199701 GGTTACAATC TTTACAAGAG CGAAATATTT TAGAGAATTT TACCGCAGGT
199751 TTGGAATCCG TAGAGGGACC TATCGCCGCT TATTTAGGAT TTGATCCTAC

199801 CGCACCTGCT CTACATATTG GTCATTGGAT TGGGATTTGT TTCTTGAAGA 

199851 GACTCGCTGC TCTGGGGATT ACCCCCATAG CTTTAGTCGG GGGAGCCACA 

199901 GGTATGGTTG GAGATCCCTC AGGGAAACAG AGCGAGAGAT CGTTACTTCA 

199951 GACAAGTGAA GTTTTTGATA ACAGTCAAAA GATCACGGCG TGTCTCCAGC 

200001 GCTATCTTCC CGGGGTGACT CTTGTAAATA ATGCAGACTG GTTGCAGGAG 

200051 ATCTCCCTGA TTGATTTCTT AGGGGATATA GGAAAACACT TTCGTTTAGG 

200101 CCAAATGCTA GTGAAAGATA CAATAAAGCA GCGGGTGCAT TCTGATGAAG 

200151 GAATTAGCTA TACCGAGTTT AGCTATTTAA TCCTGCAATC CTATGATTTT 

200201 TATCACTTAT TTAAAAATTA TGGCACGATC TTGCAGTGCG GTGGTAGCGA 

200251 TCAGTGGGGG AATATTACTT CAGGAATCGA TTTTATTCGC CGTAAAGGGT 

200301 TGGGTCAGGC CTACGGCCTT ACCTATCCTT TATTAACGAA TGCTCAGGGG 

200351 AAAAAAATAG GGAAAACAGA GTCGGGAACT GTATGGCTCG ATTCAGATTT 

200401 AACCTCTCCT TTTGAGCTGT ACCAATACTT ACTCCGTTTG CCCGATGATA 

200451 CCATCCCTAA AATTGCTCGT ACGTTAACTT TATTGAGCAA TGAAGAAATT

200501 CAAGATATTG ATAGGCGTGT ACAGACGGAT CCAGTTGCAG TGAAGGAATT 

200551 TGTAGCCCAA GATATCTTAA GTGCTATTCA TGGAGATCTA GGGCTTGAAG 

200601 AGGCTCTTTC TGTAACTCGT AGCATGCATC CAGGGAATCT TTCATCCTTA 

200651 TCGGAAAAAG ATTTTCATGA ATTGTTTGCA GGAGGGATGG GGGCCTCATT 

200701 GGATAAATCC GAGGTGTTAG GGAAACGTTG GTTAGACCTA TTTCTTGTTT

200751 TGGGACTATG TAAATCTAAA GGGGAAATTC GAAGGCTAAT TGAACAAAAA 

200801 GGGGTATATA TTAATAATGT GCCCATCGCT AATGAGCATA GTGTTTGTGA 

200851 AGAACAAGAC ATCTGTTATG GTCACTATGT GTTGTTGGCT CAAGGTAAAA

200901 AACGAAAGCT TGTTCTATAT TTAAATTAGT TTAAGGAGGC TAGGTAGCTT 

200951 TGCAAACGAA TATTGGTCTT ATTGGCTTAG CTGTCATGGG GAAAAATCTT 

201001 GTCTTAAACA TGATAGATCA TGGTTTTTCT GTCTCTGTCT ATAATCGGAC 

201051 CCCAGAGAAA ACGCGGGACT TCTTGAAAGA ATACCCTAAC CACCGAGAGC 
201101 TTGTAGGGTT TGAATCTTTA GAAGATTTTG TGAATTCATT GGAGAGACCA
201151 CGAAAGATCA TGTTGATGAT TCAAGCAGGG AAACCTGTGG ATCAGAGCAT
201201 TCATGCGTTA CTGCCTTTTC TAGAACCCGG CGATGTGATT ATCGATGGGG
201251 GGAATAGCTA TTTTAAAGAT TCCGAACGAC GATGTAAAGA GTTGCAAGAA
201301 AAGGGGATTC TCTTCTTAGG CGTGGGGATT TCTGGAGGAG AAGAAGGTGC
201351 ACGTCACGGC CCATCAATTA TGCCTGGAGG AAATCCTGAG GCGTGGCCAT
201401 TAGTGGCTCC TATTTTTCAA TCAATAGCAG CAAAAGTACA GGGCCGTCCC
201451 TGCTGTTCTT GGGTAGGAAC TGGCGGTGCA GGCCACTATG TAAAGGCTGT
201501 TCACAATGGT ATAGAATACG GCGATATCCA GTTGATATGC GAAGCTTACG
201551 GTATCTTAAG AGATTTCCTA AAGCTCTCCG CAACTGCCGT TGCTACAATT
201601 TTGAAAGAGT GGAATACTCT AGAGTTGGAA AGCTATCTAA TTCGTATTGC
201651 TTCTGAAGTC CTAGCATTGA AAGATCCGGA AGGAATCCCT GTTATTGATA
201701 CGATTTTAGA TGTCGTGGGC CAAAAAGGTA CAGGAAAGTG GACCGCAATC
201751 GATGCTTTAA ATTCTGGAGT TCCCCTTTCC TTAATCATAG GAGCTGTTCT
201801 TGCTCGTTTC CTTTCTTCTT GGAAAGAGAT ACGCGAGCAA GCTGCCCGTA
201851 ATTATCCAGG AACCCCCTTA ATATTTGAAA TGCCCCATGA TCCCTCGGTA
201901 TTCATACAAG ATGTCTTTCA TGCTTTATAC GCTTCCAAGA TCATCAGCTA
201951 TGCTCAGGGA TTCATGCTTT TAGGAGAAGC TTCAAAAGAA TATAATTGGG
202001 GATTAGACCT AGGAGAAATT GCTTTGATGT GGCGCGGGGG ATGCATTATT
202051 CAAAGTGCAT TTTTAGATGT TATACATAAA GGATTTGCTG CCAACCCAGA
202101 GAATACCTCG CTCATCTTCC AAGAATATTT CCGTGGAGCA TTACGCCATG
202151 CGGAGATGGG ATGGCGTAGA ACAGTAGTGA CTGCAATTGG TGCAGGGCTA
202201 CCTATTCCCT GTTTAGCAGC AGCAATCAGG TTTTATGATG GCTATCGTAC
202251 AGCAAGCTCT TCAATGTCGT TAGCTCAAGG ACTGCGAGAT TATTTTGGAG
202301 CTCATACCTA CGAGCGTAAC GATCGCCCTC GAGGAGAGTT CTATCATACC
202351 GATTGGGTGC ACACGAAAAC TACAGAAAGA GTGAAGTAAA AATAAAAAAT
202401 CTCGAGAAGA CCCTAGGTAG CTCGAGATTT AGTTCACCAC TACCGAATTT


WO 00/27994 PCT/US99/26923 

202451 TCAATTGTAA GCAGTGTGCT GATTTTAAGC GTCAATGTTA ATCTAATTTT

202501 AGAACTTCAA TGAAAGCTGT ATTGGGAATG GAAACTTTTC CAAATTCCTT

202551 CATACGTTTT TTTCCTTTCT TTTGCTTTTC CCACAGCTTG CGTTTCCTAG

202601 TAATATCTCC GCCATAACAC TTTGCGGTCA CGTTCTTAGA AAGCGCACGA

202651 ATCGTTTCTC TGGCAATGAC TTTTTTGTTA ATGGCAGCTT GGATGGGAAT
202701 CTTGAAGAGT TGTTGTGGAA TCACGTCCAC AAGCTTTTCG CAGATACTTC 
202751 TTCCACGAGA TTCTGCTTTA TCTCTATGGA CTAAACAAGA AAAAGCATCT 
202801 ATGGGCTCCT CGTTAATAAG AACCTCTAAT TTGATGATCG ATCCCTTACG 
202851 GTAATCCCCA AGACGGTAGT CAAAGGATCC ATAACCTTTA GTTACTGACT 
202901 TCAGCTTGTC ATTGAAATCC GAGACAATCT CATTTAAAGG GAGTTCGTAA 
202951 GCAAGAACTA GACGGTGCTG ATCTAGCATT TCTGTTTTTA CGCAGATCCC 
203001 ACGTTTATCT AAACAGAGGT TCATAATGTT GCTCAGATAT TCTTGAGGGG 

203051 TGATAATATT CACATGAACC CAAGGCTCTT CCACATGCTC GATGATCGCA
203101 GGATCCGGAT ATCCTGAGGG GTTATCAATA TCTAGAACTT TCCCGTTTTT 
203151 TAAGACGACT TTATAGATGA CACTTGGAGC CGTTGCAATA ATATCTAAGT 
203201 CAAATTCTCG AATGATTCTT TCAAAGATAA TCTCAAGATG AAGAAGTCCT
203251 AAGAAGCCAC AACGAAAACC AAAGCCTAAA GAGTGACTGC TTTCTTGTTC
203301 TATAGTTAAA GCAGAATCAT TGAGCTGTAG TCTTCCTAAA GCATCTTTCA
203351 AAGTATCAAA ATCAGAAGAA TCTATAGGAT AAATTCCAGC AAAAACTACC
203401 GGATTGATCT CTTTGAAGCC TTCCAAAGGA GTTTTTGCAG GATGTTTTGT
203451 TTTCGTGACT GTATCGCCGA TCTTCACATC CTTCACTTTT TTGAGATTGG
203501 CAATAAAAAA ACCCACCTGA CCAGGGCGTA AGGAACCTTC TATAAATGTT 
203551 GCTTTAGGGA GAAAGGCCCC TATACCTAAG ACTTCAAACG AGGAGCCTTT 
203601 AGCCGCCATA AAAGTAATGC GGTCTCCTTT TTTTAATTCC CCGCTAATAA
203651 TGCGTACGTA GACCATAATG CCAACGTAAG GGTCATAATG AGAATCAAAG
203701 ACTAAAGCTT TAAGCTCTGT TTCTGCAGGT GCTTTTGGAG GAGGAACAAG 
203751 ATCGATAATT GCTTTCAGGA TTGCAGGGAT CCCCTGACCT GTTTTTGCAG 
203801 AACAGGCAAT AATGTTCGTA GTGTCTAGGC CTATATAATC TTCAATCTGT

203851 TGAGCAATTC TCACGGGATC AGCGGCAGGT AGATCAATCT TGTTTAATAC 

203901 AGGAATGATC TCTAAATCTC TTTCAAGGGC CAGGTAGACA TTAGCAAGAC 

203951 TTTGTGCCTG CACCCCCTGG GCGGCATCTA CAATAAGTAA GGCGCCCTCA 

204001 CATGCAGATA GAGATCGAGA GACTTCATAC GAAAAGTCCA CGTGACCAGG

204051 GGTATCAATC AGGTTCAGTT GATACACCTC TCCTTCATAT AGATACGTCA 

204101 TGGTGACAGG ATGAGCTTTA ATTGTAATGC CACGCTCTCT TTCAAGATCC 

204151 ATGGAATCTA AGAGCTGCTC ACGCATCTCC CGTTCTTCTA CTGTGCTCGT 

204201 ACTTTCTAAA AGGCGATCAG CAATTGTAGA CTTCCCGTGA TCAATATGCG 

204251 CTATGATTGA AAAATTGCGA ATGTTCTCTA TCTTATATTC TTTCAAAATG 

204301 TACTGTAGTG TTATCTAGGT TTATTCCTGG TTTCATAAAG CTGCATTGAG 

204351 AAGGCTCCTT CAGACAAAGA CCAGCTTCTC AAATACTTTT GCGGACTGCA

204401 TCTCTTTATT AGCAAAATGA TCATGCGGTC ATCTTTAGCT TTGGATAGAT

204451 GAAAGTATAA TAGATGAGAT GTAGAAACCA CAAGGGTTTA AAGTCGAATC

204501 TGATTTGAGG TAGAGTTCCA GAGTGGCTAG CAGTAGAAAA AACTAAGTGA 

204551 GAAGAAGTGC TCTCCGTTAG TATGAAACTG ATTCCCACTC AGGATTCTAT

204601 AGAAAGGGAA ACCGATTCTA AAAGAGATAA AAAAATATTT ACCATTTACA 

204651 TATGTTCATC TAAAGTCCTT GCGGGTCATT TTTTCAGTCA TTTAGACAAG 

204701 CATAATAAAA TTCATGAAAG CATTGGGGTT TGAGATAGTT AAATCGACTC 

204751 GATCCAAGAG TATAAGAGAG GAATGAGTGT TCTTATGTCC GAGATAGTGC 

204801 TCTTCAATGT AGTGAAGAAC TGGAAAGTCA GGAGATCTAT TTGAGTAGGA 

204851 GTTCAGGTCT TCGCAATCGA TTTTTTCAAG CTCTGGAGTC TATGTGAAAG 

204901 AAATTTATAG AAGTAAAATA ATTCCATGAA TTCTAAAATG ATTAGGAAAA 

204951 TAATATAACA CGCTGATACT CAAGTCACAA TGGAGCTTCT ATGGTTAATA 

205001 TACAGCCTGT GTATAGGAAT ACCCAAGTCA ACTATAGTCA GGCTACCCAA 

205051 TTTTCGGTGT GCCAGCCAGC GCTTAGCCTG ATTATCGTTT CTGTTGTTGC 

205101 TGCTGTACTC GCTATTGTAG CTTTGGTATG CAGTCAATCT CTTTTATCCA 
205151


TAGAGTTAGG 


AACTGCTCTT 


GTTCTAGTTT 


CTCTTATTCT


TTT 1 VjL 1 1 L 1 


205201 


GCTATGTTTA 


TGATTTATAA 


GATGAGACAA 


GAACCTAAGG 


AG I I 1 on I 


205251 


CCCTAAGAAA 


ATCATGGAAC 


TCATCCAAGA 


ACATTATCCA 


AGTATTvjI Ivj 


205301 


TTGATTTTAT 


TAGAGATCAG 


GAGGTTTCCA 


TTTATGAGAT 


ACATLAL I i L- 


205351 


ATCTCTATTC 


TTAATAAGAC 


GAATGTTTTC 


GACAAAGCAC 


CAGTATA III 


205401 


ACAAGAAAAA 


CTCTTACAGT 


TTGGCATTGA 


GAAGTTCAAA 


GAl G I ALA 1 L 


205451 


CAAGTAAGCT 


CCCTAATTTT 


GAAGAAATTC 


TTCTACAGCA 


TTGLLLAi 1 L^ 


205501 


CATTGGTTGG 


GACGTCTGGT 


ATATCCCATG 


GTATCGGATG 


I LAC- I LLAyo 


205551 


AACCTATGGA 


TACTATTGGT 


GTGGTCCTTT 


AGGACTGTAC 


(jAUAALViL J. L 


205601 


CCTCTCTTTT 


TGAACGTCGA 


TCTCTTCTAT 


TGTTAAAGAA 


* n trim x ^/"*Mwi*fT' 

AATTAG CTTi 


205651 


GGAGAGTTTG 


CTCTTTTAGA 


AGATGGTCTC 


AAGAAAAACA 


CGTGGAGTTL 


205701 


TTCGGAACTC 


GTTCAAATCA 


GACAAAACCT 


TTTTACAAGA 


TATTATGL 1 <j 


205751 


ATAAAGAAGA 


GGTAGATGAA 


GCAGAGTTAA 


ACGCTGATTA 


CGAACAvj ill 


205801 


GATTCCCTCC 


TTCACCTTAT 


TTTTTCTCAC 


AAGCTCTCTT 


Vj AAAuL AAtj 1 


205851 


GCAATTATTT 


CAATATATGA 


ATGAGTCCGG 


ATGGGATTGG 


CTTToioAl 1 


205901 


TTGATTCTCA 


AGGCGAGGGA 


TTCCAGTTAT 


CACGTCTGGT 


1 VjvjoL 1\J 1 1 A 


205951 


CATTCGTCCT 


GGGCATTATA 


CGAAGCAAAA 


GAGCAATTTT 


ALL 1 1 LL 1 oA 


206001 


GGTTTCTCTA 


TTGACCTGGG 


AAGAACTGAT 


AGAAATGCAG 


TT ATTAAtjL A 


206051 


AACCAACAAA 


ACACGGGGTT 


GCAAAAGATC 


TTTGTAATGT 


ATTTGAAAAA 


206101 


CACTTTCAAA 


GGTTTAGACA 


GTACCTAGGT 


TCCTTAGATC 


T AAATL AAACj 


206151 


GTTCGAAAAT 


ACCTTCTTGA 


ATTATCCTAA 


ATACCATTTA 


GATAGGuACj I 


206201 


GAGAAAAAAA 


TCCTAGGTCA 


GCTTGGCAGA 


AATTTTTGAA 


ATCClTAAGr 


206251 GTTCGATCTG CATTTTTTTC GGGGATTGTA AAAAGTTTGC CTAATCTGAA
206301 GGGTAAGAGA GAGCTATTTT TCATGGGATT TTTTATTACA GGAAATTCTT
206351 GAATTAGAGA TTTTTATTTA CACATAATCT ATACTGCCTT CAATAGATCT
206401 ATATCTAAGG AGTTGGCTAT GAGCATGACG ATCGTTCCAC ATGCTTTATT
206451 TAAAAATCAT TGCGAGTGTC ATTCTACCTT TCCTTTGAGT TCAAGGACTA
206501


TTGTAAGAAT 


AGCCATTGCC 


AGCCTCTTTT 


GTA 1 AGG I oC 


A TT A PP APPT 
Mil nOUnuv. J. 


206551 


TTAGGCTGTT 


TGGCTCCTCC 


CGTTTCTTAT 


Ax 1 G 1 IooIjA 


PTPTTTTAPP 


206601 


TTTTATTGCC 


TTTGTCATTC 


TTTCTTTAGT


AAl X 1 1 Aot,l 


TTP A TTTTTG 


206651 


GAGAGAAGAA 


GCTTCCACCA 


ACACCAAGAA 


TCA1 IL.LJ.oA 


TAPATTTAPT 


206701 


CACGTGATAG 


ATGAAGCTTA 


TGGCCTTTCA 


ATCTCTGCA1 


TTPT1 AP APA 


206751 


ACAGCAGGTA 


ACATTAGCCG 


AGTTTAGACA 


ATTTTC I AC 1 




206801 


GTAACATATC 


TCCTGAAGAG 


AAAATCAAAC 


AATTGCC i i c 


TP A ATTPPPA 


206851 


AGTAAAGTAG 


AGAGTTTTGG 


TATTAGCAGG 


CTCGCAGG I G 


* rprprp A P A A A A 
A 1 X X Aonnnn 


206901 


GAATAATTGG 


CCAATATTTG 


AAGATCTTTT 


AAGCCAAACC 


lbLtLol XM1 


206951 


ATTGGCTTCA 


GAAATTTATA 


TCAGCAGGAG 


nm^ORrh APT 

ATCCAC AAG 1 


TTPT^r A PAP 


207001 


CTAGGTGTCC 


CTAGAGAATG 


TTATGGGTAC 


TATTGGC I AG 


OUV. LIU 


207051 


ATACAGTACA 


GCTAAGGCTA 


CAATTTTTTG 


TAAAGAGACG


/"> A r PP A T A TTr 


207101 


TTCAACAATT 


AACGAAAGAG 


GACGTTCTTT 


TAi 1AAAAAA 


p a a rr*c r rc > 'T r r 

\~ ArtoVjV- 1 v. X X 


207151 


CAAGAGAAAT 


GGGATACTGA 


TGAAGTCAAA 


GCAATTG 1 AC 


irrrtTiTPT a 

/io Lulni V- in 


207201 


CACTACCTAT 


ACGGCACGAG 


GAACTCTAAA


/^Arr^R A PP A 

GAGCGAAGGA


uouoonL x xrt 


207251 


CAAAAGAGAC 


AATCAGTAAG 


GAATTGL tat 


1 G 1 J. unul 1 1 


PP A TPPPT AT 
uLn X OOV» x n x 


207301 


TCTTTTGATC 


AGCTACAGCT 


GATCACTCAA 


CTTGG x AG AO 


ATPPTTPPP A 
/i X bL X X OUUn 


207351 


TTGGCTGTGT 


TTTGTAGATA 


ACAGTACCGC 


ATACAACC1 1 


r* & OPTTTPTP 
LnuL 1 1 lulu 


207401 


CTCTTGTAGG 


AGCTTTGTCA 


TCCCAAAATC 


TTCTTGACGA 




207451 


GATTTTGATG 


TAAACCTAGG 


CCTGTATGTG 


ATTCAGGATG 


rp Tv > ^7A/^TA7i^ ,, /^ 

I AAAAvjAAoC 


207501 


TGTTCAAGCA 


TTTTCTGCTT 


CTGATGAGCC 


AAAGAAAGAA 


CI AGGI AAA1 


207551 


TCTTGTTAAG 


GCATTTGAGT 


TCAGTTTCTA 


7i /"•/"»/** a it»<ti * * 

AGCGATTAGA 


GAG 1 G 1 A I x A 


207601 AGACAGGGTC TTCACAGAAT AGCTCTAGAG CATGGAAATG CCAGAGCTAG
207651 GGTTTATGAC GTCAATTTTG TAACAGGAGC TAGAATTCAT AGGAAGACGA
207701 GTATCTTCTT TAAAGACTAA ACCAGGTAAC TAGCTTTTTA GTCTCGAAAG
207751 GGGCTACGAG AGGGAGTAGA ATTTTTCTTA TTCCAAAATA GAAATCTCTA
207801 ACCTATCAGC AGGGAGGACT CGTCTTCTTC AGGTTGGGAG AAGACGCTTC
207851


TCTTGAGTGG 


ATGGACTGTT 


ATAGGTGG TA 


AGC 1 X CI AAA 




207901 


TGAGTTGCTA 


AATTCTCAAG 


GAACATCGCT 


ATATTovjI 1\j 


AC i uALonLo 


207951 


GATCGTTTTA 


AACTGATCTA 


AAcTAAGGAG 


4 vj 1 CAA 1 uAA 




208001 


CTTCGCTTTC 


TTTATGAATT 


AGGCCTTGTG 


TG T AbAUAO 1 




208051 


GAGGCAAATT 


CCATGTCTTG 


GGGTCGTCCT 


GTATGGT CIA 


AAo 1 LAnuwn 


208101 


ACAGAGGTTT 


TCCCATAAAT 


CAGCCGGGAC 


TGTTTTTA1 1 


a rpp a r*ATPT 


208151 


GCTTCCACGA 


GTACCTATGC 


TTAAAAATAC 


AC AAG AG C AO 


OTTATArtA 2VT 


208201 


TCATCCTCTT 


GAAGAGAGGG 


TTGAGATAGA 


TCT1 TA I AOC 


ATTfS ATAPf5P 


208251 


CGCATGATAT 


TTAGCAAAGA 


GCTGCTGTTT 


CATAGT. A I 1 1 


ACA 1 C AC 1 /i 1 


208301 


GATTCAGGTT 


CCACTCGCCA 


TTTAATGCTG 


CAT ACT! AAt? 


tTPTTfi 21 A A A 
Alul loAAAA 


208351 


ACATCACGAG 


TTAAGACTCT 


AGCTAAAACT 


AATGTGTG 1 A 


ji rprnrp A O A A T* 

GATT^i AVjAAI 


208401 


AGAGGAATTT 


GTTGGTCTAA 


AATCTAAAGG 


AATCAACCAA 


TA\jtj 1 AobAA 


208451 


GCATCTCTGG 


TTTTATATAA 


TCACTAAGGT 


CTTCTTCGAG 


CjC 1 oCC I CCA 


208501 


ATTTTCTTAA 


CTCGATGGAG 


CTCTACAACC 


TTGCTGCCTVj 


r'A/^A RATAAA 
C AVjAAA 1 n/w 


208551 


CTCTCCTAAC 


CAGAAAAATG 


GAAATACCTT 


TTGTAAtjAi. i. 


mmm/*» pn» A A A r? 
1 1 i Vjo 1 rvrvrto 


208601 


AGGGGAAGAG 


GGGACGGGAA 


GAAGCAGCGC 


TTTC AAb 1 C 1 


TTTA AHAl^A A 


208651 


GCAGTACGGA 


CTTCATTTTT 


TAAGCGTGCG 


Pil ACCC 1 CVjA 


APflTATPTAT 
ulnlLlnl 


208701 


TTTCTGTTGT 


AGTTCTTCAG 


ATACGTTGTA 


ATTTG 1 At? A 1 


uAl UCAttC 1 1 


208751 


CAGAGTGCAA 


TTGATTTAGA 


AGATCAATAA 


AAC1 lAluAy 


ATPTTTA Af^A 
A1L1 1 1 nnun 


208801 


TTTGGTTTAG 


CTTCTGAAAC 


AAAATCAGAG 


AC AAA I 1 IALj 




208851 


CCTGATTCTA 


TTTACGAGTT 


/ Mill ■ y> y> y> ik ii 

CTTGGGGGAA 


TACTTG I 1 C I 


111 on 1 \3t\J\n 


208901 


TCATAGGCGT 


GATTTTTTTT 


ATTC CT AAAA 


TCACACC AA J. 


\»AAoov.lnl 1 


208951 AATAATCCTA ATCCTAACAA TGCGCCACTT AGAATATAGG AAACAGGAGC
209001 TGCTACACAC AAGAAAGCTA TCAAAGCTCC GCAGAGTAAG ATGGCACTGA
209051 TAACAATATG AATAGTGGTT GAATTCTTTA ATTCAAAATA ATAATTACAA
209101 GAGCGATTAT TTTGAATAAC TGGCGAGGTT ATATTACTCA TACATTTCCT
209151 AGGCTCATCA ATTTGCGTAT TATATCGTAA ATTTAACAAA AATCCTATTA
209201 ATAGAAATGT TTTTATTTTT AAAATTTTTA TTGAAATTGT TTTGTATTAT

209251 TGATGAAAAG TTTTCATTGA AGATAGCTTC TAGAAGAAGA ATAACGAGAG 

209301 AGCTTGTTAT GCTTTTTTTA AAAAAAGGGC GAGCCCCTTT TTGTTCTTTT 

209351 CCCGATTGGG AATAACGTAT TCTTTTTATA AAAATTAAAG ATTTTTTTAT 

209401 ATTTCTAAAT CTCAATTATA AAAAGTCTTT CAAACGCAAT TTGCTTGTAG

209451 GTATTGCAAA TGATCAGACT CCCCTTATAG GGAAGATCGT ATAGGGAGAC 

209501 TGCTTTCTAT TTTTTTCTCT AGTTTGTAGG TTTGGGCGTG GAAAGCATAC 

209551 TGAGGTAAGC TTGATGTCCA TTGTGAACTC GTATTACGAA CAAATTGCCA 

209601 TCGTTTTAAC AGATTTTGAT GGTGCGTTGT ATTTACGCAT ATCTTCTGAA 

209651 CAAGAGCACT CGCTGGGTGC ATAGGACTTT CTTTTACTTT CTCTAGTAAA 

209701 ACCTTCAATT CTTTCCACGT CATGAAGTTC ACTGTAGGTT CATAGTTAGA 

209751 GGATACTGGA TCGAACATAT TTGTTTCAGT ATTCAAAAAG CCTCCAAATG

209801 TTGCCATGGA ACAGTGGCCT CCTGCTTTAT CAAACTGACA CAACATTTTC 

209851 CAATTATCAG GATTTATAAG TTGAATCATC TGAGCCTGTT CCCAAGTGAT 

209901 ACCATGAGAA AAGAAAAGAA ATAAGAATTG TGAGATTCCT TGAACATCCT 

209951 TCCGGAAAAT CATATTGTGG GGGAGTTCCT TGAATATTTC TTCGCAGGTT 

210001 TTTTTCACAG AAGGAGAATC CCATTGATTC TTAGACGCTT TACTATATAG 

210051 GAACTTATAC TGTGATTCTG AGATTAATGT TAGTAGAGGG CGTGTATAAG 

210101 AGTGGAAAAT AGTTGTATAT CCTTTATGAA ACGCTAAGGG CCCAAGTAAA

210151 CCATAAACTT TTTGTGTTTT ATTTAATCCG ATTTCCCCAG CAACAGATTC 

210201 AGTTTTGTCT ATAAAATGGG AGAGCCAGTA TAACGGGCAG TTTTGAAGAA 

210251 GAATCTCTTC GAACTCTGGA AACAGGGTTA AATCTATAGA TTTTAGAATA 

210301 TCGATCCCGA AAGCCTCTGC TTTTTTATGT AAATTCGGAG GCAGGTCTGT 

210351 ACCGCTTTTC CAGCAATTAA TAAATATTTT TAATTCGTTA ACAGTCAGGG 

210401 AGTGTGTTTT CACGAAATAA AAGACTTCTT TAGGATAGCG ATTGTAAATA 

210451 ATCTTCTGAA GTTCGTTGGG GATGGGCAGA ACCTTTGATT TAGCTAGCAG 

210501 TGCTACGACT AGCGTTATAA TCAAGATTAC GATAGCGGCT AAAGCTAAAG 
210551


TTCCTCCAAT 


AGCATAGCTA 


ATCGGCGCAG 


CAAGGAAAAC 


AAAAGAAAG1 


210601 


GCGCTAACAA 


GAGCTAGAAC 


AAGCCCAAGA 


ATGAGTCGGG 


CAATTGTCL 1 


210651 


AATTTTTAAA 


GAACAAGATC 


TATGACATTC 


GCAGTCATTT 


T»T*T» 7\ A H 7a Ta T ?a 
I I 1 AAAAA1 A 


210701 


TATGGGGAAC 


TAGTGTTAAA 


GGACTACACT 


TCATAGTACA 


ACCAGAA 1 G 1 


210751 


CTCACGGTTA 


TTATAACCTT 


TTGATTGAAA 


AAAGCGTACG 


AAT 1 GG l oAA 


210801 


ATGGGAGTTA 


ATAAAAAAAA 


TCCCTACCCA 


CT AAAAAG CA 


GG 1 AGGGA I \- 


210851 


AACAAGAGTA 


AGAGAAGCAA 


CTCTATGAAG 


AAGCAGGAGG 


1 o AA 1L1 


210901 


TGAGCCACTT 


CTTGTTCTTT 


AAGAGCAGAC 


TGCGCTAAGA 




210951 


TAACTTAGTT 


GCAGAAACCA 


ACCAAATAGC 


AATGATGAAA 


AG AAWvi I C A 


211001 


CTGCAAGATA 


AGGGGTCATA 


GCTCCAATAC 


TTCCACAGAT 


AACGAGCAAA 


211051 


CCTTGTTGGA 


TTAAAGCTCC 


TCCTGATTTT 


CCGAAGCGGG 


CAGCAAC I AG 


211101 


ATCAATAGCA 


GCCTTACCTT 


TGACTTTTTG 


CTCTTGGTCA 


AGAGGGA1 Al 


211151 


AGGCCATTTC 


TTTAGTTGAG 


TCAAAGAGAG 


CGTATTTTGT 


GGATTTCGAA 


211201 


AGAATATTCT 


GTATAGCTCC 


GACAACCACA 


GCTAGCATGA 


GAGGAGTTGT 


211251 


ACCGAACATA 


GCGACCAGCC 


CAGAAGCTTG 


GTTTCTAAAG 


AT AAC AAG A G 


211301 


CGAAGAAAAC 


GATACCTGTT 


AGGAGAACCA 


TGACAGGAGT 


GACTAGGGG I 


211351 


CCAGTTAACC 


ATCCAAATTT 


ACGAATGACG 


TTACCACCAA 


C AAAT AG CAT 


211401 


GATAAGTACG 


GATACTACGC 


CAGTCCAGAA 


GGAGAAGTTC 


CCCATGAAG 1 


211451 


CACTATAGTC 


ATTCATATTA 


GGATATTGCA 


GTTTCAGCTG 


ACTTTTCCAA 


211501 


GTCACTTCGA 


TTAAGTTAAT 


GCAAATACCA 


TAGGCAATAA 


CCAAGAGAGG 


211551 


TAATAAAAGA 


ATATAAGGAG 


ATCTAGCAAG 


ATAGAGGAAG 


CTAI Gil 1GA 


211601 


TATTCATTTT 


AGGTTTAGCA 


CCTTTTTTCC 


CCTTTTGCAT 


TTCTTG I GG A 


211651 TTATAGAAGC GAGGATCGGT CAATACGTTC TTATTGATCC ACCAGTAACT
211701 GGCCATAAGA ACAAGTCCAG ATACAATAGT CATAGCCATC AAAAGACGTA
211751 AAGAAATTCC CCAAGGATCT ACACCTTCAG AAACGGAAGC TCTCAACTTT
211801 GAAGCCCAAA CAATTGCACG ACCAGAAGCT AGTAAAGAAA TATTAGCTCC
211851 GATACCGAAA AGAGCGTAGA AACGCTTTGC TTCGTGGATT TTTGTAATTT
211901 CATTAGCAAA TCCCCAGAAC ATTAGAGATA GCATGACGCT TCCCCATAGT

211951 TCAGCAAGTA CATAAAATGC AGCAAATGTC CAGTTTCTTA AGATGGCAAC 

212001 GAGTCCTAGC AATCCTGGAG GTAGGATGGC CTGTAAACGG TCAGCAAATT 

212051 CTGTAGGATG TAAAACATCG CGTAGCGGAT AAATTACAGT CGGGAACAGG 

212101 GCAAAGAAAA TTAAAAAGGG CGTTCCCACT GCATAAAATA AGGCCTGCTT 

212151 ACTTAAAATA TTACTTAGCT TTGCATAAAT AAGCATAAAG ATAATAGCAC 

212201 AGGGGACAAC AAGCCAAAAC TTGATGAAAG GTATTGCCTC TGCACCAGAA 

212251 CCAGGAGCTC CCACAATAAG AGTGTCTTTT GTATCGCGTA ACACCGTATA 

212301 GTTAAATGTA ATACAGAAGA ACATTAGGAA CATTGGCAGA ACTTTCTTTA

212351 GCTCGTGAGT ATGTATCGGC CACAAGAAAG AGCGCAATTT TCCAAAAGGT 

212401 TTTTCTTCGG TTTTTGTCAT ATTTACCCTC TGAAATACTT TTATTTTCTA 

212451 TTCTGATAGT TTTTTATTTC GTTGTGGACT AACTTGAATT GTTTCCATTT 

212501 CGAAAGACCT TATAATATAC CATCTTTTCT CTCTATTGAC AAGAGGAAAG 

212551 GGATACTCCT AAGGGATAAG AGTTCTTAGG CGACTTTATT GCTAAGTCGC 

212601 CTGTAAAATT TATTTTTTAA TTAAACGAAA GGAGAGGTAA CTTCAAGGTT 

212651 GGGAGAGCTT ACAGCCTTCA CAGCTTCGAC GTAGGCATTC CATAGCTCTA 

212701 CGCTCACGAT ACCGTTGCGA ATCGAACGAA TTAACTCCCT TTTTAAAGAA 

212751 GTTAACGGTT TTTTAGGGCT AAAGCGAGCG AGGAGGTCTT TGTCTCCAAC 

212801 TTGTCTTGCC AGTTCTAAAA CGCGTTCAAT ATCTGTTTTA GTAAAGTTAA 

212851 TAAAGTCACG ACGTTTTTTA GAGCCTCTAG GAGCTCTAGG CTTAGGATTA 

212901 ATTGTGCGTG GGGTTTCCGA TTCTAAAATA AATGTTTTTA ACATTTTTAA 

212951 AAGTTGTTCG GGAGCAGCGC TTTTCATCTT TTTTAGAGTG AAATGATGCA 

213001 TATAACCGCC GCTAGGGCCT GGAAGATAGC GACATAAATC GTTTTCTTTG 

213051 TTTCCGCAGA CTCTTTTAAT AGCTTTTCCG ATCAGTTTTT CTATTTCTTC 

213101 TTTTATATGT GATTGTTCTA CGGCCATGAA ACCCACCTTT TATAATTGAT 

213151 GCTTACATAT TTTACAAGAA AGAATGTGGC ATACTTTCAA TTTAATGTTT 

213201 AGAGAAATAA AATTAAAGCG CTGTTTTAAT CAATCAGCGA ACATAAATTA 
213251 TATTGCAAGT TCTCCTTTTT GCAATATTTT TCCTCTCAGG ATCTTTTTGT
213301 TTCTTAACAA AATTCAAAAC ATAAAAATCG TAAAGTAAAG AAGATTTTTT 
213351 CTATAACATT TTGTTTTCTT ATAAAACAAA CGGGTTTTCT AATTTTTTTA 
213401 AAACTATCCT GAACAAAGAT ATTTCTTTCA TTTCTATTAG TTGTATTTCT 
213451 TGCTGGATAA AATATATTTT GAATAGAACT TGTTTTTCTG GTACTTTAAG 
213501 TGGTTTTTAA TGGATCTTAA AAAATGCTTC TAGAGAGATG GATGCGAAAA 
213551 TGGGATATAT ATTTAAAGTG ATGCGTTGGA TTTTCTGTTT CGTGGCATGT 
213601 GGTATAACTT TTGGATGTAC CAATTCTGGG TTTCAGAATG CAAATTCACG 
213651 TCCTTGTATA CTATCCATGA ATCGCATGAT TCATGATTGT GTTGAAAGAG 
213701 TCGTGGGGAA TAGGCTTGCT ACCGCTGTTT TGATCAAAGG ATCCTTAGAC
213751 CCTCATGCGT ATGAGATGGT TAAAGGGGAT AAGGACAAGA TTGCTGGAAG 
213801 TGCCGTAATT TTTTGTAACG GCCTGGGTCT TGAGCATACA TTAAGTTTGC 
213851 GGAAGCATTT AGAAAATAAT CCCAATAGTG TCAAGTTAGG GGAGCGGTTG 
213901 ATAGCGCGTG GGGCCTTTGT TCCTCTAGAA GAAGACGGTA TTTGCGATCC 
213951 TCATATCTGG ATGGATCTTT CTATTTGGAA GGAAGCTGTC ATAGAAATTA 
214001 CAGAAGTTCT CATTGAAAAG TTCCCTGAAT GGTCTGCTGA ATTTAAAGCA 
214051 AATAGTGAGG AACTTGTTTG TGAAATGTCT ATTTTAGATT CTTGGGCGAA 
214101 ACAATGCTTG AGCACAATTC CTGAAAATTT ACGGTATCTT GTCTCAGGTC 
214151 ATAATGCGTT CAGTTACTTT ACACGTCGCT ATTTAGCTAC TCCTGAAGAA 
214201 GTGGCTTCCG GAGCATGGAG GTCTCGTTGT ATTTCTCCTG AGGGTCTATC 
214251 TCCAGAAGCT CAAATCAGTG TTCGTGATAT TATGGCGGTT GTAGATTATA 
214301 TTAATGAGCA TGATGTCAGT GTGGTTTTCC CTGAGGATAC TCTGAACCAA 
214351 GATGCGTTGA AAAAAATTGT TTCTTCTCTG AAGAAAAGTC ATTTAGTTCG 
214401 TCTAGCTCAA AAACCATTGT ATAGTGATAA TGTGGACGAC AATTATTTTA 
214451 GCACCTTTAA ACATAATGTC TGCCTTATCA CAGAAGAATT AGGAGGGGTG 
214501 GCTCTTGAAT GTCAAAGATG AGACTTTTTG GTCTGTACAC AACCTTTGTG 
214551 TAAACTATGA GCATGCAGCC GTTCTTTATC ACATATCCTT TTCCTTGGGA 
214601 AAGGGGTCAT TAACTGCTAT TTTAGGTCCT AATGGAGCTG GTAAAAGCAC

214651 TCTCTTAAAG GCTTCCTTAG GCCTGATCAA ACCCTCTTCG GGGACTGTTT 

214701 ATTTTTTTAA TCAAAAATTT AAGAAGGTGC GTCAGCGCAT AGCCTATATG 

214751 CCTCAGAGAG CTAGCGTGGA TTGGGATTTT CCAATGACTG TCTTAGATTT 

214801 AGCCCTTATG GGGTGTTACA GCTATAAAGG AATGTGGGGG AGAATTTCTT 

214851 CGGATGATCG AAGGGAGGCC TTTCATATTT TAGAAAGAGT TGGTTTGGAA 

214901 TCCGTAGCAG ATAGACAAAT AGGACAGCTC TCAGGAGGAC AGCAACAAAG 

214951 AGCATTTTTA GCACGTGCTT TGATGCAAAA AGCAGATCTA TATCTTATGG 

215001 ATGAGTTGTT TTCAGCGATT GATATGGCTT CGTTTAAAAC ATCTGTAGGG 

215051 GTTTTGCAAG AGCTGCGAGA TCAGGGAAAG ACTATCGTCG TTGTTCATCA 

215101 TGACTTGAGT CATGTGCGTC AACTATTTGA TCATGTGGTT TTATTGAATA 

215151 AGCGTTTGAT TTGTTGTGGC CCTACTGATG AATGTCTGAA TGGAGACACT 

215201 ATTTTCCAAA CGTATGGTTG TGAAATTGAA CTTTTGGAAC AAACCCTGAA 

215251 GCTCTCTCGA GGAAAACAAT TTGGATCGTG CTGATTATGC TCAGTTGTGT 

215301 TTTTTCTGAT ACGATTTTCT TATCTAGTTT TTTAGCTGTC ACTTTGATTT 

215351 GTATGACCAC AGCTTTGTGG GGGACAATTC TCTTGATTAG CAAGCAGCCT 

215401 CTTTTAAGCG AAAGTTTATC TCACGCGTCG TATCCAGGAC TTCTAGTTGG 

215451 AGCTTTGATG GCGCAATATG TTTTCTCATT GCAAGCTTCT ATTTTTTGGA 

215501 TTGTGTTGTT TGGGTGTGCT GCTTCGGTAT TTGGTTATGG GATCATTGTT 

215551 TTCTTAGGGA AAGTATGTAA ATTACATAAA GACTCCGCCC TTTGTTTTGT 

215601 TCTTGTGGTA TTCTTTGCTA TCGGAGTGAT TTTAGCCAGT TATGTCAAGG 

215651 AAAGTAGCCC TACGCTATAC AATCGCATTA ACGCCTATCT ATATGGGCAA 

215701 GCAGCCACTT TAGGTTTTCT TGAAGCTACG TTGGCTGCGA TCGTCTTTTG 

215751 TGCTTCGTTA TTTGCTTTAT GGTGGTGGTA TCGACAAATT GTTGTGACTA 

215801 CTTTTGATAA AGATTTTGCT GTTACTTGTG GCTTAAAGAC TGTTCTTTAT 

215851 GAAGCACTCA GTCTAATTTT TATATCGTTG GTGATCGTAA GTGGAGTTCG 

215901 AAGCGTAGGG ATTGTTTTAA TTTCTGCTAT GTTTGTGGCT CCTTCTTTAG 
215951 GTGCTCGTCA GCTTTCCGAT CGTCTAAGTA CAATTCTTAT CCTTTCTGCA
216001 TTCTTTGGAG GGATTAGCGG AGCTTTAGGA AGCTATATCT CTGTAGCATT
216051 CACATGTCGT GCTATTATAG GGCAACAGGC GGTGCCTGTA ACCTTGCCTA
216101 CGGGACCTTT GGTTGTCATT TGTGCTGGAT TATTGGCCGG TCTATGTTTG
216151 CTTTTTTCTC CAAAATCTGG GTGGGTCATT CGTTTTGTCC GTAGGAAGCA
216201 CTTTTCGTTT TCAAAGGATC AAGAACACCT TTTAAAGGTG TTTTGGCATA
216251 TTTCTCATAA TCGTTTAGAG AACATTAGTG TTCGAGATTT TGTCTGTAGT
216301 TATAAGTATC AGGAGTATTT TGGGCCTAAG CCTTTCCCTA GATGGAGAGT
216351 TCAGATTTTA GAATGGCGGG GTTATGTTAA AAAAGAACAA GATTATTATC
216401 GACTCACAAA AAAAGGAAGA AGTGAGGCCT TAAGATTAGT TCGTGCTCAC
216451 AGATTATGGG AATCGTATCT TGTGAATTCT TTAGATTTTA GCAAGGAAAG
216501 TGTTCATGAG TTGGCTGAGG AAATAGAGCA TGTTCTTACT GAAGAATTGG
216551 ATCATACCTT GACAGAGATT CTCAATGATC CTTGTTATGA TCCTCATCGA
216601 CAAATTATCC CAAATAAAAA AAAGGAAGTC TAATGGCTTT GGGACCTTCT
216651 CCTTATTATG GAGTATCTTT TTTCCAATTT TTTTCAGTAT TTTTTTCGAG
216701 ACTGTTTTCT GGAAGTCTTT TCACGGGTTC TGTCTATATT GATGATATTC
216751 AGATTATAGT ATTCCTTGCT ATTTCCTGTT CAGGTGCTTT TGCAGGAACT
216801 TTTTTAGTCT TGCGAAAGAT GGCTATGTAT GCGAATGCTG TCTCTCATAC
216851 TGTCCTTTTT GGTTTGGTCT GTGTTTGTTT GTTTACGCAT CAACTGACGA
216901 CCCTCTCTTT GGGTACCTTG ACTCTTGCAG CAATGGCAAC AGCTATGCTG
216951 ACAGGGTTTC TTATTTACTT TATTCGTAAT ACTTTTAAAG TTTCAGAAGA
217001 GAGCAGCACC GCTCTAGTCT TTTCTTTATT ATTCTCTCTG AGCCTTGTTT
217051 TGTTAGTCTT TATGACAAAG AATGCTCATA TAGGAACGGA GCTTGTGTTA
217101 GGAAACGCAG ATTCTTTAAC GAAAGAGGAT ATTTTCCCTG TCACTATTGT
217151 GATTTTGGCT AATGCTGTAA TTACTATTTT TGCGTTCCGT AGCTTAGTTT
217201 GTTCTTCTTT CGATTCTGTA TTTGCCTCTT CTTTAGGAAT TCCTATTCGG
217251 TTGGTTGATT ATTTGATTAT TTTTCAACTT TCTGCATGTC TTGTAGGAGC
217301


TT TT AAGG CT 


fcTTAGG lolAl 


taatggpapt 


TG CTTTTCTG 


ATCATTCCAT 


217351 


CGCTTATTGC 


TAAGGT1 All 


n p a a A A TP P. A 


TAAGGAGTCT 


TATGGCTTGG 


217401 


TCGTTAGTTT 


TTAGTAT rob 


T A P A fir* A TTT 


TTAGPTPCTG 


CATCTTCGAG 


217451 


AGCAATTCTT 


AGTGCTTATG 


Axil uyoo l l 


ATPGAPTTCG 


GGAATCTCTG 


217501 


TAGTGTTCTT 


GACGATGATG 


1 vjo 


TT AAA TTT AT 


AAGCTATTTT 


217551 


CGAGGCTATT 


TTTCTAAAAA 


111 


ATAAGTGAGA 


AAAGTTCTCA 


217601 


ATATTAGCAG 


TGATTTAAAA 


PA ATP A A ATT 
oAA 1 wnnn 1 1 


TTAGAGTTTA 


GTCCACTTTA 


217651 


GTTAGAAATA 


GTATCAATAb 


A P A A TTfl A P A 
AoAA 1 1 Vj/iwrt 


ATTCCTCGAC 


TTGCGGAGTA 


217701 


TGATTCTCTT 


TTTCACCTAO 


mm > A * rv; T A G 


PATGCTTGAA 


ACATTTAGCC 


217751 


GTTCTTGGGT 


CAACAGGTAG 


T JlTTPPPPPT 
1A1 HjoCV-wI 


PAAAPATTAG 


AGATTGTGCG 


217801 


GCGCTATCCT 


TCAGAAl 1 1A 


A A ATTATTTP 
AAnl iHl x 


TATGGCTTCT 


TATGGAAATA 


217851 


ATCTAAGGTT 


ATTTTTTLAb 


r*A aptapapg 


AGTTTGPTCC 


GTTAGCCGCA 


217901 


GCGGTCTATA 


ACuAAuAou 1 


TTATA APGAG 


GPPTGTCAGC 


GATTCCCCCA 


217951 


TATGCAATTT 


TTCCT AC?\jt- t- 


A fir* A PPPTTT 
AoV» AvjvAj A 1 A 


A APPPAAPTT 


TGTATCATGG 


218001 


ATACAGTCAC 


TACTGTCGT1 


uL 1 1 1L1 1 


PAGGAATPGA 


GGCGCTACCC 


218051 


GCGATTCTAG 


AG TC G ATCj AA 


7A TA 7A 'hrT 1 A AAA 


PPAPTAGPTT 


TAGCAAACAA 


218101 


AGAAATTTTA 


GTTTGTGCTG 


r*r*PA ATTPPT 


TTPT A AG APT 


GCAAAGGAAA 


218151 


ATGGTATAAA 


AGTTCTTL.L 1 


a TTf a t a ppg 


AGPATAATGC 


TTTGTATCAA 


218201 


TGTTTAGAAG 


GCAGGACGA 1 


1 unuuuAft 1 


A APAAAPTGA 


TTCTTACAGC 


218251 


TTCTGGAGGG 


CCTCTGCTCA 


ALMw 1 L. 1 1 1 


APA AGAGPTT 


TCTTGTGTAA 


218301 


CAAAACAAGA 


TGTTTTGAAC 


C j\ r P^T* r P AT AT 
LA1UL 1 Al MI 


PGAATATGGG 


TTC AAAAG TG 


218351 


ACTGTGGACT 


CATCCACATT 


r*r*rp/-* a Tv»ra TaP 
(jkj 1 CAA 1 A/\o 


PPAPTPGAAA 


TTATCGAGGC 


218401 GTATTGGCTG TTTGGTTTAG AAAATGTTGA AATCCTGGCT GTAATTCATC
218451 CTCAGAGCTT AATCCATGGT ATGGTAGAGT TTTTAGATGG GAGTGTGATT
218501 TCTATCATGA ATCCGCCTGA TATGCTCTTC CCAATACAAT ACGCTTTAAC
218551 AGCTCCAGAG CGTTTTGCAT CTCCTAGGGA TGGTATGGAT TTTTCGAAGA
218601 AACAAACTTT AGAATTTTTT CCGGTAGATG AGGAGCGATT TCCTAGTATC
218651 CGTTTAGCAC AACAGGTATT AGAGAAACAG GGGTCTTCTG GAAGCTTTTT
218701 TAATGCAGCC AATGAAGTAT TAGTGCGGAG GTTCCTTTGC GAAGAGATTT
218751 CTTGGTGTGA CATTTTACGC AAATTAACGA CTCTTATGGA ATGTCATAAG
218801 GTTTATGCCT GCCACTCTTT AGAAGATATT TTAGAAGTAG ATGGTGAGGC
218851 TAGAGCTCTT GCTCAAGAAA TATAATCGAG TAGGTATATG ACAATAATCT
218901 ATTTTATTCT AGCAGCCCTA GCTTTAGGGA TTTTAGTGTT AATTCATGAA
218951 CTTGGTCATC TGGTAGTAGC AAAAGCTGTA GGAATGGCTG TAGAGAGTTT
219001 TAGCATAGGC TTTGGTCCTG CTTTATTTAA AAAGCGTATA GGCGGCATAG
219051 AATATCGCAT TGGATGCATT CCTTTTGGAG GCTATGTTCG TATCAGAGGT
219101 ATGGAACGTA CCAAAGAAAA AGGGGAGAAG GGGAAGATAG ACTCTGTCTA
219151 TGATATTCCT CAGGGATTTT TTAGTAAGTC TCCTTGGAAA CGCATTCTGG
219201 TTCTTGTTGC TGGTCCTCTT GCCAATATTT TATTAGCTGT CTTGGCTTTC
219251 AGCATTCTTT ACATGAATGG GGGAAGAAGT AAAAATTATA GCGACTGTTC
219301 TAAAGTGGTA GGTTGGGTCC ATCCTGTTTT ACAGGCAGAA GGATTGCTCC
219351 CTGGAGACGA GATTCTTACG TGTAATGGTA AGCCTTATGT GGGAGATAAG
219401 GACATGCTAA CAACCTCTTT ATTAGAGGGG CATCTCAATC TAGAAATCAA
219451 ACGTCCTGGC TATTTGACAG TTCCTAGCAA AGAGTTCGCT ATTGATGTTG
219501 AGTTTGATCC CACAAAATTC GGGGTTCCCT GTTCTGGAGC GAGTTATCTT
219551 TTGTATAGCA ACCAGGTGCC CCTAACGAAG AACTCTCCTA TGGAGAATTC
219601 AGAGCTACGT CCGAATGATC GTTTCGTTTG GATGGATGGC ACACTTCTTT
219651 TCTCAATGGC TCAGATATCT CAGATACTCA ATGAGTCTTA TGCTTTTGTG
219701 AAAGTAGCAC GGAATGACAA AATCTTCTTT TCTCGTCAAC CTAGGGTATT
219751 GGCTTCCGTT TTACATTACA CTCCCTACCT TCGTAATGAG CTTATAGATA
219801 CGCAGTATGA GGCTGGACTT AAAGGCAAGT GGTCTTCGTT ATATACATTG
219851 CCTTATGTAA TCAATAGTTA TGGATACATA GAAGGTGAAC TTACTGCTAT
219901 AGATCCAGAG TCTCCTTTGC CACAACCTCA AGAGAGGCTA CAGCTTGGGG
219951 ATCGCATTCT AGCTATTGAT GGAACTCCTG TTTCTGGAAG TGTAGATATT
220001


TTACGTCTTG 


TTCAGAACCA 


1 LGGG 1 Q. rG I 


hTTATTPTTP 
Al 1 Al lol It 


APPAPATGAG 


220051 


TCCGCAAGAA 


CTTGAAGAGG 


TG AATTCTC G 


AGA 1 oL 1 VjM 1 


A H^rfifiTTTA 

rino LOW 4 * A *» 


220101 


TCGCCTCTTA 


TCATTCCGAA 


GATCTGTTAC 


7v R TTTTP A & 
AAA! 1 1 iVj/irt 


PPATTTAPGA 


220151 


GAGTCTCACC 


CAGTAGAAGT 


CGCGG GTC CT 


1A1 tol L J it 


TTP A C C CTGT 


220201 


TCAGCCTCGT 


CCTTGGATTG 


ATGTTTATTC 


1 1 Luonono J. 


TTPPATAAAC 


220251 


AGTTGGAAGT 


AGCTAAGAAG 


ATT AAG AA C A 


a/T'ATAAAPA 


AAPATACTAT 


220301 


TTGGAGCGTC 


TTGATGCTGA 


^ TV H TV TV IV X X 

GAAGCAAAAA 


T\ TfTTT A P 
LLAi G ill H\j 


PGR TTTPTTT 


220351 


GAAAGATCTT 


AAGGTGAGGT 


ATAATCCTTC 


AGG 1 Gl GG1 1 


ATPTTATCAA 


220401 


ATATTACTAA 


GGAAAGTTTG 


ATCACCTTGA 


AAuL 111 Ao 1 


TAPTPPACAT 


220451 


CTGAGTCCAC 


AATGGCTTTC 


AGGACCTGTG 


GG 1 A 1 1 Va 


APPTTTTAPA 


220501 


TACAGGATGG 


TCGGTAGGGT 


TTTCTGAAGT 


GL 1L1 1 1 i 


ATPPPTPTAA 


220551 


TTAGTATGAA 


TTTGGCTGTC 


TTGAATTTGC 


I 1 LL 1 Al 1 V_V_ 


TP TTTTfifJ A T 

lUl 111 \3\3t\ i 


220601 


GGAGGTTATA 


TCCTTCTATG 


TTTGTGGGAG 


ATAG 1 GAAAA 


p A ir apPTTT 


220651 


GAATATGAAG 


ATTGTGGAAA 


GGATTTTGGT 


TGLG I 1 LAV- 1 




220701 


TCATCTTCTT 


TATTTTTCTA 


ACTTTTCAGG 


AT 1 1 A 1 1 1 Lo 


1111111 OVJ 1 


220751 


TAAGGCTCCA 


TTTTTTTGGA 


AATCCGTAAA 


Go 1 1 G 1 A/\VjVj 


APP APAAPPA 


220801 


AAGCAGTGTA 


TAACGTAGAT 


GTCTTGGTTA 


GGG I AAwov. 1 


TA APTPTTPA 
1 nnU 1 V- 1 1 \sn 


220851 


GTTCTTTTGT 


TTGTCTTCCA 


TAACAGTTTG 


TTf*T fi ft R PPT 
1 1 G 1 AAAVjV- 1 


PPTTPPPPAA 


220901 


CGTGCTCGAT 


TATCGGGGTT 


GTATTTCCTT 


I 1 AAAAA 


U 1 t> • 1 1 N^«» 1 •» 


220951 


AGATAGAGCG 


ACTTCCGATT 


AGT AATG AT G 


1 AGG 1 A 1 Vj/iV- 


Tf^T^Tfi PAT 

1 ^_ lol oo/» 1 


221001 


CACGTAAGCT 


TTATGTGGAG 


TATCTAGCTT 


1G1G1 LI AIM 


PTPP AT APPT 


221051 


. TACCATCGTT 


TTCATAGGGC 


AGGCGGAAAG 


GAAGGAA 111 


P PT A TPTPTP 
uLlnlol^lu 


221101 TTCCCACTAA GAATGAGAAC GTCTAAAGAC GAAGGGAGTT TCCCGACATT
221151 TAACATCTTT GTGGGGCAGT AGGTAAGAAG CTGTCGTCCT AATTTTCCTC
221201 CAAATACGAA CTGTACGAAT TTCACACAGC GGTAGCGTCT AGCTAGTGTA
221251 GACCCTGCGT TCGGAGGAGC CATGAGAATA GCTTTTCTTT TTTGGCTTCT
221301 TCGGGGCAAT CAGGGTGAGC GAGTGCTACA CGAACAATGA CTCCTCCAAT
221351


AGAATGAGTT 


ACGAAGTT 1 A 


1 AuoyAv. 1 \-V- 


APPPTTa APT 
nuot I l M/\o i 


T P AG tT ATTT 


221401 


TTTTCAGCAA 


GCGATTGAGA 


.TO 1 I LAGGA I 


OV, 1 1 1 1 V- 1 A\J 


AGTAAACTTG 


221451 


CGCGTCTCAT 


AATTCCAAAT 


AAAGALA i CG 


T A A TGTTPTT 


TTTCTAGAAC 


221501 


GCGAGCAATA 


GGTTTTAAAG 


AT G 1 A TAAG A 


TPTT A a. A A A P 


GCATGCACGC 


221551 


AGACCACGGA 


TTCTTTTTGT 


TTTGAGGTTT 


^pmmm * *TPP 

t-L. Ill AA * C-v- 


rrGAATTCCA 

v^. Vw r\j* i x Vw 


221601 


GATGGAAGGG 


TTTGGATTAC 


CGAGGTTTGL 


p ap A A T A A A G 


GATTACCCAC 


221651 


AGCTAAAAAG 


AGTATAGTTA 


ATAAAAAI 1 1 


pmfpp a TflT GT 
CI 1 \-Al ol 


PAGCGATATA 


221701 


ATTATTTTAA 


ATAAGTTTTT 


TATTAAAAAA 


P 2i Ik T A TP TTT 


ATTAAAACTT 


221751 


ATTTATAAAA 


ATAACTAATT 


CGCTTACT I I 


t a a ppp A A A a. 


GAAAAATGAT 


221801 


GTTATTGTTT 


TAGATAATTT 


GCTCAGAAAL 


ttp inrnTTr 


TPGATACTTA 


221851 


AAACAAGGCT 


TGTTTTTGGA 


AGTTCCCCAT 


GL.A 1 A 1 woo I 


PnAAGTA ATP 
uortnu 1 /\ft i V- 


221901 


AGAGTCTGAC 


CCAGAGTTGG 


GGCAGGGTCA 


a a ot^pp a p 
AGGAo 1 1 «ML 


PGAP APGTTP 


221951 


ATTATCTAAT 


CCAGCATGGA 


TATCATCTAG 


ACAuAL. 1 Ao A 


GGGGAGAPGT 


222001 


GATGAGATTG 


CTTTAGATAT 


AGGCACTCAG 


r* a a r^rr^PT a a 

CAAoH- 1 1 AA 


GATTGPPAAA 
uftl 1 uU\-nAn 


222051 


AGACTGTGTT 


TCTGCCCTTC 


ACT AG AG AA 1 


a p a. p a p ap 

1 oAoAWACAo 


GPATTTGGTT 


222101 


CATAGTGAGT 


AGAAAGTCTT 


CGCGATGAGG 


rnc^'L appp a A 


GTGPTTCCTC 


222151 


ATTCGAGATC 


TCTAGGAAGT 


GATATAGAGA 


iul 1 1 A 1 « 


AAATTPTTCG 
ft/\ft 1 i 1 * V-O 


222201 


GCAACAGCAG 


TTTCAGAAAT 


* rr*r* rn r P r P 

ATCAG AATT 1 


1 iAAl lAnou 


A APTTTTAAA 


222251 


TTTTAAGGCC 


AATTGTTCTT 


TTAGGTTGTT 


a p a rr a a a pt 


TPTTTGGATA 


222301 


AATCTGAAAG 


TTTCTGACTA 


CAGAGAAACC 


Gl luuAlouA 


T A PPT AGP TP 
l ftoVj i ftoo x o 


222351 


CCGTGTTTGA 


CCAACTGTTC 


ATCCCAGAGG 


LLALuul 1 VjA 


PPTTTPPTTG 
uy l i iuli lo 


222401 


CTTTTTAAGA 


GAGCATTTCT 


CTG CT GAAG A 


uLotuAl oft 1 


AGTAPGATAA 

ftu 1 /\V-Oft 1 /vft 


222451 GCAGAGGGTA TAGTGGTTAT CGCATTGAGA TAAAAGCAGA TTTAGGAAAA
222501 GGCGACGATC CGCAGGAGCT CCTGAAATTA GAAGGCGGTC TTTTGAAGAG
222551 AAAAGCACAA TAGGTACTTT CCCTATCAGC TGCGATAAGG TTTTTATAGG
222601 AAGTTGGTTA TAGCAGATTT TTTTTCCTTG CTTGTCTGTA TAGATGGAGA
222651 GAGCTTGGGG AAGGTGGTCT TTCTCAAACT GTGTTTCTAA GAAGAAATGG
222701 GAAGATCCGA AGGTGATGGT ATCTGTGAGA TGTTGCGTGC GAAAAGACCT

222751 TCCCAAGGAC AAAACATAAA GCGCTTCTAG GAGGTTTGTT TTTCCTTGGG 

222801 CATAATTGAG TTTAGGAGCC AGTGAGATTT CTAAATCACT GTGGTTACGA 

222851 AAATTTTTTA GCTTCAGAGA GCAGATTTTC ATAAACATCG GCAGGGTAGT

222901 GACCTAAAGG ATTCTCCTTA GGGAGTTTAT TAATCATCAT GTAGCCTCAT 

222951 AGGCATGATG ACAAATAATC CTGAGGCAGA ATCGGTAATG ATTCCAGGAT 

223001 TATAGGAATC CGAGATCCCT AAGCTGACTA ATTCATCCTT ACTATGCTTC 

223051 AGGATATCTA AAAAGAAAAA GGGATTAAAG GCAATTTCTA GGAGTTCGCC 

223101 AGAATAATTT ACAGCCATGC TTACCTTTCC TTCACCCACC TTAGTACAGT 

223151 TGGCTGTTAG AGTGAGCTCT CCGGGTAAGA AAGAAAACTT CACGGAGTGA 

223201 GAGGACTCAT TTGTAAATAA AGCCACTTGT TTGAGCAGAG TAATTAGTTC 

223251 TTCGCGATGC AGATCGAGTT TTACGTTGCT TTCTGTAGAT ATGACGGGGG 

223301 AGAAATCTGG AAATTCTCCA GAAAGAAGTT TTGTGATCAG GAGAGTATTG
223351 TCACATTCAA CCGCAATCTT ATCTTGATCC AAGAAGATCG CAGCTTCACC 

223401 TTCATCGGAG CACATCTTTA TAATTTCTTC TACTGCTTTG ATAGGAATAA
223451 TATATTCCCC AGAAAAACTT TTATCTAAAG TAACTTCAGC ATCTATTTTT 
223501 GCTAAACGCT TTCCGTCAGT CCCTACGATG GTAGCCACGC CATTGGCGAT 
223551 AGCAAGCAGG ACTCCAGTAA GAACATAGCG GCTTTCTTCT CTAGATACAG 
223601 CGAATGAAGT TCTCTGTAGC ATGGTTTTTA GCTGCTCTGC AGGCAAGGAA 
223651 AAACGCAAAG CATTTTGTAT ATCAGGGAGC ATGGGGAAGT CTTCTTTTTC 
223701 CATGCTGAGT AGGCGAAAGC ATGAAGATCC CGAGGTGATT TGTGCCATTT 
223751 CCCCTGCTGA AGAGGAAATT TCTAAATTTG CCTCTGTTAA TTCTTTTACT 
223801 AATTGAAAAA ATCTCTTGGA GGGAATGGAA ATAGCGCCTT TCTCATAGAC 
223851 TTTAGCTTTG GTGACGCAAC GTGTGCTCAC TGTCAGATCC GTAGCAGTGA 
223901 AAACTAATTC ATCATTATAA GTTTCAATCA AAACATGGGT GAGTACTGGA 
223951 ATAGGTGTGT TTTGAGGGAC GACACTTTGA ATTTTTTTGA TAAGGTTTCC 
224001 TAGCTCATTT CGGGATACAA CGAATTTCAT ATTTTCCTAT AACCTGAGTC
224051 TAACTTTATA TGAAGGGCTA CGCCTCTCTA GGAGAGACGG TAGAGGTACG

224101 ATCGGGATAT TACTACAAAG TGATTTTCTG GTCTTCTAGA TATTTTACTA 

224151 ATAAAAAAGG CTTTAGATTG AGGAAATCTT CCCTGGAAAT CAAGGAAAGA 

224201 GGATTCTAAT CATTGTCTTA AGAAGGAAAA ATTGCTTTTT GTTATACTGG 

224251 TTTTTGTCCC CAATTATGGG GGAGGATCTT ATGGCACAAA AAGAAATTGT 

224301 TTCTAATCGC AAGGCTCTGC GTAACTATGA AGTTATAGAG ACTTTAGAAG

224351 CAGGCATCGT TTTGACTGGG ACTGAGATTA AGTCGTTGCG CGATCATGGG 

224401 GGAAACCTCG GTGATGCTTA TGTCATTGTT TCTAAAGGTG AGGGGTGGTT 

224451 ATTAAACGCG AGTATTGCTC CCTATCGGTT TGGAAATATC TATAACCATG 

224501 AGGAGCGTCG TAAACGTAAA CTCCTTCTTC ATAGATATGA ACTTCGTAAG

224551 TTAGAGGGTA AGATTGCTCA AAAGGGCATG ACTTTGATTC CTCTGGGAAT 

224601 GTTTCTGAGT CGCGGCTATG TTAAGGTACG TTTGGGTTGT TGTCGTGGGA 

224651 AAAAAGCTTA TGATAAGCGT CGTACGATCA TAGAAAGAGA AAAGGAACGT

224701 GAAGTTGCCG CTGCTATGAA GAGGCGCCAT CATTGATATA GGTTAGGATA 

224751 TGGTGTTCTT CAGCCCACTG TTTTGCTTCT ATTTTAGAAT CAAAAGTCAT

224801 GAGGACTGTG GCAATAGCGT CGGCGTATGC GGAGCTCGGA TGGACTACTG

224851 AAACACTTTG GATAGGATAG GAGCTTAGCT CTAGGGGTTT CCCTGTACGA 

224901 GTATCAAGAA TATGGGTGTA AATTTTTCCT TCAACACACC ATTTTTGAAT 

224951 ATGATTTCCA CTTGTTGCAA TTGCCATATC ATCGATATCT AAGATCGTAC 

225001 CTGCTGCTTC AGAAAAAATA CGCCAAGGTC TTCCCGAGGG ATGATGCCCT 

225051 GACGTTTTGA TCTCTCCTCC CCACTCTACA TAGTTGTTCG GACAAAAGGT 

225101 ATTGCAAATT TCATTTAGAC AATCTACGGC ATAACCTTTG ACAACACCAC 

225151 AGAGGTCGAT TTGAACATGA GGATTCTTTT TGATTAGAGT TTTTGTGTTT

225201 GACTGAAACT CCAAGTGTTG CCAGCCCATG TCTTTATAAT GTTGTTCCCA 

225251 AACGTCTTTA GGGGGGAGGG TTTGACTTTT GAGATGTAGA AGCCATAGGG 

225301 TTTTTAAAGG TCCTACAGTA GGGTCAAAAC GTCCTTCTGA AAGTTTGTAA 

225351 AGTGTATCTA CCTGATCTAG AAACTCGGAA AGTTCTACAG ATAAAGTTAT 
225401 GGGGACATCT GCTGGAGCTC GGTTGATTAT CGAGAGTTCA GAATAGGGAT

225451 TCCAGTTGTT ATAAATCGAG TCGATCTTAT GAAAGCATCT ATCAATTTGT 

225501 TGGGATAAAG ATGCTTTTTC TTTTGCGGAT AAAGAGGTTC CCAGAACAAT 

225551 GCGATAGAAG ATTGTCATCT GCTCTCCTTC GATTGTTGTC GTTTTTTGAG 

225601 AGCATGAACA GAGTCCAAGA CATAAAAGAA CTAAGAAAAA TTTTGGTAAC 

225651 ATCGCCATGT ATTGCTCATG AGCATAGCGA CAGTCATGGG ACCAACGCCT 

225701 CCAGGAACTG GAGTGATTGC TGCGCATTTT GTCACAACGT TATTAAAATC 

225751 TACATCCCCA AGAAGAGTAT AGCCTTTCGC ATTGTCTGCA GGGACTCTTG 

225801 TTGTTCCTAC ATCTACGATC ACAGCATGTG GGGCTACCAT AGTTTCCTTT 

225851 ATAAAAAGCG GTGCTCCTAG AGCAGCAATA ATGATATCAG CTGTCTTTAA 

225901 GATTTCTGGG AGGTTTTCCG ACTGGCTATG AAGAACTGTG ACTGTACAGT 

225951 TAGTTTGAGG ATGCTTTTGC ATCATGAGGG CCGCTAAGGG TTTCCCCACG 

226001 ATGTTGCTTC TCCCTACAAT AGCGGCATGG CGGCCTCGAA GAGGAATTTC 

226051 ATAATAGTTC AGGAGTTCAA TAATTCCTGC AGGAGTGCAG GGTAGAAGTC

226101 CATCAAAATT TCCAAGGAGC AACTTTCCCA TGTTCACAGG GTGAAGCCCG 

226151 TCCACATCTT TGTCTGGGGA GATCGCTTGG AGAATCACTT CGCTGTCCAA 

226201 GTGTTTGGGC AAGGGAAGTT GCACGAGGAT GCCGTGGATG CTAGGATCTT 

226251 GATTCAATCG TTCTATGAGC TTAAGGACTG AGGAGAGGGT AGAGTCAGAG 

226301 GGTAACTTGT GCGCTTTGGA GATAATTCCG ATTTCTGTAG CTTTTTTGAC 

226351 TTTCATGCCA ACGTACACCT CAGATGCGGG GTCATTGCCA ATCAGGACCA 

226401 CAGCAAGCCC CGGAGAGGTA GGACTTTGTG AGATTTCCTC TTTGAGTCTC

226451 TGAAGGATTT TTTCAGCTGC AGGAATCCCT CTCAGTAACA TACCAATCTC

226501 CTAAAAAAAG ATATCATTTT ATGGATTCCT GAGTTTATTG CGCAAGGAAG 

226551 TATATAGATT ACAGAAGTGA AGATGAGATT ATCTAAGAAA GAACATTTTA 

226601 GAGGTATCAT TAGCACATCT ATTGACCAAC GGACTTCATG TCCGAACAAG 

226651 GCGCACATAG GAATGGTTAG GAGTAAAAAG ACTCCATAGA AAATCATATT 

226701 CACAATGACC AAGGCGGCTT TGCTCTCTTT TGAAAAAATG AGGTGTGCCT 
226751


TATGGGTGAT 


CAGAGCAGTT 


AGAGGGTAGA 


GCAGTAGAAA 




226801 


CTTCCTAAGG 


CAAGATCACT 


TAAGACACCA 


CAGCCTAGAG 


r*p AHTAPCAfi 


226851 


TCCCTTATCT 


TTTGAGAAAC 


AATAGAAACA 


AAGAACAATA 




226901 


AAAGGATCAG 


ATGTGCTTTA 


GGGAAGAGCG 


TTGGAAGGG 1 


PA APAT^CIAT 


226951 


AAAATACACA 


AAATAAAGCA 


TTTGCGCGGT 


GTTATGTGGA 


C Aij ACM 1 o\j 1 


227001 


TTACGCTTTG 


AATTATGATT 


ATCGATTTCA 


AAAGGTTTTG 


f* a P a PTTf^T A 
LnunU i i win 


227051 


TATAGCAAAG 


CATAGGATTC 


TTAGCTTGAT 


TTTTTATGTT 




227101 


GCTTTTAAAC 


AATTATTTAT 


ATAATATTTT 


» 1 Mil If IIMM 1 1 TV ^ fT^ 

ATTTTTAGAa 


aiiii aTTAfiT 
1 AAAA 1 1 Avj i 


227151 


TTATTCTTAA 


TAAGTTTTTT 


AATTAAAATT 


AGTTTATTAA 


\3 1 1 \Ji\l 1111 


227201 


TTGTTTTTGT 


ATTTTTTATG 


GGGAAGCCTA 


AG AAG AG L AG 


a ar***^ ATAfin 
AAv-ooA 1 


227251 


GCTTTGGCTC 


AGGAGATTCA 


AAAGAAATCA 


ACGGAAGTGT 


1 taAAoAAVjUv- 


227301 


TGCGCGGATA 


AAAGCTAAAA 


ATCGTCGTAA 


ATTTC TT AT T 


VjC 1 AAboHMt 


227351 


AGAAAACTCT 


TAAACACCGT 


GCTCAAGAAT 


ACGATCAGTT 


AG 1 It VjL. 1 V- 1 


227401 


CTCTTAGATT 


CTCAGAAGAA 


GGACACCGAT 


AAAGTTTTGA 


1111 U AA 1 1 A 


227451 


TGAGAATGGG 


TTTGTTTTTA 


CTGACAAGGA 


CCATTTTAGT 


AAu 1 AC 1 \- 1 A 


227501 


TCCGTCTTTA 


GGCGGGGTTA 


AATTAAAAAA 


AAATCGCAGA 


arrrrir tat 
ALGotjAt 1 A l 


227551 


CCCGTGATGG 


AACGACAGGA 


AGCCTGGGTA 


ATT AA TTT AG 


a r""PT Ann a ap 

AG 1 1 AooAAV- 


227601 


TGTCTCGGTA 


TTAAGATCTC 


CAAAATCTTA 


ATACCTAGAC 


AG 1 L-VjoA 1 V-V- 


227651 


GAGTGATCCC 


TCTAGCTCGA 


CAGCGATGCA 


CTTTGTTGCA 


Tt 1 AC1 CGC 1 


227701 


GTTCTCTGTC 


CTCCCATAAG 


TTTTTTTACG 


CAAGGTGTTT 


CACC I 1%jGG I 


227751 


TTTTTTTTGT 


TTTTTAGATT 


TTTAAAATTT 


GTTTGTAAGT 


TG llllllll 


227801 


AAATTAATTA 


TTTTTTGTTT 


TAAAATTATA 


AGTAGTTATA 


AGTTT1 AI 1 1 


227851 TTTAAATTCA GAGAATCATT ATGGCAGTTT CAGGTGGCGG AGGGGTTCAG
227901 CCTTCTTCGG ATCCAGGAAA GTGGAATCCT GCTCTGCAAG GAGAGCAGGC
227951 AGAAGGCCCG TCTCCGCTAA AAGAATCTAT ATTTTCTGAA ACCAAGCAGG
228001 CCTCCTCTGC TGCGAAGCAG GAAAGCTTAG TGCGTTCAGG ATCTACAGGA
228051 ATGTATGCAA CAGAATCTCA GATAAATAAG GCTAAGTATC GTAAAGCTCA
228101 AGATCGATCA TCAACCTCTC CAAAATCCAA ATTGAAAGGT ACATTTTCTA
228151 AAATGCGCGC TAGTGTGCAA GGATTCATGT CAGGATTCGG ATCTCGGGCT
228201 TCGAGAGTGT CAGCAAAGCG TGCTTCCGAT AGTGGTGAGG GAACATCCTT
228251 ATTGCCGACA GAGATGGATG TTGCTCTAAA GAAGGGAAAC CGTATTTCAC
228301 CTGAAATGCA GGGATTTTTC TTAGATGCTT CGGGTATGGG AGGGAGTTCC
228351 TCTGATATTT CTCAGCTTTC TTTAGAGGCT TTGAAATCTT CAGCATTTTC
228401 AGGTGCCAGG AGTTTAAGTT TAAGCTCTTC AGAATCTAGT TCCGTGGCTT
228451 CGTTTGGATC TTTCCAAAAG GCCATAGAGC CTATGAGTGA GGAGAAGGTA
228501 AATGCTTGGA CAGTGGCTCG TTTAGGAGGG GAGATGGTCA GCTCTCTTCT
228551 CGATCCCAAT GTTGAGACCT CATCATTAGT GCGCAGGGCA ATGGCAACAG
228601 GCAACGAAGG CATGATAGAT CTTTCTGATT TAGGACAGGA AGAGGTCAGT
228651 ACAGCCATGA CATCTCCCAG AGCAGTAGAA GGAAAAGTAA AGGTATCTTC
228701 TTCTGATTCT CCAGAAGCGA ATCCAACAGG AATTCCAAAT TCTAATACTT
228751 TAGAAAGGGC GGAAAAGGAA GCAGAGAAAC AAGAAAGTCG AGAGCAGTTG
228801 AGTGAGGATC AGATGATGCT TGCACGTGCT ATGGCTGGGC TTCTTACAGG
228851 GGCAGCGCCT CAAGAGGTAT TGAGTAATTC TGTTTGGTCT GGTCCTTCTA
228901 CAGTATTTCC TCCTCCCAAG TTTTCAGGAA CTTTACCCAC CCAGAGATCG
228951 GGAGATAAAT CAAAGCATAA ATCTCCAGGA ATAGAGAAGA GTACGAACCA
229001 TACGAACTTT TCTCCTCTTC GGGAAGGTAC TGTGAAGAGT GCTGAGGTTA
229051 AAAGTTTGCC TCATCCAGAA AGTATGTATC GTTTTCCTAA AGATAGCATC
229101 GTTTCCAGGG AGGAACCTGA AGCCGTTGTT AAAGAATCTA CGGCATTCAA
229151 AAATCCAGAG AATAGCAGTC AAAACTTTCT CCCTATTGCT GTGGAGAGTG
229201 TTTTCCCTAA GGAAAGTGGT ACGGGAGGGG CTTTAGGAAG TGATGCTGTG
229251 AGTTCCTCAT ATCATTTCCT TGCGCAACGT GGAGTGTCTT TACTCGCTCC
229301 TCTACCTCGT GCTACTGATG ACTATAAAGA GAAGCTCGAA GCTCATAAAG
229351 GTCCTGGAGG TCCTCCAGAT CCTTTGATTT ATCAGTATCG AAATGTTGCT
229401 GTTGAGCCGC CAATTGTTCT CCGTTCTCCC CAGCCGTTTT CAGGATCTTC

PCT/US99/26923

229451 ACGTCTATCG GTTCAAGGAA AGCCTGAAGC TGCTTCAGTT CATGACGATG

229501 GTGGGGGGGG AAATAGTGGT GGTTTTAGCG GAGATCAAAG AAGAGGATCT 

229551 TCGGGCCAGA AAGCTTCCCG TCAGGAAAAG AAGGGAAAAA AATTATCTAC 

229601 GGATATTTAG GGTTTTAAGT CGGTTTGATG TGATGCGTAT AATTCGTTTT 

229651 GATCCTTATG GTGCGCTATC TGCACAAAGC ATAGCTAAAG ATTCCCGTCA 

229701 AAACTCTCCT TTAGTAGAAA AAATTTCTGA GGAAATTGCT ACGAATGAAG 

229751 CGATTCGGCT TGCCTTGCTA GCTATTGGAG ATCGCGAACA AGAGGAGAAG 

229801 AAACAGAGGC ATCGTTATAA GCTACTCGGA CAAAAGCAAG CCAAGGTCTT 

229851 GCTTTCTCAG TTGCGTCATG TGCATTTAGA TTTTAAAAAA CTATATTGCG 

229901 ATAGTAAGAA AAAAGAAGAT CAGGAAAAAG ACGAAAAAAA CAAACAGAAG 

229951 CGATCTATTA AAGTTACAAA GAAAAAAAAG GGCATCTCTT TAGGGGCTGC 

230001 CGCTTCTCAG GCAATTGCAG CAGCAGCAGA AGCTTGGGTA ATTGCTAGAA 

230051 ATAAAGGAGT CTTAGAAACT GCCTCCACTC TTTTTTATCA AAAGGATGAA 

230101 GAGGCCTAGA CATACTTGAA TCACGAGCTA GGCCTTCTTC TGTGACTATT

230151 TTAGATCATC ACGCTGCTTC TTGCTCTTCT ACTGGAAGAC TTTTTTCTAT 

230201 AATTTCTTTT TCTTCATCGT CTACTGTAGG ATGTTCTGAA TGCTTAGCTA 

230251 GATCTTTCCA AATCATTCGA AGTTTCTGAT TTTGTTGTTT GGTCAGGTTT 

230301 TCCAGTATGA TCAAGCTTTC ATCATTTAAG GAGAATCTTC CTTTAGACCA 

230351 ATTTATAGAT CCTGCAAGTA GAGTTTTATT ATCTATAACT GCAAACTTAT 

230401 GGTGAAGAGT ACAGGGTGCG GTATTTATAG AAACAAAGTC TTTATTGATA 

230451 TTTAATTGTC GTAATTGCTT AAAAGTAAGT TTGCTATGAC TTCTATCAAT 

230501 GATAATATCT ACATGGATTC CTCGTTGTTT TGCTTGATGT AAGGCTTGAA 
230551 TAATCTCCGA GTGGGTCAGA GCAAACATAG CAACTTGGAT GGTTTTCTGA
230601 GCTGTCTGGA TTTTTTCGAG TACAGCTTGT ATTGCAATTT TACGATCTTG 
230651 AGGAAGAACA AAATACTTTC CTGTTTGATC CTTTATAGAA AAGTCTCCAG 
230701 AGGTATTTGT GATAATGAGA TCACAGAGCT CCGAGCTATG CATTCCTAGA 
230751 ATGAGATTAT TATCTAAACG TAGAGAAAGA TTGGTGTAGT TCGCAGATCC 
230801


TAGCCAAGCA 


TCTTTCTTAT 


G l A l ouAAAo 


apptttttpa 


TGPATCAGTT 


230851 


TACGCCCTGC 


TGGAGGTTGC 


TGG ACT1 AAAG 


TT A P A TTPPT 


PPPTTGCTTT 


230901 


AAGATTTGGG 


GAATTTTAAA 


TTTTTGATAG 


1 AoA 1 Col >vrt 


PTTTGTTTTT 


230951 


TGCTTGAGCT 


TGTCGAGTTA 


AACTCTG TM G 


kjrt lv.il ouo 1 


TPTGAGAGGT 


231001 


TATAAATACG 


TAGGAAGATC 


TCTTCATCAG 


Cul ul 11. Inl 


AGPATPGCAT 


231051 


AGAATTTTAC 


GCATGTCCTC 


ATTGCATTGA 


111 uAo 1 nort 


TP, AT AGPTTC 


231101 


TTCAGACTTT 


AAAAAAGTCT 


TAAAAGTGTC 


ACCAGGAGGA


PPTPTTGPAA 


231151 


AAATTCCTAC 


TAAAATCAAC 


GTGCTAATAA 


TAAG AGAG A 1 


T* r P r P r P A ATTTA 
1111 rtrtl 1 in 


231201 


TCTTTTTGTC 


TTTTATTCAT 


TTTTTACTCT 


TAAAA 1 AGG 1 


1111 lol 1 1 1 


231251 


TATATATTTT 


TAAAAATAAA 


AATAACTAAT 


Tl AA 1 I AAAG 


/""TfpTA ATAP A 
gill 1 r\\Jr* 


231301 


ATAAGATTCT 


TATTAAATAT 


AAAAAGTTTC 


j. 1 I A 1 1 GGoo 


A A AT ATP^PPT 


231351 


GCCATCGTTC 


GGAAACCTTT 


TGTTTTG 1 AG 


1 bou 1 Lu/iV. 


PTPTAPTTPT 


231401 


TTAGGATAGG 


AAGGCTTCAT 


CAGGGCATGG 


Al AALoAAbu 


H A A A P,TTf5T A 


231451 


ATTAGGACGG 


TGAGTAGCAA 


AATGGCTG tg 


GAGGGt-w 1 o/\ 


AP.ATPATTTP1 


231501 


CTGGGGCACA 


TCGTGTGAAG 


GTCCTCCAGA 


G AAAA 1L1 1 1 


TTPAPTTTP*A 

X i /»v— 111 0<i 


231551 


ATGGTTTCTC 


TCAGATTATC 


GGCAAGGATA 


A 1 LAuAuwH- 


ATP.ATTTTAG 


231601


ATCTGGATGG 


TGAAGGAGAG 


ATTTAATACA 


1GGG1GG ILo 


APPP.ATPTTT 


231651 


CCAACACTAG 


GCAACCACGA 


CAAAAGGGAG 


GGAIG1C 11« 


A APTPPATP.Pi 


231701 


ATTTTTCCTC 


CCTGATATCC 


ATGGGGAAGG 


1 G 1 GGoA 1 oo 


PTTTTPPTAT 


231751 


TCCCATGAAG 


ATTCCCTTGG 


AGCCCTTATT 


TAAGG 1 IGG1 


PPTPT AT APT 


231801 


CTAACGTATC 


GTTTGCAGTT 


TCTGAGAAAA 


TAATAAG A 1 G 


TPPPTPTPPP 
1 GGG 1 G IoVjV- 


231851 


TGTAGACGCT 


CTAAAATGGT 


TTCTAGAACC 


ACGGAGAAGG


1 G 1 LoAuAuo 


231901 CACCTCTTGG TCTGTGACCA TTAGGAATTT CGTTAGGGAA AGTTGGCCCT

231951 CTCCAAGAAT TCTAAGAGCT GTGGTTAGAG ATTCTCTCCA ATAGCGTTCT

232001 TTAACGACAG CCGCAGTCAG TGCATGAAAC CCTGATTCTC CGTAACTTTT

232051 AAGTCTACGC ACACCAGGCA TAACTAACGG AAATAAAGGG GAGAGGTATT

232101 CTTGGAGTTT GTTCCCTATA TAAAAATCTT CTTGGTAGGG TTTGCCGACT

232151 ACTGTAGCAG GATAGATTGC ATCTTTTCTG TGATAGATTT TATGACAGTG

232201 GAATTCAGGG AAGTCATGTT GGAGACTGTA GTATCCGAAA TGATCGCCAA 

232251 AAGGACCTTC AGGACGACGT TTCCCGGCCG GAGATTCTCC GACCAGGATG 

232301 AATTCCGCAT CGTAGAGTAG AGGGTGGGGA TGGTCGTTTG TTTTTTTATA 

232351 AAGGAGCTTC GCTCCTTGGA GGAAGGTAGC AAAGAGAAGT TCCGAGACAT 

232401 TCTCAGGTAG GGGGGCAATC GCAGAAAGGG TTAAAAAGGG GTTTCCAGAC

232451 AAAAATACCG AAACAGGAAG GTTTTGCTTT TTTTGCTCTG CTTCATACAG 

232501 ATGCATCCCT CCGCCCTTCT GGATTTGAAA ATGGAGGCCC ATGGTGTTTT

232551 GATTGAACCG TTGCACGCGA TACATCCCAA GATTGGGTGT AGTAAGAGTC

232601 GGCGATTCCG TATAGACAAG AGGAAGTGTG AGAAAGGCTC CACCATCTTC 

232651 AGGCCAGCTT GTGAGTAAGG GAAGGTGATC TAAGTTAACT GAGGACATAG 

232701 AAACAAAAGG AAAGCGACGG AATCGAGCTT TTTTGAGCCC TAAAGAGCTT

232751 ATTCTTTTTA ATAGATCCCG AGATTTCCAT AGAGAAGAAA GCTTTGGTGT 

232801 AGAAGAAATA AGGTGGGCAA CTCGAGCGAT GAGGTTATCA GGAGCTTGAG 

232851 AAAAAAGTTG GTCTACACGA TGTTTTGTTC CAAAGAGATT GGTCAGGACT

232901 GGGAATGACG ATCCGATGAC ATTATGAAAA AGAAGGGCAG GGCCTTGATC 

232951 TTCAATAACA CGACGATGAA TCTCAGCTAA CTCGAGGTTA GGACTTACGG 

233001 GAGCAAAAAC ATCAATAAGT TGTTTTTGTG AACGAAAAAG AGAAATATGA 

233051 CGCCTTAAGA AAGACATAAA TATTTCCCTA TACTTAAGTT AAATTAAAAA 

233101 TTTTTACTTT TAGCTCTTTC GAGAACTTTC TCTAATCCGA GCTTATCAAT 

233151 GTGACGAAGA GCGCTAGCAG AAATTTTAAG CTTAAGAAAA CGGTTTTCTT 

233201 CTGTAGACCA TAGACGCTTG GTCAACATAT TAGGGAAAAA TCTTCTTTTA

233251 GTCTTCCCTG TCACTTTCAA ACCAATTCCT TTTTTCTTTT TAGCAATACC 

233301 TCGAAGTGTA TAGCTATAAC CACGGCGAGG TCTCTTTCCT GTAAGTGGGC 

233351 ACTTTCTTGA CATATTCTTC CTATGAATTC TCTAACATCC GCTCAATAAA 

233401 GCAACTCTAT GAAAAGGAAT CTATACTAGA GCTTTTCGAA TATATGGGGA

233451 AGAGGTTTTC TTAGAAAATG TGATTGTCTG TTGTTGATAA TGAAGGAATT 
233501 CTTTTAAGAA AGACGGTCGT TCCAGGAAAT TATTATAAGG TTCTTACTAT

233551 AAAAAGACTT AAATGTTTTA TTGCTATCCT TACAGTCCTG TAAGGATCTT 

233601 CTCAATGTAA CCATTAAATT TTTTATGAAT AGCGAGTTCT TCTAAGGAAG 

233651 GCCGAACTCG ATACGACCAA TTCTTTTTAG AAATTGTCCC AGGTGTATTA 

233701 ATGCGTTCTC TTTGTAGATT TTTTGATACT AAATCAGGGC AGAGGGCGAG

233751 ATAATCGTTA AAGAGGTTGA TATGAAAGAT AGATGCTGAT TCATGAGAAA 

233801 GTTTTAAGAT GTCTATTTGA GTTTCTGTAG TCAGGGTTTT TTGAAAAGGA 

233851 AGATGTAGAA ATTTAGCAAA TTGCTTAGCT TCCTTAGGTG AATTGAGCCA 

233901 CCATTGGGCA AACGTATCAG AGTCGTGGGT AGAGAGAGTG GTCACAGAAA 

233951 GTGGATTATA ATCTTTTAGG GGAATGAAGG CACTGTCGCT TTCCCAGTTG 

234001 CGTTCCCATC GTGGAATCCG GGTTCCACAG ATTCCTAAGT GTGTTAATGT

234051 CGTTTTGACG TCTTGGGGTA TAATCCCTAA ATCTTCTCCG ATAGGTAACA

234101 TAGAAGAGGC TCCGAGCATA GTAGAAAGGA TCTCCGTGCC CTGCTTTATA 

234151 TAGTCTTTAG GATTGTCTGG AATGAACCTT CCTCTTCCTG AAGAATCCCA 

234201 AATCCACAAA CGGAAAAATC CTATAATATG ATCTAAGCGA TAGACGGAAT 

234251 AGAAGTTTTG AGCATATCGC AGACGCTCTT TCCACCAAAT GTAGTCGTCT 

234301 TTGGCAAGTT GTGAAAAATT ATAAATAGGC AGATGCCAGT TTTGTCCTTC 

234351 AGAATTGTAG AGGTCAGGAG GAGCTCCTAC AGACCTTGAT GAAGAAAAGT 

234401 AGTCTCGGAA ATACCAAACA TCACAGCTAT CCTTGCTAAT AAGAATAGGG 

234451 AGGTCTCCTT TAAGCAGGAC GTGGTGTTGA TCTGCATAGG CTTTCACTTC

234501 GCAGAGCTGT TGGTAACAGA GAAACTGTAG ATAGGAAAAA AAGAGGACTT

234551 CATCATGGAA TTTTTTAGTT AAGTCCGGAA AATTCTCCTG ATCTGTGAGC

234601 GACTTCGGCC AGTTATTAAT AGGTTCTCCG TGCATATGAT GTTTGATTGC

234651 ACGAAAGGTC CCATAGGGAT AAAGCCAATA GCGCTCGCTT TCTAGAAACT

234701 CAGAAAAATT TGAGTTTCCT TCGAGGGAAG ACTTGCAACA TTTTTGGTAG 

234751 TACTCTCTTA AGAATGCCCA TTTTTTTTCT TTAACTTGAG TATAGCTGAC 

234801 TGATGGAGTC GAGCATAACT CATGCATATC TTGAAGTTTC TTGGCAACTT 
234851 CAGGGATGGT ATCGATATTT GGAAGAGAGG ATAGGGAAAG GAATAGGGGA

234901 TTCAGGGCTA CGGAAGAGAT GCTGTTATAG GGACTCGTAT CTTCACCAGT 

234951 ATCATTTAAA GGGAGAAGCT GAATAACGCT GAAGCCCTGT TTTTGGCACC

235001 AAGAGATCAG AGGAATGAGA TCTAAAAATT CACCGATTCC ACAGCTATTT 

235051 TTTGTGTGTA TTGAAAATAG TGGGAGATAA ATCCCGTGTT TAGGAGAGGT

235101 TCCTATAAGT TTCCAAGCAT GTGCTGAGGG TGAGTGTTTT GTGTATTTTA 

235151 AAACATTCAC GCGACGTAGT AGATTCCCAA AACATGAAGG TCAGGAAGAT 

235201 TCCCTGTTCT TAGGGATTCC ATCCAAATCT TAGCATGCAA AGAAAAGAGC

235251 TTGAGATACT CGAATAACTT TTCTCCATTT AGGTAGTTCA TACTCAGGAT 

235301 ATCTGAAAGA TAGAGCTGTT GAGTGACCTC ACCGTAGCCT AGAATTCCCT 

235351 TGATGCTGGA TTGGAACGAG CCATTTACAG AGAGAGCAGC TTTGAAAATA

235401 CGCTCGCGAA ATACGTTTTC AGGAAGAGTA CCTAGTAGTG TCGATACTGC 

235451 AAGATCTCCG GAATTTCCAT CTTCTTCTAT TTGCACAGGG ACATGGGTAT 

235501 CGCTGAAACG GATTAAACAA GAGTTATTTT TATCTGGAGC AAGTGTCGTA

235551 TTTAATAGGG GTGCTAAGGA TTCTAGTAAT TGCTCGTATT GGTTTTGCAT

235601 GGCAATCCTT TTTTAATCAT GACCAAGGAT AGGGTTTAGG GAAGTCTGAT

235651 GCTTTGGGAT AATCTTCATT GTTTATATTT ACAGCATCTA AAGCATTAGC 

235701 AATCATAGCT CCTAATTGCT GACGTTTGTC TGCTGAAGAG AAAAGGCGTG 

235751 ACGACGTTTG ACGTAAAGCA GAAAAGAATA AGTTCAAGAC ACCGGTCACA 

235801 GAATCAACAT CGTCTCCTAT GAGATTGCGG ACTTCTCGTT CTACTTTAGA 

235851 TGCTGTTGGG AACTTATCGT TAATGATTTT ATGGTAGGAC TCAGCTACCT 

235901 TCACAAAGTT TAGATCAGAA GGAGTTTGGA TTCCCTCAGC TTTTAAGCTA 

235951 TCGAGTAAAA TAGGAACGCG ACTTTCAAAG TAATCGTACG AGGTAAGAAC 

236001 TGCTTGCAGG TTACGAGTTT CTGTCATGAG AACTTGTAGT TGCGCACTGG 

236051 GTACGTAGGG ACCCTGCCTT TTTAATTCTG TTQCCATTCC TTTCATTAGA 

236101 AAGGAGCTGA CAATAGCCAT ATCTTGGTAG GTATAGCGGT CTTGAAGCAT 

236151 AGAAAGTAGC TGATCACAGG TATGTGTGTC TCCAGTCACT TCTAAGTACA 
236201 AAGAGCGAAG CCCTGAAGGA GAAACATTCA GTTGGTCTGC ATATTCTTGA

236251 GAGGCAAATA AGATGTTTTT CGCACCAATA GCAGTTCGTC CGAATTGCTC 

236301 CGTATGAGTA TTCCTTGCTT GGATAAGCGC TTCTTTTAAT TTACCTTGGG 

236351 AGGGTGGAGT CGTTTGAACC AGGTAGTCCA AAGCTGTGGA TTGCAGAGCT 

236401 GGGTCTTTAA TTTTCTCTTG TACAAGAGCA AGAATGTCTT CTGGAGAAGC 

236451 ATCGTCTCCT ATTGCATCAC GCAGGCCGCG AAGTTCTTGA CCAGAGATTT 

236501 CAGAATTCCC AGAAGCATAC TTATCAGCAA GATCTGTGTC AGGCTTCTCT 

236551 TCTGTAGATT CAGATTTTTT CTCAGCCTTT CCAGCTTCTC CTTTTTTCCG 

236601 AGATTCTAGA GTTTGAAACT TCTCTTCCTT TTTTTTCGTG CGTGTTGCTG 

236651 CTGCGGGATT TGTCAGGTCC TGAGATTGTT GAATCATGTT CATCTCAGAA 

236701 CCTTCTTGGC TGGCTACAAC TTCTGCTGCA TCTGCTTTTG CAGCTGCAGC

236751 TTCTACAGCT GCAAGGTTGA CACCCTGAGT GCCTCCTAAA CCACCTGTGC 

236801 CTCCTGATGC TGCCATATGC CTCCTATGAG CGACAACGTA TCAATTAGAA 

236851 AATCTGAATT CTTCCTAAAG GCTGGATGCG GATTTCTGGT AGGATTTCTT 

236901 GATAAGAAAT CACAGCAATG TCAGGGAATT CTGTTTCTAT TAATTTTCGT 

236951 ACATATCTTC TTACATCAAT TGCTGTCAAT AATACTGGTG GTTGGCCTCC 

237001 TGCAGGTGTT GGCGTGATCG TATTCCTCAT AGATTTTAAA ATTAGGTTCA 

237051 CAGAATCAGG ATCTAGAGCA AGGTAAGAAC CTGCCGATGT CTGTTTAATT 

237101 GCTCCACGAA TCATCTCTTC AATTTCTGGA TCTAAGAGAT AAACAGAAAT 

237151 TGCTGATTGT CCTTGAGAGA ACTTGAAGCT GATATAAAGC TTTAAAGAAG 

237201 ACCGTACATA TTCTGTAAGC AAAACTGTAT CTTTCTCAGT TTGCGCCCAC 

237251 TCGCTCAGAG ATTCTAAGAT TGTACGTAGG TCTTTAATTG AGATTTGCTC 

237301 TTGAACCAAT CTCTTAAAGA TTTCCGTAAG CTTTTGCAAT GGAATAAGCC 

237351 TTGTGACTTC CTTCACTAAG TCCGGGAATG AACGTTCCAT AAATTCGATC 

237401 ATAGAACGTA CCTCTTGAAT TCCCAAAAAC TCTTGAGAGC TTTTATGGAA 

237451 AAAGTACGAA AGATGGAGAA TGATCACTTC GAGCGGCGTC CAATATTTAA 

237501 TTGCTGCCTT CTCTAGAATA GCTTTTGCAT CTTCACTAAC CCAAGCTGAA 
237551 GGAAGACCCG CAGCATTCTT ATAGGTAATG AAAGGTAGAT TATAACGGCT

237601 GAGATTGTCC TCCACCTCAT TGGTTAACAC ATGGTGCGGA GGAATTTTTC 

237651 CTCGCACATA AGGGACTTCA TTAAGCAGAA TCATATAATC GTATCCTTCT 

237701 AAAGAAGGGG AATCTGTGCG AACATGAATG CCAGGGTATC GGATTCCGAT 

237751 ATCCTGATAG AGAGCTTGCC GCATTTTAGG AATCATATCA TCAACAAAGC 

237801 TTTGTCCTGA TTTTGTCTTG TGTTGGATAA GCTTAGAGAG ATCTTTTCCA 

237851 AGTTCTAGAA TTACGGGAAG AGTTAGAGAA TAGTCATCGG GATTATCCCC 

237901 AACAGTAGCA GCGCCATCAC CAGCAGCCCC TACGGTTGTT GAAGCTCCTG 

237951 AGCCACCACC TTTTTTTCCT GCCGCTGATT TCTTAGTCAG TAGGAGAATC 

238001 CCTAAGGCAA CGAAAATTAA TGCTAAAATG GAGAAGGACC ATAGAGGGAA 

238051 GCCCTTGAAG AAACCAACCC CTAAAGTTGC AGCACCTGCA AGGAGTAGTG 

238101 CTCGTGGTTC TTTAACGAGC TGAGTAGAAA TCTCTTTACC CAAGTTCGTA 

238151 TTTTTGTCAC TCGATACACG AGTCGTGACA ATACCCGCTG TCAACGCAAT 

238201 CAAAAGAGAA GGAATTTGAG AGACTAAACC ATCTCCAATG GAGAGAAGAG 

238251 TGTAGACGTG AGCTGCTTGA GCGAGGTCCA TGCCGTGCAT AGCCACCCCA 

238301 ATCGTCAAAC CGCCAACAAT GTTAATCAAA GAGATAACGA TACCAGCGAT 

238351 AACGTCTCCT TTGATGAACT TCATGGCACC GTCCATGGCT CCGTAGAGTT 

238401 CACTTTCCTT TTGGATTTGA GCCCTTTTAT CACGAGCTTG TGTGGCATCA 

238451 ATCATACCAG CTCGTAAGTC CGCATCAATC GCCATCTGTT TACCTGGCAT 

238501 CGCATCCAAT CGGAATCGGG CAGCAACTTC GGCAACACGC TCGGCACCCT 

238551 TAGTTACTAC GATAAACTGA ATGATTGTAA TAATGAGGAA GATAATGAAC 

238601 CCGACCACAT AGTTCCCTCC AACCACGAAG TCTCCGAAGG CCTGAATGAC 

238651 ATGACCCGCA TACGCTTTAA GGAGAATCTG TCGAGAAGAG GAAATATTAA 

238701 TCCCCAAGCG GAACATCGTA GTGATGAGGA GCAACGAGGG AAAAACAGAC 

238751 AGCTGCAAAG CACTTGGAAT ATAAAGAGCC ACCATCAATA AGAATACAGA 

238801 GATCGATAAG TTGATGGTGA TCATCAAGTC AACGATAGGC GGAGGCAAAG 

238851 GAATAATGAT CATTAAGACA ACGCCCATCA TCCAAAGAGC AAGGATTAAG 
238901 TCGCTGGACT TATTGATCAT GTTTAAGGCG GTATCGCCAC CAAGTGTTCT

238951 GCTGACGAAA TTGAGTAGCT TATTCATTAT AAATGATCAG GTTGGTTAGT 

239001 ATTTTTATTA TTAGGATTTT GCGCATTCAG TGAAGTGATA TAGAGTAGAA 

239051 TTTCTCCAAT AGCTTCGTAA GTAGATTCTG GAATAAATTT TAATTCCTTC 

239101 CCTTCATCCA AAAGCTGATG TGCTAAAGGT ACGTTTCGCA TAATGGGAAT 

239151 TCCGTACTTT TCAGCTTCAT CAAGTATCCT TTTAGCTCGT AAGTTGATGC 

239201 CCATGGCAAT GATCCAAGGT GCTTTATATT TTTCAGGCAT GTAGCCAATA 

239251 GCAACAGCAA TATCTTTGGG ATTAGAGACT ACGGTGCTTG CATGTTTCAC 

239301 CTGTGATGAC GAGTCTTCAT AGGCAATTTC TTGAGCAATT TGTCGACGAC 

239351 GGCCTTTAAT CTCAGGATTT CCTTCCGTGT CTTTAAACTC CTGCTTAACC

239401 TCAAACTTCT CCATCTTTAA TTCTTTAGCG AAATTGTGGC GCTGATAGAC 

239451 AAGGTCAAGA ATCGCAACAA TCAAAAAGAA AATTCCTATC GAGGTTACTG 

239501 CTTTATAAAA AATTTCTTTG AAGATTTGAG CAGTAATTAT AGGAGAGACT
239551 CCTGCAGTTT CTATAATTAA AGAGACTTTG CTTTTTAACG TTATGTATAA 
239601 AATTAAGGCT GCTCCAAAAA TTTTTAAAAT CGATTTGATC AGCTCTATGA 
239651 GAGTCTTTAT TTTAAACTTT TGTTTGATGT TCTCAATAGG GTTGAACTTC 
239701 TTGATATCTG GTTTAAAAAC TTCGGTAGAA AATGTAGGAC CAACGATAAG 
239751 AAAACCTACA ATGACGCCAA CAACAGCAAC AGCTCCCAGT AAGGGAAGTG 
239801 ATGCTGTTAA AATAAGCATA AGACAGTTCT TAAGATAAAA TAAGGTAATT 
239851 ACAGGATCAT GGCGAGTGGG AGCTTGTGAG AGCATGGAAA CCAGAAAGCC 
239901 ACCTAAATGC TTGAAAAAAA AGGTCGATAG GGAGAAAGCC GTAAACATAG 
239951 AGACGATAAA GGTAACCGCA GAAGGAAAAT CCTGAGATTT TGCTACTTGA 

240001 CCTTTTTTCC GAGCATCTCT AAGTCGCTTC GGCGTGGCCT TTTCTGTTTT
240051 TTCACCCATG CTGTTGCCAA GGTTCGATTA AGAGGACTAT ATTCGGAATT 
240101 ATCTGCAAAT TACCTTATGC TCTTGTTTTT CTCAAGAGTG AAGCCCAAAC 
240151 GTTTTGTTCT TAGTAAGATT ATAAATGAAT CTTAGAGATA AAGAAGAATA
240201 GTAGAAAAAA AGAACAATAA CGAAAAGTTG TTGAATTTAG CAATACTTTA
240251 GGTCACTTCA AGATGGTTCT ATATGAAGAA AGTATTAGGT AGGGAAAACT

240301 TCAATCGATT ATAGTGGTTT TTTATCATTT TCAAAGCACT CTTGATTTTG 

240351 AAGGTACTAT TTTTCTAGCT ATTGTGCTTT TTTGAAACAG AAGAGTTGTC

240401 TTTCTAATAT TCGAAAAAGC ATGTCATTAT TTTTATTTTT AGATGTCTTA 

240451 TGAGTCATAC TGAATGTGGA ATTGTAGGGC TTCCTAATGT AGGAAAGTCT 

240501 GGCTTATTCA ATGCTTTAAC AGGAGCTCAA GTTGCCTCCT GTAACTATCC 

240551 GTTTTGTACT ATCGATCCTA ATGTGGGTAT TGTTCCTGTT ATCGATGAAA

240601 GACTGGAAGC CTTAGCTAAA ATTAGCAATA GTCAGAAGAT CATCTATGCG 

240651 GATATGAAAT TTGTAGATAT TGCAGGTTTA GTTAAGGGAG CTTCCGATGG

240701 CGCGGGTCTG GGAAATCGGT TTCTCTCTCA TATTCGAGAA ACTCATGCTA 

240751 TTGCTCATGT AGTGCGTTGT TTTGATGATC CAGACGTTAC ACACGTTTCA

240801 GGAAAAGTCA ACCCTGTTGA GGATATTGAA GTTATCAACT TAGAGCTCAT

240851 TTTTTCTGAC TTCTCCTCAG CAAAAAATAT CCATAGCAAA TTAGAAAAGC 

240901 TAGCCAAAGG AAAGCGTGAA GTAGGAGCTC TCTTGCCTCT ATTTGATACA 

240951 ATTATTGCTC ACTTAGAAAA GGGGCTGCCG CTACGTACTT TAGAATTAAC 

241001 TCCAGAACAA ATTGTGGCAT TAAAGCCCTA TCCGTTTTTG ACCATGAAGC 

241051 CTATGTTTTA TATAGCTAAT GTTGACGAGA GTTCTCTACC AGATATGGAT 

241101 AATGATTATG TTGCCGCTGT TCGGGAAGTT GCTGCAAAAG AAAATTCTAA 

241151 AGTGGTTCCT ATCTGTGTTC GTATAGAAGA AGAAATCGTT TCCTTACCTA 

241201 TTGAAGAGCG CTTAGAATTT CTTATGAGCT TAGGTCTTGA AAAATCAGGA 

241251 CTTCATAGAT TAGTGCGTGC TGCGTATGAC ACTTTAGGAC TGATTTCTTA 

241301 TTTTACTACA GGTCCTCAAG AATCTCGTGC ATGGACAGTG GTTCGAGGGT 

241351 CTTCTGCTTG GGAAGCTGCT GGAGAAATCC ATACGGATAT TCAAAAGGGC

241401 TTTATTCGTG CTGAAGTGAT TACTTTTGAA GATATGATAG AGTGTCAAGG 

241451 TCGTGCAGCT GCTCGAGAAT TAGGGAAATT ACATATAGAA GGACGTGATT

241501 ATATCGTCCA GGACGGTGAT ACTATGCTGT TCCTTCATAA TTAAAGGAAC 

241551 CCTTTGCAAA CCAATCTTGA GCATCCAAAA TGTCTTTTTC AATTGCTCGT 
241601 ATTAGAGTTT CTTTTGATTG AAACTTTTTT TCTTCTCTAA GAAATTTTCT

241651 CGGGATAATG CTCACTTCTT TGCCGTATAG ATTTTCCGCA AAGGAAAAGA 

241701 TATGCGCCTC TGCATATAAA GACTCTCTTC CAAAAGTAGG GGCAGTTCCT 

241751 AAATTCATAA CACCCTGACA GGTAGTGCTA TCATAACGTA TTTCACAAGC 

241801 ATAAACTCCT AGGGGAATTA AACTTTCTTC TCTAGGAAGA TTTATAGTGG 

241851 CGAATCCTAG AGAACCTCCT ATTCCGGAGC CCTCGGTTAT TTTTCCAGAA 

241901 ATGGCATAGG GATGACCCAA AAAACGATGA GCACATTCAA GATTCCCTGC 

241951 GGACAGAAAC TGGCGGATTG CTTTGCTGGA GACAACTATG TTATCCATAC 

242001 GGTAAGGAGG AATCTTGATG ACCTCTATAC CTAACGGCTT GCCTATAGTA 

242051 TCGAGAGCCT CGGTATTGCT TTGCTGTTCT TTCCCTATGC AAGAATCATA 

242101 ACCTAAGATG AGGCGTTTGC ATTTCAAGTT ACGATGTAAC AAAGTAAGAA 

242151 ATTCTTCTGC CGATTGATTC GCAAAATTTA AATCAAAAGT AAGGACACCT 

242201 AACCAGTCTA TGGGAAACGT TTGCAATAAT TGGAGGCGCT CTTCTTTTGT 

242251 ATTGATGAGT TTCGTGTGAT TTAAAGAAAG TACCGTTTGA GGATGAGAAT 

242301 CAAAGGTAAT AACTCCACTG GATCCAGAAT AGGAAGTAAG AATAGATAAA 

242351 AGATTGCTAT GCCCTAGATG ACATCCGTCG AAAAAACCTA CAGTTACAGA 

242401 ATCTACAGAA AACGAAGACG TTAAACTATA GGCTATTTCC ATGGGCATCT 

242451 CGTAGGTAGG GAGAAATATC GAAATCGGGG TGGTCTAATA GATTCCCATC

242501 AATACATTCA TCTATAGAAA AACGGCCACT GCGTAAACGG CGTAGCTGCT 

242551 CAAGATAAGC TCCACAGCCT AACATCGTGC CAAGCTCATG AGCAATGCTG 

242601 CGAATATAAG TTCCTTTGCT ACAAGAGACT ACAAAATGCA ATAAAGGGTA 

242651 CTCATATTTC GTAATCTGCA AGTGAACTTG AACTGTAGAA TGGTGACGTT 

242701 CTATAGATAA ACCTTTTCTA GCATATTCAT ACAGCTTTTT CCCTTGGACT 

242751 TTTTTAGCGG AAAACATGGG AGGCAGTTGC TGGATCTCTC CTTGGAAATA 

242801 CTCGGCAGCT GATAATACTT CTTCGAGACT AGGAATCTTC TTAGATCTTC 

242851 CTACAACTTT GCCGTCGCAA TCATAAGAAT CGGTAGTTGT CCCTAAATGG 

242901 GCAATTGCTT CGTATTCCTT GTCTTCAAAA AGTAAAATAT CAGAAAGTCT 
242951 AGTAAATTTA CGGCCAATCA ACATGACCAT AACGCCAGTA GCGAAGGGAT

243001 CTAAAGTTCC TGCATGACCA ATCTTTTTAA CGCCTATTAA CTTGGTTAGA

243051 GCGCGGATAA GGCTAAACGA AGTTCTCCCT TGAGGCTTGT CTACAAGAAG

243101 AATGCCCTCT TTTAATTCTA CTGCAAGATC CATAGTCATG TCTTTAATAG

243151 TATTCAAGTT CCCAAAAAAT ATAGTTTATT AACTCTTTTC TTTCTCTTGA


243201 


ATCTGCCAAA 
TATATAATCT
AAATATCATC


243251 


GAGATAAAAA 
CTCTATGAGC
LA 1 1 ALAbA 1


APATAAAPAC 
243401


CAAATCCTTA 
CACo A\j I 1 At-


PP.TYI ATPP A A 


AGATTAGAAA 


243451 


TCTTGGGATG 


CTTAACATCT 


rpn"»m x t\ > k mm * 
pr"PTTP.PA A T


GGCTTCTTGT 


243501 


AATAAAGCAT 


TTACCCGTTT 


AAI ACLjIC iA 


1 1 1 ILiOi vn 


TACAGTACTT 


243551 


CAAGTTATAG 
243601


TCACCTATTT 


GAGCTTGCTo 


tjlAH-\_A It 1 
CACACTCTAA


243651 


ACCTTTGCGA 


ACTTCTTTGA 


CATC 1 I L 1 1 1 


AALntvJ 1111 


AATGAAGATA 


243701 ACGTACCTTT CCAAAGGATC TCTTTATTAC GTAATACTCG GACTTTATGA

243751 TTGCGAGTCA TAATTCCTTC AGTAACTATG CAACCGTAAA TAGATCCTAC

243801 TTGTGAAGAC CTAAAGATTT CTTTAATCTC AGCAGAACCT TCATCTTTTT

243851 CTTCAGCAAT AGGATCTAAT AGAGAAGTCA TAATTTCTTT AATTGCATCA

243901 ATAGCATGAT AGATGACGGT AAATAGTTCA ACTCGGACTC CTAAGCTCTT

243951 AATTAAAGGT TCCGCATGAC TTTCTATTCC TGTATGGAAA CCGATGAGAA

244001 CTGCTTTAGA GGCGGCAGCT AAACGAATGT CTGATTCTGA AATTTCTCCT

244051 ACACTGTTTG TTAAAATTTC AACATCTACT TTTTCTGATT TAATCTTAGA

244101 TATTGAACTG ACCAAAGCTT CTATGGAACC TTGAACATCA GCTTTAATCA

244151 TAAGCTTAAG AGTCTTTTTA TTCTGTAACA TAGAATCAAA GTTAGGCCGC

244201 TTCTTTTGCT GTAAAGCAAA ACGCTGTTGT CCTGCGGATC TAGCTTCAAT

244251 AATGTCTCTA GCCGTTTTCT CGTTTTTCAC GACGAAGAAA GGATCGCCAG
244301


r>mnr»T* IS rT* A AT 

CTT 1 Ao wtA 1 


GTCCGATAGA 


CCTGTGATCA 


ACACAGGAAT 


AGATGGCCCA 
GCTTCI Hv.A


TCAATTCATT 
244401
244451


TS IS I"!/"* IS A A IS ^ 
ATAACAAGTu


CTCGAGCACG 
AAAGCTAACA


TCTCTAAAAG 


TTCTGAAAGA 


CCTTCTCCTG 


244601 TTTTTGCGGA GGTATTTACT GTAACAGTCG AGCCTCCCCA AGCTTCTGGC

244651 AATAGATTGA TTTCAGAAAG TTGTCTATAG ATGGTTTCGG AATTAAAATT

244701 AGGCTTATCA CACTTGTTGA TAGCTACAAC AATAGCGATA TCAGCAGCTT

244751 TTGCATGTTC AATAGCCTCT AAAGTTTGTT CTTTAATTCC TTCGTCTCCA

244801 GCGACTACAA GCACAACAAT ATCACAAACT TPAGCTCCAC GGGCTCGCAT

244851 TGCAGAGAAA GCTTCGTGAC CAGGAGTATC TAAAATTGTT ATGTCTCCCA

244901 CTGGGGTGGA GCAGCAGAAG GCTCCCATGT GTTGGGTAAT CGCTCCAGCT

244951 TCTGTTGCAG CGACATTACT TTTCCTTAAG GAGTCAATGA GTGTTGTTTT

245001 TCCGTGGTCG ACGTGACCCA TAAACGCAAC AATAGGGGAG CGAATCACAA

245051 GCTTGCTGGG ATCTGTAGAT TGAATTTCGT CTCTTACAGT GTCATTGCTT

245101 AGGCACAACT TATCTTGCTC AGAATAGTCG ATGTCAATTG TACATCCAAA

245151 CTCTAAGCCA ATAAATTGTA CTGCAGTTTC GCTGTCTAGA ATATCATTGA

245201 CTACATAGGT CATTCCATGA ATGAATAACT TTTGAATGAC TTCTGAAGCC

245251 TTGAGCTICA TTTCTGCTGC CAGATCTTTC ACGGTAATTG GCAAGGAAAT

245301 TTTGATATGC GTAGGTCGCT GGATAGAGGC TTCGTCATAG TGTTTTTTAG

245351 GCTTATAAAC ACGTTTTTTT CGCCATCTGT CTTCTTCTCC GCCTTCATTT

245401 AATCCGTAAC GATCTCTTCC TGTAAAAGCC TTTAGGCTTT CATCAGATTT

245451 CTTAGAACGA TCACGAAAGT CGGTAAGATT TTTCTTCCCA GCATCTCTTT

245501 TTGGACCAGA AGCAGGACTA CGGTTAGCAG GATTGAATTG TTTTTCTCGA

245551 TTATTCTGTT CACCACCTTC AGATGTTCCA GGTTTCCCTG TTTTATCTGA

245601 AGCAACGGGC TTTGTGCTTT TCGAGCCAGC TACGACTTTC TCTTCCTTGG
245651 CAGGAGCCTT GAATGTTTTT GCTAGGAGAT GATTGATATG CTTTCCTGTA

245701 GGGCCGAACT TAGATTTAAT CATTACGACG CTTTTAGGTT CAGCAGGCTT

245751 CACAGGCTTA GGCTCTAATT CTTTTTCTTG AGGTGGGGTT TCGGGCAATA

245801 CGGGTTGCTC AGGAAGAACT TCAGCAACTG GATGAACCTC AGGACTTTCG

245851 TCACAAACCT CATCGACTAC TTCTAACTCA GGCTCAGGAT CTGCTATGGA

245901 GACTGGAGCA GGTTCAGATG TATCCACTGG AATATGAGCA GAAGACTCTT

245951 CTTCGGATGA TGAGAACGAC GAACGATTTT TAGCACGAAT GCGACGTGAA 

246001 GTAGACTCTG GTGAAGCTTG TTCCGCACTT GCCGTAGGGG TAGAAGTTGC 

246051 GGCAAGAGCT ACTTTTACAG ACTTTTCTTT CGCAGAAGGT TTTTCTGAAG 

246101 AAGATTTAGC TTCAGAAGAT CCTGCTTGGG CAAGTTTTTG TTTTAGCTTA 

246151 TCCAGCCCCG CGGCTTTCGT TAATTGAGCA TTCTTAATCT TCAATTTCAG 

246201 GTTTTTCGTC AACTTTACTT TCTCCATATT TGCTGACTTG CTCAAGGATC 

246251 TTATAAGCAA GCTCTAAACT GATCCCAGGA ACAGATGCCA GATCATTAGC 

246301 ACTCGCTAAT AATACTCTTC TAATTGTGTC ATATCCAGCA TGTTCTAAAT

246351 TTTGGATGAC TAGCTTACTA ATCCCTTCCA TTTCTAAGGG TTGATCTAAG 

246401 TGCGGACTAT CGAATTCTGC TAATTGAAGG CGTTGAATTT CTAGCAACTT 

246451 ATTGTACTCA CTCATACGTT GTACTTCGAG CTCGTAGTCT AGAATGTGGC 

246501 TAATTAAACG AGCGTTAATT CCTCGTTTAC CAATAACAGT AGCGTAGTCT 

246551 GCATCATTAA CGACAATTGC AATCACTTTG TCGTCTTCTA AAATAGCAAT 

246601 CTTTTGGATT TCTATTGGAT AAAGAAGATT CTGTAATAAC TCTGTAGAGA 

246651 CGGGGGAGTA ATTGACAATG TCAATTTTCT CATCGTTCAA TTCTCGAATG

246701 ATATTTTTTA CTCGAGAACC TCGCATACCT ACAAAAGCTC CAACAGGATC 

246751 AGTTTTAGGG TCTGACGATC TTACAGCTAG TTTCGTGCGG TACCCAGCTT 

246801 CACGAGCTAT CTTAACAATC TCCACAGAAC CTTCTTCTAG TTCTGGGACT 

246851 TCTTGAATAA ATAATTGTTT AACAAATTCT GCGTGACTAC GACTGAGGAT

246901 AACTTCCGCT CCACCATTTT CAGACTCTTG AACTTCATAG AGTAGGGCGT 

246951 AAATTTTATC ACCGATCTTA TGTTTTTCTG TTTTAGGATA AAACCGGGTA
248351 CCCTCTTCAA TTTCAATGAA AGCTCCGTAG GGGAGAAGCT TCACAATTTT

248401 ACCAAGAACT CGTTTTCCAG GAGGGTATTT CTTCTCAATA TCTTCCCAAG 

248451 GATTATGCTC TTTTTGTTTG AGACCTAGAG CAACTCGTCC TTTTTCTTTA 

248501 TCTACGCTTA AAATAATTAC TTCCAACTCT TGATTCAATT CGACCATTTC 

248551 GGAAGGATGT CGTATGCGCT TCCAGGTCAT ATCGGTAATG TGGAGAAGAC 

248601 CGTCAATACC ATCGAGATCT AAGAATACAC CAAAGTCAGT AATGTTTTTA 

248651 ACAACTCCTT TGCGGTATTC TCCGATAGAA ATTTGTTCAA TAAGTTCGGC 

248701 TTTCTTAGAG ATTCTCTCAG CTTCTAAGAG TTCTCTTCTT GAGACAACAA 

248751 TATTGCGACG TTCAACGTTA ATTTTTAAAA TTTTGAATTC ACAAACTTTT

248801 CCGACATAAT CATCTAAATT TTTGATTTTC TTGTTGTCAA TTTGTGATCC 

248851 AGGTAGGAAG GCTTCCATTC CAATATCTAC AATAAGGCCG CCTTTGACTT 

248901 TACGTGTAAT TTGACCTTTA ACAATAGAAC CTTCTTCACA ATGAGCTAAG 

248951 ATGTATTCCC ATTGACGTTG TCGTGTGGCT TTTTCTCTAG AAAGGACAAC 

249001 TTTGCCCTCT TCGTCTTCGG CTTGGTCGAG ATAGACTTCT ACTTCAGCTC 

249051 CAAGCACTAA ACCTTCTGAA GAGTCTATGA ACTCTGACAT AGGGATCACT 

249101 CCCTCAGACT TCAGACCAAC ATCAACTACG ACAAAGTCTT TATTAATATC 

249151 AACTACGGTA CCTTTTAGGA TGGCGCCAGG CTGTATTTCG TTATCAGATT 

249201 CTTCTTCGCT CGAAGTAATT CTGTGTGCCG TATAAAGCAA ATCTTTAAAT 

249251 TCGGCAACGT CTTCTGTGAG GCATTCTATA TTGTCCAGAA TTTTTTTAGA 

249301 TCCCCAAGTA TATTCAGCTT GTTTTGGCAT TTATTGTTAT TCTCCTAAAA 

249351 ACTACAAAGT CAAAAAAGAC AGTGTAAATA TAAACCTTAA AAAAGGCAAG 

249401 ACCTCCCTTG CAGGAGACGA TTAGTTCTAT ATAATCGAAT TAATCGTAAA 

249451 ATCCTATACT TCCGTTCTAG AGATGTATAA GGCCTCTTTT TTAACTTTTC

249501 TATATTACCA GATTTTATAA AATACGTGTG GTAATTATCA TAAATGTGCC 

249551 TTGTCATATG TAGAGAAACA GGATTTGGCG CATAGAAAAC ATATCGTTTT 

249601 TTTCATAAGC ACCCCATAAA TTCTCGTCCC AGATGGGCTA CATCAGCTTT 

249651 GTTCTTAAGC CTGATAGATC TTTTTGGTTG CTGCTCAAGA ACCAAGTGCA 
249701 TTCAGAAGTA GAACCATACT TATTGAGGGT TAAATAATAG GATGGGTCTT

249751 GGTTTCTTAG CTCTAAGAAA ATTCTAAAGA TTAATCACAG AAATGTAGGT 

249801 GTTATTACTG GAAGATGAAG ATCTTGAGAT TTGAAACTTT AGTCTTAAAT 

249851 CCGTATAAAT TAAAACTAGA CAAAAGCTAT TCAAAAGCAA GTTTTTTGCA 

249901 TAGAGAAACT GACGCATAGA CGCTAAGTGT TGTTTATTAG ATAGGCTAGA 

249951 ATTAAGTGTC TTTTTTCTAG TAGATCAAGA ACAGTGTTCT CAATTTCTAA

250001 AGTTAGATAA GGAAATTATA AATGATTCAT TCCCGGTTAA TTATTATTGG

250051 TTCAGGTCCA TCTGGATATA CAGCGGCAAT TTATGCATCA AGAGCGCTTT 

250101 TGCATCCTCT TTTATTTGAG GGGTTTTTCT CTGGGATCTC TGGTGGCCAG 

250151 CTTATGACTA CAACAGAAGT TGAGAATTTT CCAGGGTTTC CTGAAGGGAT 

250201 TCTTGGGCCA AAACTTATGA ATAATATGAA GGAGCAGGCT GTGCGGTTTG

250251 GGACCAAGAC ACTAGCTCAA GATATTATTT CCGTAGATTT TTCTGTTCGC 

250301 CCTTTTATTT TGAAATCAAA AGAAGAAACC TATTCTTGTG ATGCCTGTAT 

250351 CATAGCTACA GGAGCTTCTG CTAAACGTTT AGAAATTCCT GGAGCAGGAA 

250401 ACGATGAATT TTGGCAAAAA GGAGTGACTG CTTGTGCCGT TTGCGATGGG 

250451 GCTTCTCCTA TTTTTAAAAA TAAAGATCTT TATGTGATTG GGGGAGGGGA 

250501 TTCTGCTTTA GAAGAAGCTC TTTACCTGAC TCGTTATGGA AGCCACGTAT 

250551 ATGTAGTTCA TCGTAGAGAT AAACTGCGGG CTTCTAAAGC TATGGAAGCT 

250601 CGGGCGCAAA ACAATGAAAA AATTACATTT TTATGGAATA GCGAGATTGT 

250651 AAAAATTTCT GGAGATAGCA TTGTCCGTTC CGTAGATATT AAGAATGTTC 

250701 AGACTCAAGA AATTACAACT AGAGAAGCTG CGGGGGTGTT CTTTGCTATA 

250751 GGCCATAAGC CAAATACGGA TTTTCTCGGA GGACAGCTGA CGTTAGATGA 

250801 GTCGGGCTAT ATTGTGACTG AGAAAGGAAC GTCCAAGACT TCTGTCCCTG 

250851 GAGTATTTGC TGCTGGAGAT GTTCAGGATA AGTACTATCG TCAGGCGGTT 

250901 ACTTCTGCAG GAAGTGGTTG TATAGCAGCA CTAGATGCTG AAAGATTCTT 

250951 AGGCTAATGC GATTGCAGTT GCTGTGGCAT ACTCTTTGCA GTGGCTTATA 

251001 GAGAGAATGA CTTTAGAAAT TCCAATTTTT GCATAGACAT GCGAAGGGAG 
251051 GAGAACTTCG GGTCCGTGAG ATACTTTAAA GACTTCGATG TCTTTCCAGG

251101 CAACAACGCT CCCTATGCCA GTTCCTAAAG CTTTTGCTAC AGCTTCTTTT 

251151 CCAGCAAAGC GACCTGCAAA TGAAGGGATG GGATCGGTCT TTTCTAAGCA 

251201 ATATTTCTGT TCTGCTTCTG TAAAGATTCT ATTGAGTAGT CGATTGCCGT 

251251 GAGTTGCAAT TGCCTCGCGA ATGCGGCTAA TTTCAATAAT ATCGGTTCCT 

251301 ATATGAATGA TTTCCATAGA GTTCGCACTA ATTTCCTTCA GGATTTTCCA 

251351 TTAGGGCATT CTGTAATTTT GAGAAGTAAG CGATGGCAGT TTTAATGAGA 

251401 GCATAGCCCA CTGTAGCTCG AATCACGAAT AAGGCTTGAG GATTGCTGAT 

251451 ATAACGACGA AGGACTAAAG AGAAGGCAAT CATAATGACC ATGATGAGAA

251501 ACCTTCTAGA GATAAAGGTC TGCTTTATAT AGGCAATCCA TGTAATTTTT 

251551 TGTGAAAAGA ATTCAGAAGA TAGACTCAGC TGTTTCCTTA TAGTTTTGCT 

251601 TAGCAAGTAG CGATGTTTTA GGGATGCTAA TCCCCAAGAA ACCATAGATA 

251651 GAAGACAGTA GGTAAAAAAA GAAGAAAGGG GGTCTAGGGC TATTGCGGTA 

251701 GCTTTTAGCA AGAGTTTCAT TCCTGCTGAT ATCCACAAAA TACCAGGAAA 

251751 TAGTATCAGG AAATATTTGA TGTTTCTTGC CATAGTGCAT TACACTTAAT 

251801 TTCTAAATAC TCTATTTTAC TTGGAGTATG AATTTTTTGC TAACTGGTAT 

251851 TAAGATATGC GAGCTGAGAT GGCTGTGATC TATTGGGATC GCTCAAAAAT 

251901 TGTCTGGTCT TTCGAGCCAT GGTCTCTAAG ACTTACTTGG TATGGCGTCT 

251951 TTTTTACTGT AGGGATTTTT CTAGCATGTC TCTCAGCAAG GTATTTGGCT

252001 CTTTCCTATT ATGGTTTGAA AGATCATTTA AGTTTTTCCA AAAGCCAGCT 

252051 ACGCGTGGCT TTAGAAAACT TTTTTATATA CTCTATTTTA TTTATTGTCC

252101 CTGGAGCTAG ACTTGCCTAT GTGATTTTTT ATGGATGGAG TTTTTACTTA 

252151 CAACATCCTG AAGAGATCAT TCAAATATGG CACGGAGGCT TGTCGAGTCA

252201 TGGAGGCGTT CTTGGCTTTC TTTTGTGGGC GGCCATTTTT TCTTGGATAT 

252251 ATAAAAAAAA GATTTCAAAA TTGACTTTTC TCTTCCTTAC AGACTTGTGT 

252301 GGATCAGTTT TCGGAATTGC AGCGTTTTTT ATTCGTTTGG GTAATTTTTG 

252351 GAATCAAGAA ATTGTAGGAA CACCGACTTC TTTGCCTTGG GGGGTGGTTT
252401 TTTCTGATCC TATGCAAGGT GTCCAGGGAG TTCCTGTGCA TCCTGTGCAG

252451 CTTTATGAGG GAATCAGTTA CTTGGTCGTC TCTGGAATTT TATATTTTCT 

252501 TTCCTATAAG CGTTATTTGC ATTTAGGTAA GGGATATGTG ACTTCTATAG 

252551 CCTGTATTTC TGTCGCTTTC ATTCGTTTTT TTGCGGAGTA TGTAAAAAGC 

252601 CATCAAGGGA AAGTTTTAGC AGAGGATTGT CTACTTACAA TTGGTCAAAT 

252651 TTTATCTATA CCTTTATTTC TATTTGGTGT GGCCTTACTT ATCATTTGCT 

252701 CATTGAAAGC TCGAAGGCAC CGTTCACACA TATAGCAGCA TCTTTTTGGC 

252751 ATGAGACTTT AGAATTTTAC AGTCTATCAC AATAAGAGAC CTGTATTGAG 

252801 AGATTCTCCT TTGGGAGATA ATGTGGGTTT ATGATATTTT GACCTCTTAA 

252851 GTTTAATTTT TAGAGTTTAT CAAATGAATA AACGCACTTT GCTTTTTGTT 

252901 TCTTTAATTG GGATTGCTTT TGTAGGATGT CAAATATTTT TTGGTTATAA 

252951 TGAATTTCGT TCCTGCAAAA ATCTAGCAGA GAAACAAAGA AAGATTTCAG 

253001 AACAGACGCT AGCTGCAGTA GAATCTGTAG GGTTAAGTGT AGCTTCATGG 

253051 GACACCGATG TAAACGGAGA AGAACATAAG AATAACTACG CAGTTCGTGT 

253101 TGGAGACAAG TTATTTTTAT TACATAATGG AGAAGCTGCT CAGTCTGTTT 

253151 ATTCTTCTGG GGAATCTTGG AGCTTTGTAG ATCACAAGTG TGGTTTCGAT 

253201 AATATTCACT TGGCTTTATA TCGTCAGCAG GGTTCTTCTT TCAATCCTAC 

253251 GAATACGGGA AAAGTTTTTC TTCCTACGAA TCATGAAGGT TTACCTGTAC

253301 TAGTTGTTGA GTTTCGTAAC AATAAAGAGC CTTTAGTATT TCTAGGTGAG 

253351 TACGCACAAG GAAGAATTTC CAATAAAGAT AGCACGATCT TTGGTACAGC 

253401 GCTTGTCTTT TGGAGATCAG GAAGCGACTA TATTCCTTTA GGTCTCTATG 

253451 ATTCTCGAGA AGAAAAGTTA GTTTCTTTGG ATCTTCCTAT TACACGAGCT

253501 GTAATTTTTG GTAATGACCA AGATTCGGCA AAGTCGTCAG ATACTGCGAA 

253551 CCACTATGTT TTATTTAATG ATTACATGCA GATTATTGTT TCTGAAGAGA

253601 GTGGTTCTAT AGAAGGTATC AATTTACCTT TTGCTTCAAC AAATAATAAA 

253651 AGCATTGTGA ATGAAATTGG TTTTGATAGG GATTTAGCTT CAGAGAAATC 

253701 TCCTGAAGCT CTTTTCCCTG GGCTGTCTTC AAAACTTCCT GATGGCCAAC

253751 AAGCCAAAAA CTCGATTGGA GGTTACTACC CTTTATTGCG CAGGGGATTA

253801 TTAAGTGATT CTAAGAAATT ACTTCCTCTA GAGTATCACG CATTAAATGT 

253851 GGTTTCAGGA AGAGAGCTAG CGACTCCTGT GGCTTTAAGA TACCGAGTTC 

253901 TTTCCTATAC CCCCCATTCC ATTCAATTGG AAAGCTTAGA TAGATCGGTT 

253951 CAGAAGGTAT ACAAACTTCC AGAGAATCCG GAAGAAAAGC CCTATGTTTT

254001 TGAAACTGCA ATTACTTTAA CGAAAGAAAC CGAAGATGTA TGGGTAACTT 

254051 CAGGAGTTCC TGAAGTGGAG ATCATGTCAA ATGCTTCAGC CCCAACCATT

254101 AAATACAGGG TTATCAAAAA AAATAAGGGG TCTTTAGATA AAGTTAAGCT 

254151 TCCAAAAGTA AAAGAGCCTT TAGCTATACG TCGTGGTGTT TATCCTCAAT 

254201 GGATTTTAAA TTCGAATGGA TATTTCGGTA TTATTTTAAC TCCGTTGTCT 

254251 GAAATTGCTT CTGGCTATGG ATCTCTCTAC ATTTCGGGTT CTACGGCTCC 

254301 GACAAGATTG TCTGCTATTT CTCCTAAAAA TCAACTGTAT CCAGTATCAA

254351 AATATCCTGG ATATGAGACC TTGCTTCCTT TGCCAAAAGA TGCAGGGACA 

254401 CATCGATTTT TAGTGTATGC AGGTCCCTTG GCAGAGCCTA CACTTAAAGT

254451 ATTAGATAAG ACAATTACTC AGGAGAAGGG AGAAAATCCT GAGTATCTTG

254501 ATAGCATTTC TTTCCGTGGT GTTTTTGCAT TTATTACAGC TCCTTTTGCA 

254551 GCACTCCTAT TTATTATTAT GAAGTTCTTC AAATTGGTTA CGGGTTCTTG 

254601 GGGAATTTCC ATTATTTTAC TTACTGTATT TTTGAAATTG CTTCTCTATC 

254651 CTTTAAATGC ATGGTCCATA CGATCTATGA GGCGTATGCA GATTTTATCT 

254701 CCTTATATTC AGCAAATTCA GCAAAAGTAT AAGAACGAAC CTAAGCGTGC 

254751 TCAGATGGAA ATCATGGGCT TGTATAAGAC AAACAAAGTG AATCCTATCA

254801 CGGGTTGTTT ACCTTTATTG ATACAGCTTC CTTTCCTAAT TGCGATGTTT 

254851 GATTTATTAA AGTCATCATT CTTATTACGA GGAGCCTCGT TTATTCCTGG

254901 GTGGATTGAT AACTTAACAG CTCCTGATGT GTTGTTTTCT TGGCAGACAT 

254951 CGATATGGTT TATTGGAAAT GAGTTCCACT TACTTCCTAT TCTATTAGGT

255001 ATAGTGATGT TCTTACAACA GAAGGTCACG AGTTTGCATA AGAAAGGACC 

255051 TGTTACGGAT CAGCAGAAAC AGCAACAAGT TATGGGGAAC ATGATGGCGA

255101 TTTTATTTAC CGCTATGTTC TATAACTTCC CTTCAGGATT AAACATCTAT

255151 TGGCTTTCGT CTATGATTTT AGGAGTCGTC CAGCAGTGGA TCACTAATAA 

255201 GATATTAGAT AGCAAACATC TTAAAAATGA AGTGGTTTTA AATAATAAAA

255251 AACATCGATA ATACTGAAAA AAAAGTCGAA TTGTCTTAAT ACAGGCTTAT 

255301 TGAAACAAGC GATTTATATG AGCTCATAGT TTATGCGAGC ATGGGAAGAA 

255351 TTTCTTTTGC TACAAGAGAA AGAAATTGGC ACAAATACTG TAGACAAGTG 

255401 GTTGCGATCT TTAAAGGTCT TATGTTTTGA TGCTTGTAAT TTGTATCTTG

255451 AAGCTCAAGA TTCTTTTCAA ATTACTTGGT TTGAGGAGCA TATAAGACAT 

255501 AAGGTTAAAT CTGGTCTTGT AAATAATAAC AATAAGCCCA TTCGTGTTCA 

255551 CGTTACTTCG GTAGATAAAG CAGCTCCTTT TTATAAGGAG AAGCAGATGC

255601 AGCAAGAGAA GACAGCATAC TTTACCATGC ATTATGGAAG TGTGAATCCT 

255651 GAGATGACCT TCTCTAATTT TTTAGTTACC CCTGAAAATG ATCTTCCTTT 

255701 TCGTGTTTTA CAGGAATTTA CTAAGAGTCC TGATGAAAAC GGAGGAGTTA 

255751 CTTTTAATCC AATTTATCTG TTTGGACCTG AGGGATCTGG AAAAACTCAC 

255801 TTAATGCAGT CAGCTATCAG TGTTCTTCGT GAATCTGGAG GTAAGATTCT 

255851 CTATGTTTCT TCGGATTTGT TTACAGAGCA CTTAGTCTCT GCTATCCGTT 

255901 CAGGAGAAAT GCAAAAATTC CGTTCTTTTT ACCGCAATAT TGATGCTCTA

255951 TTCATTGAGG ATATCGAGGT TTTTTCAGGA AAGTCGGCAA CTCAAGAAGA 

256001 GTTCTTCCAT ACGTTTAATT CTCTTCATTC TGAAGGGAAG TTGATTGTAG 

256051 TGTCTTCATC CTATGCGCCT GTGGATCTCG TTGCTGTTGA AGATAGATTG 

256101 ATCAGCAGGT TTGAATGGGG AGTTGCAATT CCGATACATC CTTTGGTTCA

256151 GGAAGGATTG CGCAGTTTCT TAATGAGACA GGTAGAGCGC TTATCTATTC 

256201 GCATTCAAGA AACGGCCTTA GATTTTTTAA TTTATGCGCT ATCTTCCAAC 

256251 GTAAAGACCT TACTGCATGC ACTGAATCTT TTAGCAAAGA GGGTAATGTA

256301 TAAAAAACTC TCTCACCAAT TACTATATGA AGATGATGTG AAAACTCTTT 

256351 TAAAAGATGT TTTAGAAGCA GCAGGAAGCG TTCGTTTAAC TCCTTTAAAG

256401 ATTATTCGTA ATGTTGCTCA ATATTATGGG GTCTCTCAGG AGAGTATTTT




256451 AGGACGTTCT CAGTCCCGAG AATATGTATT GCCACGTCAG GTAGCCATGT 

256501 ACTTTTGTCG TCAGAAGCTT TCACTATCAT ACGTGAGAAT AGGCGATGTC 

256551 TTTTCAAGAG ATCATTCGAC GGTAATCTCA TCCATACGAT TGATTGAACA 

256601 AAAAATAGAA GAAAATAGCC ATGATATTCA CATGGCTATT CAAGATATTT 

256651 CTAAGAATTT AAATTCCTTG CATAAGAGTT TGGAATTTTT CCCTAGCGAA 

256701 GAGATGATTA TTTAGAGGAG TAACGATCCT CTAATTTTTG CTTCTGGACA

256751 GTTGCAGCAG CCGGCGCAGA TATCATTTTC ATTGAGTTCA ACTCCGCTGT 

256801 TCTAGCTTGT TTAAATTGTC TAATATGATT GAAAGCAAGG ACAAGAGCAA 

256851 TACCAGCAAG TACTATCTGT ATAGCAAGAA CAATGCCGAT TCCTAAGGAT 

256901 AGGGTGACTC CACTGGCAAT AACCAGTCCT AGGGTGGTTA AGGAAGCAAG 

256951 GAGGAGACCT AGACTAACTA AAATAGGCTT AAAAAATACC CAGGAACAAT 

257001 AACGATGTGT TGCCTTGTGA GATACTGATG GTTGTGGTTG TGTAGTCTGA 

257051 GGTGTTTGTG CTACTGTAGC CATGTTGTTC TCCTGAGTAA AAAATTAAAT

257101 AATTTTAATC TGTCTATATA ATTGTTGTCA ATAATATTTT TATAGATATT 

257151 ATTTTAATTT TATTAAAATT TATCAACTAA TTTTATTTAT AGTCTTTCTC 

257201 TAGTTCAGAA CTATTAAGCT CTCTTAATAG GCTCTCTTTT ATTTACCAAG 

257251 GCGAAGTTCT TTTTTTTTTT TCAAATTTCG CATATGGAGG GTTCTAGATG

257301 GTATTTGAAA TGGTTGCATT GTGGAAGATT TTTCGAGTTT TGATAAGAAC 

257351 AAAGTCAGTG TTGACTCTAT GAAACGGGCG ATTTTAGATC GTCTGTATTT 

257401 AAGTGTTGTA CAATCACCAG AGTCCGCATC TCCTAGAGAT ATCTTCACAG

257451 CTGTTGCAAA AACTGTTATG GAATGGTTGG CCAAGGGGTG GCTGAAAACT

257501 CAAAATGGCT ACTATAAAAA TGATGTAAAA AGAGTTTATT ACCTTTCCAT

257551 GGAATTTCTC TTAGGGAGAA GTCTAAAAAG CAATCTTTTG AATTTAGGAA

257601 TTCTAGATTT AGTAAGGAAG GCACTAAAAA CTTTAAATTA TGACTTTGAC

257651 CACCTTGTAG AAATGGAATC CGATGCAGGA TTAGGAAATG GTGGTTTGGG 

257701 GAGACTGGCA GCTTGTTACT TGGATTCTAT GGCTACATTA GCAGTTCCAG 

257751 CCTACGGCTA CGGTATACGC TATGATTATG GTATTTTTGA TCAGAGGATC

257801 GTCAACGGGT ATCAAGAGGA AGCTCCTGAC GAGTGGCTAC GTTATGGAAA

257851 TCCTTGGGAA ATCTGTAGGG GAGAGTACCT CTATCCCGTA CGATTTTATG 

257901 GAAGGGTCAT TCATTATACC GATTCTCGAG GGAAACAGGT GGCAGATCTT 

257951 GTCGATACCC AAGAGGTATT GGCGATGGCT TATGATATTC CGATTCCTGG 

258001 GTACGGTAAT GATACTGTAA ATTCTCTAAG GCTATGGCAA GCACAATCTC 

258051 CGCGAGGCTT TGAATTCAGC TATTTTAACC ACGGGAACTA TATCCAGGCT 

258101 ATAGAAGATA TCGCCTTGAT AGAAAACATC TCTCGCGTCC TCTATCCTAA 

258151 TGATTCTATT ACTGAGGGGC AGGAATTGCG TCTCAAACAA GAGTATTTTT 

258201 TAGTTTCAGC AACCATTCAA GATATTATCC GCAGATATAC AAAGACACAT 

258251 ATTTGTTTGG ATAACCTTGC GGATAAAGTC GTAGTACAAT TAAACGATAC 

258301 CCATCCCGCT TTAGGGATTG CTGAAATGAT GCATATTTTA GTCGATAGGG 

258351 AAGAATTACC TTGGGATAAG GCTTGGGAGA TGACTACAGT CATCTTTAAC

258401 TATACCAATC ATACAATCCT CCCAGAGGCT TTAGAGAGAT GGCCTCTCGA 

258451 TTTATTCTCT AAGTTATTAC CTCGGCATTT AGAGATTATT TATGAAATAA 

258501 ATTCCCGTTG GTTAGAAAAA GTTGGCTCTC GCTATCCTAA AAATGATGAT 

258551 AAGCGCCGGT CTTTATCCAT TGTTGAAGAA GGGTATCAAA AGCGTATCAA 

258601 TATGGCAAAC CTTGCCGTAG TAGGTTCTGC AAAAGTAAAT GGAGTTTCGT 

258651 CATTCCACTC TCAGCTGATT AAAGATACTC TCTTTAAAGA GTTTTATGAG 

258701 TTTTTCCCTG AGAAGTTTAT CAATGTGACC AATGGGGTGA CTCCACGACG 

258751 ATGGATTGCT CTCTGTAATC CTCGTTTGAG TAAGCTTCTC AATGAAACTA 

258801 TAGGGGATCG TTATATCATT GATCTTTCTC ATCTTTCATT GATCCGTTCC 

258851 TTTGCCGAAG ATAGTGGTTT CCGAGATCAT TGGAAAGGGG TAAAATTAAA 

258901 AAATAAGCAG GATCTAACAA GTAGAATTTA TAATGAAGTT GGAGAAATAG 

258951 TAGACCCTAA TTCTCTCTTT GACTGTCATA TTAAGCGTAT TCATGAGTAT 

259001 AAACGACAAC TAATGAATAT TTCTTAGAGT CATCTATGTT TATAATGACT 

259051 TGAAAGAAAA CCCTAATCAA GATGTCGTCC CTACAACAGT AATTTTTTCT 

259101 GGTAAGGCGG CTCCTGGCTA TGTCATGGCC AAACTCATTA TCAAGTTAAT

259151 CAATAGCGTT GCTGACGTTG TAAATCAAGA TTCTCGAGTT AATGATAAGC

259201 TTAAGGTTCT TTTTTTACCT AACTATCGAG TTTCTATGGC TGAGCATATC

259251 ATTCCTGGTA CAGATCTTTC AGAACAGATT TCTACAGCTG GAATGGAGGC

259301 TTCTGGAACA GGAAATATGA AATTTGCTTT GAATGGAGCT CTGACTATAG

259351 GAACTATGGA CGGTGCAAAT ATAGAAATGG CAGAGCATAT TGGTAAGGAG

259401 AATATGTTTA TTTTTGGTCT TTTGGAGGAG CAAATTGTAC AACTGCGGAG

259451 GGAATACTGT CCTCAGACAA TTTGTGATAA GAATCCTAAG ATCCGTCAGG

259501 TTTTAGATTT GCTAGAACAG GGATTTTTCA ATAGCAATGA TAAAGATCTG

259551 TTTAAACCGA TAGTACATCG CCTACTGCAT GAAGGAGATC CCTTTTTTGT

259601 CTTGGCTGAC TTGGAGTCTT ATATCGCTGC CCATGAAAAT GTGAACAAAC

259651 TCTTTAAGGA ACCAGATTCA TGGACTAAGA TTTCTATTTA TAATACTGCA

259701 GGAATGGGCT TTTTCTCTAG TGACAGAGCC ATTCAGGATT ATGCCAGAGA

259751 TATTTGGCAT GTTCCTACAA AATCTTGCTC TGGAGAAGGA AATTAAGAAA

259801 TAGACAGGAT GGAATCAGGG ACTCTTTCCA TAGCCAAGGA GCTATAGAAA

259851 GAGTCCTTTT TGTTCAAAGA TTGCTAGTTT AATAGTAGGA CAGCCGGAGC

259901 TTCTAAGATC TTTTGTAATC GTTTCATAAA CATCGCAGCA GGATAACCAT

259951 CAATCACTCT ATGATCTACA GATAGGGTAA GATTGCAGGT AGATCCTATA

260001 GTAATTTCTC CGTCAAGAAC AAGAGCTTGT TCTGTAACAC TTCCTACGGC

260051 AAGAATCGCC GCTTGAGGAG GATTGACAAT CGCTGTAAAT TCAGTGATTC

260101 CTGTCATTCC TAAGTTAGAG ACACAGAAGG ACCCTCCTTT GTATTCAGTG

260151 TCTTGAAGAG ATTGATTTCT TGCTTTTAAC GCTAAGCTCT TAATTTCTGC

260201 TGAAATCATG CCGAGATTTT TACGGTCTGC GCAGCGTATA ATTGGCGTAA

260251 TAATTCCATC TGGAATGGCC ACAGCTATCG AGATATCGAT AGTATCAAAA

260301 CGGACGATTT TATTATCGAC ACTGTTAAAT CCTGAATTGA TAGAAGGGAA

260351 CTCTTTGAGC GCCAGAGCAC AGGCACGTAC AATGCAATCG TTAATAGAGA

260401 GTTTGATTCC CTGAGCTTGA AGTTCTTTGA GCAGATTAAG GAGAGGTGAG

260451 GCGTAGACCT GCTGCCTTAC ATAGAAGTGA GGAATAGAGA TCTTAGCAGC

260501 TTGTAGGCGT GCAGCAATCA CTTCCCGAAT CGGAGAGAGA TTCTCCTCAT

260551 GATAGGAACC TGGAGGCACT TCGGGAGACT CAGGATAGCC AAAACCAGCA 

260601 ATGCTTTTAG GAGGAGCTTT CTCTAAATCT TTTTTTACTA TACGTCCTCC 

260651 AGGACCACTC CCTTGAATTG ATGAGACATC TATGTTTTTC TCTTTTGCTA 

260701 GTTGTCTAGC TAATGGAGAG AGATTATTCG TAGTGCCTAC GTGTTTGAAG

260751 ACTAAAGGCG AGGAGAGAGG TGGCTCTGGC TTAAAAGTTA CTGCTGTGAA 

260801 TGTTGCTGAG GCAGCTTGTG GAGTTGTTGC AGGCGAGACC TCTTCAGAAG 

260851 AACCTTTTGG AGATGCTTCA AGGTTAGAAG GTTCTGTCTT AGGAAGAAGT 

260901 TCTTCTAGAT TAAAGGGCTC GTTGGCTTCT GTAGAGAGTA CCGCAATAGG 

260951 GGTGCCTATA ACGATTTTCT CGCCTTCATG ACGTAAGATT TCACGAATCC 

261001 AGCCATCTTC ATTTGCTGTA TGTTCTAAAA TAGCTTTGTC TGTAGAGATC 

261051 TCTACAATGA CGTCTCCAAA ACTGACCTGA TCATTACTTT TTTTATGCCA 

261101 TTTCACTATA GTGCCCACTT CCATAGTTGG AGAAAGCTTT GGCATTTTCA 

261151 ATAAGGAGAT CACAAACTTA CCTCATGACT TTTTCAATGG TATCTAAGAT 

261201 TCGGTTAACA TTAGGCAAAG TGGCCTGTTC TAAGATTTTA CTATAGGGCA 

261251 TAGGCGTTTC TTTTTGGCAT ACCCTTAAGG GGGGAGCATC AAGAGAATCA 

261301 AAAACATGCT CAGTAATCAG GGCAATAATT TCAGAAGAAA TCCCAGCGAA 

261351 GTAGTGGCCC TCTTCAATTA CAATACAGCG TGAAGTTTTT CGTACCGATG 

261401 ATAAAATTGT TGATATGTCT AAAGGTTTGA TCGTTCTTAG ATCAATAATT 

261451 TCTATAGACA AGCCCCAACG TTTTTTGGCT AGAGAACACG CTTCTTTTGT 

261501 AATGGAAACC ATACGGCTAT AAGTAATAAT TGTAAGGTCA TTTCCTTCTT 

261551 GAACTCTATG TGCTTTCCCA ATAGGAACGA GATATTCTTC GGTGGGGACT 

261601 TCCCCTTTTA AGTTATATTC TAGCTCGTTT TCTAAAAAAA GAACGGGGTT 

261651 ATTATTTCTG ATTGCTGATT TTAATAAGCC TTTAGCGTCG TAAGGGTTCG 

261701 AAGGGGCTAT AATAATAAGA CCTGGAATAT TAGCATACAA CGACTCAACG 

261751 CAATGAGAAT GCTGGCAAGA TACCTGGGCT GCAGCACCAT TAGGGCCACG 

261801 AAAAACTATA GGAACGGAAA ACTTCCCTCC AGTCATAAAA TGCATCTTAG

261851 CTGCATGAGA AATGATTTGG TCCAAGGCTA CAAAGGAAAA GTTCCAGCTC

261901 ATAAATTCTA TAATAGGGCG CAGGCCTGAC AATGCGGCTC CTATTCCAAT 

261951 TCCAGAGAAG GCTGCTTCAC TAATAGGAGC ATCAATGACT CTCTTAGGGC 

262001 CCCATTTATC TAATAAGCCT TTGGTGACTT TATAAGCACC ATTGTAGTCA

262051 CCAACCTCTT CACCAAGAAT ACAGACATTA GGATCGCGAG ACATCTCTTC 

262101 GTCAATTGCT TCTCGGAGAG CTTCTCGAAT TTCTAATGTT TTATGTTTAG 

262151 GCATAGACTC CTTCCTCTAA TGTGGTGACG GATGGATCTG ATGAGAGTTT 

262201 TGCGTTAGAG AACGCTTCTA AAACAGCAGT TTTGCATTCT TGGCGTATAT 

262251 TTTGAAATTC CTCTTCAGTC AGAACCTCTA ATCGAATTAG CCAATCTTTA 

262301 GCTAGGACAA TAGGATCTTT TTTAAATAAA CACTGCATTT CTTCTTTCGA 

262351 TCTATATAAA TTAGGATCTG ATATAGAATG CCCTCGAAAT CGGGAGCAGA 

262401 GACACTCAAC TAAAACCGGA GATTCGGTAT CAACCATATA GCGATAAGCC

262451 TCTCTAAATC CTAAAAGAGA GTTAAATAGA TCAAAACCAT TGACTGTGAC

262501 TGCACGGATA TCGTAGGAAC TTCCTTGAGA CTCTGCTATG GGCTGTTTTG 

262551 CAACAGCACG ATTTAATGAC GTTCCCATAC TCCAGCCGTT ATTTTCAATA 

262601 ATAAGCATTA GAGGGAGTTG GTGAAGAGAA ACAAAGTTCA GAGTTTCATG 

262651 GAATACACCT TGAGCTACCG CACCATCTCC GATAAAGCAT AGAGAAACTC 

262701 TATTTTTTTG TTCTTGATAT TTGATGGTAA ATGCGGCTCC AGCTGCGAGG 

262751 GGAATTTGTC CTCCGACAAT ACCAAATCCT CCAGGGAAAT TAGGCCCACA 

262801 CATATGCATG GATCCTCCAC GACCTAAAGC GCATCCAGTT TCTTTCCCTA 

262851 AAAGTTCAGC AGCAATTTCT TGAAGGGGAA TGTTGAGAAG AATCGCAAGT 

262901 GCGTGGCAGC GGTATGAAGA GAACACCCAG GGATCTAGTC CTGTGTTTGC 

262951 GATTGCAGCA GTTGCTACAG CTTCTTGGCC AGCGTAAGAG TGGTAAAATC 

263001 CACCCACTAG CCCTTCTAGA TAGGCTTCTT CTCCTCGGGC TTCGAATTCA 

263051 CGAATCAGAA CCATCTGTTT TAAAAATTTA ATACAGGAAG CGGGCCCGTA 

263101 AAGGTCTAAG ATCCTTTCTA CTGTGGATTT CTCTGTGCCC TGAGAAGCTA 

263151 TATTATAAGG TGCTGAACTA TCCATAACTT TTTTATAAGG GAGACGCTTC

263201 GGGAGCGGTT TTTGATCTCA TTATAGGAGC CTAGGTTTCA AAAGACAAGG

263251 GCTATCCGAA GAAAAAATAA TTAAGTTTTT TTAAAAAATC GAAAAAATTT 

263301 TAATATGTTT CTCGTGGCAA TTTAAAGAGA ACAAAAAGGA TTACATCACA 

263351 TGGACGTTTC TCGTAAAATT AATCGACACA CTCAGTTTTA CGTGGATTCG 

263401 ATAGACGGTG TCATCAAAAA CTTTGATCAT AAGCCTAGTG AAGATAAATC 

263451 TCGAGATCAT GAAGAATTAG AGGAAAAGCT TTTAACTATA ACAAAACGTA 

263501 TTGTTGCTTC TGCTCAAGAA TTTCAGAATC GTAAGACGGA CTCTAAAAAC 

263551 TACTACTTAA AAAAAACTCA ATGGTTGCCA TTCAAAAATG AAGAGTTGGA 

263601 GCAAACTAAG GAATTGTTTG CCATGTTAAC TTCGATGGAT AAAAAAATAG 

263651 CTCAGTTATT TTTTTATTCT CCCGGATGTA GTAGCGACTG GGTAGAATTT 

263701 ACCGAAGTAA TTTGTCATTT AAATGATTCT ATAGGTTTGG GAGGGGTTTT 

263751 ACTGTGTTGC GGATTATTCG AACAACAGTG TGAGCATGTT GTTACTGTGA

263801 ATAAAAAGCT AGACCTACCG CTTCTTTTAG GAACTACCGT TGTAAATAGT 

263851 CTACGTTATT ATTTAACCTA TAGGAATATC TCTCTTTTGA ATTGTCAGAG 

263901 CATGAGTGAG CTAGGCAAGG AGCTCGGGGA TGTTTTAAAG CAACATGGAG 

263951 TTGCTTTCAC TTTAATTTTT AAAGAGATTG TGGATATAGA TTTATTAAAC 

264001 TATGTGAAAC TAATTCAAGG ACTCAAGCGT AGTGGAAATA TTCAAGCACG 

264051 TATTTATGAT AATGATGTGC CTACTTTACC TTCAGTTTCA TCGAGTCCTA 

264101 TTGCGTTGAG ATATAGCTTA GCGAATACAA TTCGGGGCCT TGCTTTACAT 

264151 GTAGATTTTT CTTCATTGAA GTTTATAAGC CCGTCTATAC TTTCCAATAC 

264201 AGAGCACACT GCAAAAGCTT TAAACTCTGG CGGAGAATGT TTTATCTTTT 

264251 CTAATCTGGA TGAATTCAAT CTTGGAATGA AAATAGTCAT GCAACTATTG 

264301 CGGACAGGGA AGATTAGCCC AGAAATTTTA AATAAAAATA TCATGAAAAT 

264351 TCTCATGATA AAAAGAAGAG TACGTTCTTT ATATATTTAA TTATTTGGTC

264401 TTTTTTTTGA TTTCAAAACT AATAAAATAA ATAGAATTTT AGATCTCTGA 

264451 AGGAATCTCA GCAAGGCTGG GAGTCGATAG ATCTCTTACT TGTTTTTCTA 

264501 ACTTACTTAG TCTTTCTTCA GTTTTAGGAA GGTTCCGAAT TTTAGCAATC

264551 AACCGATGTG TTTCTTGATA AGGTCGTGCT GGAGCGCCTC CATAAATGCC

264601 TGGAGAGGTG ATAGATTTTG TGACTCCAGT TTGAGCAATC ATGATCACAT 

264651 GGTCTGCAAT AGAAATATGC CCAGTAATTC CGGTTTGCCC TCCAATGATG

264701 ACATGTTCAC CAATTTTTGT AGAACCTGCA ATGCCTGCTT GGGCAACAAT 

264751 AATACTATGC TTTCCAATTT CTACGTGATG AGCTACTTGT ACTTGGTTAT 

264801 CTATTTTAGT TCCTTCATGG ATCACGGTGT TCTTGAATCG ACCACGATCT 

264851 ATCGTAGTGT TGGCTCCGAT TTCTACATCA TCACCTACAA TCACATAGCC

264901 TAGATGCTTT AAAGGTTTGT GATGACCAAA AGCATTTGTA ATATAACCAA

264951 AACCACAGGA TCCTAAAACA GCTCCAGGTT GAACAACTAC ACGGTTTCCC

265001 ATGAGGACTC TTTCTCGAAT CACCACCTTA GGGTGAATCA GACAGTTAGC

265051 ACCTAGAACG CTGTGAGCTC CAATGACACT TCCAGCTCCG ATGTATGTGT 

265101 CAGAGCCGAT ATGGGCATGT TGACTAATGA CAACGTAAGG TTCTATGGTT 

265151 ACATTTTTCT CAATACGTGC AGTAGGATGA ATCACTGCAG TAGGATGAAT 

265201 ACCAGGAAAC CCTGATGTTA CGGGTTCAAT AAACAACTCT ATGCACTTTT 

265251 GAAATGTTAG AGAAGGGGAT TCATTGGTAA TAAGAAAGTT TTTCTTTAGG 

265301 TGGGCATGTT GCATTGCCTG AGATCTAGAT AAAATAATAG CACCAGCTTT 

265351 GGTGTTTTTT AGAAAGCTAG AGTATTTCTC ATTATCTAAA AAAGCAATAT 

265401 GGTGAGGTTG CGCCTGACTA ATATCTTCAA CACCTGAAAT AGGAGTTTCT 

265451 ATATTTCCTT GAACTTCGAC TTGTAGTAGC TCAGCTAACT GTTTAAGAGT 

265501 GTAGACTGGT GCTTCGGACA TAGAAAACTC CTTAAACTTG GACTAGTTTT 

265551 GTTTTTTGAA AGATTCGTTA AGAATAGCAA TAATTTCGGT TGTTTTATCA 

265601 GTCCCAGGTG CTATTGCTAA GACAGGTTCT TCATTAAGGA TAGCTTCTAG 

265651 TTTTTCTTTG GACCGCACTG ATTCTGCAGC TATTTTTACT TCTTGAATGA

265701 GTTTTTGAAT GCGTTTTACA TGTACTTTGA TTGATAGATT GATAGTACTG 

265751 AGACTGGTAC GCATTGTACT CTCCTGAAAG ATCTTCGAAT TTCTTTCGCA 

265801 ACTCTTCAGA GGCAGAATCC GATAGGCTTT CCATGTAATC TTCATCTTGC

265851 AACTTATTAT AAATAGAAGT GAGTTCTTCT TCTATTTTCT CAGCATTTTT

265901 TACAAACTGC TGTTTCATAG CTTCCAATTC TTCAGTTTCC TTTTTACCTA

265951 GATCGGATTC TTCAAGACAT CGCTTTAAAT TAACATAGCC TAAATTTGCA

266001 TGAGCTGCGC TTGTTGATCC TAAAACAAGA AGAAATGTAG AAAATAATAA 

266051 TTTTTTCATA ATTACCTGTA CTAACTTGGG TATAGAAAAA GGGTACCAAA 

266101 AGCCTTTTCT GAAAACAACA AAGATTTCCT TCGATAAGTC CTTAATTTAT 

266151 ATCTTAGAAC ATGCCCCCTA AAGCAAAGAA GAATCGCTGA GATACATCAA 

266201 TTTTTTCTCC ATTCAAAGTC TCGGTTGGAC GGAAGGGCCA ACCAAATCCT 

266251 AACATAACAG GAACATTATT CATTACATCG AAGCGCAGAC CAAATCCAGC 

266301 ACTACTACGT AGATCTTTTA ACGAAATCTT ATACTCTTGT AAACCGACAA 

266351 AACCTGAGTC TAAGAATACA AAGGCACTAA TATTAGGTTG TCTGATGAGA 

266401 GGGTATTGAA ACTCTTCTGA AATAAGGAGC GAAGAGAGTC CTCCCTGAGG 

266451 TTCTGTAGCA GAGTATTTTG GACCGATAAT AAAGGATTTA TATCCCCGAA 

266501 CTGTAGTCTC TCCACCTAGG AAGAAGCGCT CACTGACAGG AACTCCTTCA 

266551 GCTGTAGTAT TGCTATAGGG TTTAATAAAT TGAGCTTCCC CTTTGATTTT 

266601 CAAAATACCT TTACGCGTAA GTTTTCTATA GATAGAGCTG TTTAAAGAGA 

266651 GTTTTGTAAA ATGATAAGTT CCTCCCAAAC CAGAAACCTC AAAAGTCACC 

266701 CCCCCGCGAA TCCCTGTAGT TGGAGTTCTA GGACTATCTA CAGAATCGTA 

266751 ATTCAAGTTG ACACCTGCAG CAGAGACAAA TCCTTTATTG CTGTCTATAT 

266801 TTGGCCCTAG GAGGAACTTA CGTTTTTCAT GTAAACTCGT TTGACTTCCT 

266851 CGATAAAATA GACCGTATTT CAGGTGTTCG TTCAAGATAT ACGTTGTGCT 

266901 GACGTTCCCG CCATAGGTTT GGACAGCATA ATCTTTAGAT AATGCTCTGT 

266951 TAATTGATTT ATCTAATTCA ATTCCTAAAA TCCAAGGAGT GTTTAGAAAA 

267001 TGAGGTTTGG TCCACTTCAA AGTATAGTCT GTGACTTTGT CCCCGAAGTT

267051 GGCTTTTAAG AATAGATGTT CTCCACCGCC TCTTAGACAA CGAAAACCTT 

267101 TAGAAAATAT ATTTCTAGCT CCAAATAGAT CAAAATTACT TTCAGATAGT 

267151 TCAATTCCTC CAAAAAGATT GTCAAGAGAA CTAAATCCTA AGAATAAGCC 

267201 TAAGTTTCCT GTTGTTGTTT CTTTGACTTC TACAAAAATA TCTCGGTATT

267251 GATCCGCATT GCCCATAGGA TCAAGTTGAG AACGAACTGT ATAGACACTA

267301 ACGCTTTGGA AGTAGCCTGT ATTTCTTAAA CGTTGCTCAG TATCTTCTAG 

267351 CTTTAAGCGA TTGAATGTAT CTCCTGGGAA GAGACTGGTT TCGTGTAAAA 

267401 TAACGTCAGA TTTTGTATGG GTATTCCCAG TAATTTTAAT TAACCCAACT

267451 TTATAAGGAG ACCCTTCACT TACCTCATAA GTTACATCAT AAATAGGGCG 

267501 GGTTGCGTGA GGGATGAAGA GAACGTCTAC ATTGGTATTG ATGTAGCCAT 

267551 ACTTTGCATA AGTTTGTTTG ATCTTATGAG CCCCATCCCA TATTTTATCG 

267601 GGGCAATAAA GATCATTGGG GCCGACTTGG GATTGCTTTT CTATAAGGCG 

267651 TTTTGGCAAA ACCTCAAACC CTTGGATATG GACGTGTCCT AAGGTATATC 

267701 GCGACCCTCG ATCAATATCC ATGTAAAGAA GAATATTCCC TTTGTCGTCA 

267751 AGGTCATAGT GAGAGTTGAC TATAGCATCA GCGTACCCGT TATTATGTAG 

267801 GTAATTCGTA ATTGCCAAGC TATCTTGTTC AACAATATCT GGGTGATAGA 

267851 GTCCAGCTCC AGTAAACCAA CTTGTAGTTG TAGAGTGCTG CTTGGTTTGA 

267901 ATAAATTCTT GGATATCTGA TTTTTCTGAT CGAGAGATTC CTGAGAACGT

267951 AAGCTGTTTA ATTTTCCCGC AAGGACCTTC ATTGATTTTA ATTAAAACAT 

268001 CGATGTGACC TTTTTCTTGA TTGTGTTCCA GACTGTAGTC TACACTGGAT 

268051 GCGAAATATC CTCGCTTGAG ATAATACGTT CTTAGATCAT CAAGACCCTT 

268101 AAGAAATTTT TCTCGTTCAA AGAGATCATT ACGGTAAATT TGTAGGGTTT 

268151 TAAGAATTTT ATGTTCAGGA ACGACTTGAT TTCCTGAGAT ATGAATATTT 

268201 CGAATTGAGG GTTTAGCTAT TAGGTGAAGG GCTATGTTAG TTTTCCCTTC 

268251 AGAAAATTCT ACTTTAGGCT CAACAGAGTC GTATTCTTTA GCTAGAATTC 

268301 TCAAGTCTTC ATCAAAATCT AATTGAGAAA AAAGAGCCCC ACTTCTGGTC 

268351 TTTAATTTGG GTAAGGGATG TTTATTTGAA GCATTTTCTC CTTCCGTTAT 

268401 GATTGTGATA GAGTCTACCA CCACATGGCC TTCTTTAACT TTTTCAGTAG 

268451 AAAATAAAGT TAAAGGGGTT TGGATTAACG CTAGAATAGA TATTTGCAAG

268501 ATAACTTTAT TTCGCATGAT GAGCATTCCC AAGAGTCTTC CCTAGATAGA 

268551 AAGCTTGTTT ATCAGATAAT GAGGTAAGCA CAAAAATAGG GACAAAGAAG

268601 TTTTTTAACA AGACCTTACT ACTTAGATTA TTTTTGTATT TCTAAAGAAA

268651 TTTAAATTGT GTTTGATTTT TTCCCGTCCT AACACCGGGA TTATTACTGT 

268701 GCTCAGACTA GAGCCTGAAT ATTTCGGCGG GTTTTTTGCT TCCTTAAAAA 

268751 GCTTAATAAA CCTTGTCGCT CAAATAACTC TACTGATTAC ATGGTTTTAA

268801 GGCAAATTCG ATAATTCACT GTGTCCTAGG GGAAAGTAAG AAAGACTAGC 

268851 ATAGAAGAGA ACCTTGTTTA CTTTTAGGAT AAGAGAGCTG CTAATAGGAG 

268901 TGTCGTCCAG AAAAAGCTCT TGCCAGTGTC CCTGAATCTA CATAATCAAA 

268951 AGATAAGCCT ATAGGAAGAC CTAAAGCTAG ACGGGAAATA TTTACAGAGA 

269001 AATGTTGTAA TTCTTGTTTT AGAAAAAGGG CAGTAGCATC TCCTTCTAAG 

269051 GTTGCATCAA TGGCTAGGAT AATTTCTTTT GGGCATAGCG TTTCTATGCG 

269101 TGATTTTAAA ATGGAGAGAC GCTCGTTTTC TATATGTTTC CCTGTAATGG 

269151 GCGATAAGAG TGAACCAAGA ACATGATAAC GTCCCTTGAA TACTTTAGAA 

269201 CGTTCTAGAA AGAAAACATC TTTTGGAGAA GCGACAATAC ATAGACTTTG 

269251 GTTATCTCTT TCTTCTCTAC AAAAGTGACA GTCTGCCTCT TTAGATTCTT 

269301 TGAGAGTAAA ACATAGGGGA CAGTGACTAC GCTCACTAGC AACATTATGA 

269351 AAAGCGTTAC CTAATATTTT TAATTGTTCG CTGTCCCAAG AGATGAGTTC 

269401 AAAAGCAAGT TTTTCTGCTG TTTTAAATCC AATTCCTGGA AGTTTTCGTA

269451 AAAAGAAAAT TAATTTAGAT AAGTAATCTG GATATCTTGT CATAGTGATG

269501 TGAATTTTAT TTTTACATTC GGGTCTGGGG CCTAAATTAA GGTTAGAGTA 

269551 TAAACTCTCT GAATAGTATA CTAGCTTTTT TCTTTACATG TGGTTCTCTG 

269601 TGAATAAAAA CAAAAAAGCA GCAATTTGGG CAACGGGTTC CTATTTGCCT 

269651 GAGAAAGTTC TTTCAAACGC AGATTTAGAA AAAATGGTAG ATACCTCTGA 

269701 TGAGTGGATC GTGACCAGAA CGGGGATCAA AGAGCGTCGT ATTGCTGGAC 

269751 CTCAGGAGTA CACTTCTCTT ATGGGAGCCA TCGCTGCAGA GAAAGCTATA 

269801 GCAAATGCGG GTTTAAGCAA GGATCAGATT GACTGTATCA TTTTCTCGAC 

269851 AGCAGCACCA GATTATATTT TCCCATCAAG CGGAGCTCTT GCTCAAGCAC 

269901 ATTTAGGCAT TGAGGATGTC CCTACATTTG ATTGCCAGGC GGCTTGTACT

269951 GGGTATTTGT ATGGTTTGTC TGTAGCTAAG GCTTATGTAG

270001 ATATAACCAT GTATTGTTAA TTGCTGCTGA TAAGTTGTCT 

270051 ATTATACAGA TCGGAATACC TGTGTGTTGT TTGGAGATGG 

270101 TGTGTCATAG GGGAGAGTCG GCCAGGATCT TTAGAGATTA 

270151 TTTAGGCGCA GATGGTAAGC TAGGAGAGTT ATTAAGCCTT

270201 GTAGTCGTTG TCCTGCTTCT AAAGAGACTT TACAATCAGG 

270251 ATTGCTATGG AGGGAAAAGA AGTTTTTAAG CATGCTGTGA 

270301 AACGGCAGCT AAACATTCGA TAGCCCTGGC AGGCATTCAG 

270351 TAGATTGGTT TGTACCTCAT CAAGCTAATG AAAGAATAAT 

270401 GCGAAGCGTT TTGAGATTGA TGAGTCTAGA GTGTTTAAGA

270451 GTATGGAAAT ACTGCGGCCT CGTCTGTGGG CATTGCTTTG 

270501 TTCATACAGA ATCCATTAAG CTTGATGATT ATTTACTTTT 

270551 GGGGGCGGTT TGTCTTGGGG CGCAGTAGTT TTAAAGCAGG 

270601 ACGATAATTT CATGAAAAAA CGTTATGCTT TTTTGTTCCC 

270651 AGCCAATATG TAGGTATGGG ACAAGACCTA TATATGGAGT

270701 TAGAGAGCTT TTTGATTTTG CTAATGAAAG GTTAGGATTT 

270751 CAATTATGTT TGAAGGTCCT GAGGATCTTT TGATGGAAAC 

270801 CAGCTAGCTA TTTATCTTCA TAGCATGGCT GTGGTAAAGG 

270851 GCGTTCTTCT ATTCAGCCTT CTTTAGTCTC TGGATTAAGT 

270901 ATACTGCTTT AGTTGCTTCC GATAGAATCT CCGTGCTCGA 

270951 CTTGTTAGAA AGCGTGGTCA GTTAATGAAT GAAGCTTGTA 

271001 AGGGGCTATG GCGGCTTTAT TAGGGCTTCC CTCTGAAGTT 

271051 ATATAACAAG TCTTGGTCAA GGAATTTGGA TTGCTAATTA 

271101 AAACAGCTTG TAGTGGCTGG AATAGCAGAA AAAGTAGACC 

271151 GTTATTTCGT GATTTAGGAT GTAAAAAAGC AGTTCGTTTA 

271201 GAGCATTTCA TACTCCTTTA ATGCAAGTTG CTCAAGATGG 

271251 GACATTTATG CTTTATGCAT GAAAGATTCT AGCCTTCCCT
AATCAGGTAC

271301 CGTGGTAGGA AAATCTTTAG TAAATACTGA AGAAATGCGA GAGTGTTTAG

271351 CTCGGCAAAT GACATCACCT ACGTTATGGT ATCAGAGTTG TTACCATATC 

271401 GAATCAGAGG TGGATGAGTT TTTAGAATTA GGTCCAGGAA AAGTTTTGGC 

271451 TGGTTTAAAT CGCTCTATAG GGATTTCTAA ACCGATTACA AGTCTTGGTA

271501 CTTTTGCTCA GATTGAAAAA TTCCTATCAG AGGTATGATT TGTATGGATA 

271551 TAACATTAGT AGGCAAAAAA GTTATAGTAA CTGGAGGATC TCGAGGAATT 

271601 GGACTCGGGA TAGTTAAGCT TTTTCTTGAG AACGGAGCAG ATGTAGAAAT 

271651 TTGGGGATTG AATGAGGAGC GAGGTCAGGC TGTTATAGAA AGTTTAACAG 

271701 GCTTGGGTGG CGAAGTTTCT TTTGCTCGTG TGGATGTGAG TCATAATGGT 

271751 GGAGTGAAAG ATTGCGTGCA GAAATTTTTA GATAAGCACA ACAAAATAGA 

271801 TATTTTGGTA AATAATGCAG GCATTACCAG GGATAATTTG TTGATGCGTA 

271851 TGTCTGAGGA CGACTGGCAA TCGGTGATTA GCACCAACTT GACTTCCTTG 

271901 TATTATACAT GTTCCTCAGT GATTCGCCAT ATGATTAAGG CGCGTTCAGG

271951 ATCTATTATA AATGTGGCTT CTATTGTTGC TAAGATCGGT AGTGCGGGCC 

272001 AGACCAACTA TGCTGCTGCT AAAGCTGGGA TTATTGCTTT CACAAAATCT 

272051 TTAGCTAAGG AAGTAGCTGC AAGAAATATT CGTGTCAACT GCCTTGCTCC 

272101 AGGCTTTATT GAAACAGACA TGACAAGCGT GTTGAATGAC AATTTAAAAG 

272151 CTGAGTGGCT TAAGTCGATC CCTTTAGGTA GGGCTGGCAC TCCAGAAGAT 

272201 GTTGCTCGTG TGGCGTTGTT TTTAGCCTCG CAGTTATCGA GCTATATGAC 

272251 CGCGCAGACA CTGGTTGTTG ATGGGGGATT GACTTACTAA GACAATAGAA 

272301 GAAAGGGATT TGAAAATTTC TCTTCGAGAA CTAATTAAGT AACCGTCGAA 

272351 TAAAAAATGA TTTTTTGCGA TACTAATTCT CTTTCTCTTT GTCCCTAGGG 

272401 AATAGTGAAG TATTGTATAG TTTAAATAGT AAAAGGATAT AAGCAATGAG

272451 TTTAGAAGAT GATGTAATAG CAATTATTGT TGAGCAGTTA GGAGTGGATC 

272501 CAAAAGAAGT TAATGAGAAC TCTTCTTTTA TTGAAGACTT GAATGCTGAT 

272551 AGTTTAGATT TAACAGAATT GATTATGACT TTAGAAGAAA AATTTGCTTT 

272601 TGAAATTTCA GAAGAAGATG CTGAGAAGCT TCGTACTGTC GGGGATGTAT

272651 TTACTTATAT TAAGAAACGT CAAGCTGAAC AATAAACTTT CTTATATTCT

272701 GGGTGCGTCG TAATCTTTTT TAAAGCTATT GTTTTTCAAT AGTGATTACG 

272751 GCGTGCCTTT TTTTCTATAG AGTAGCTCAT CTAGAAAGAT TTATTTTGTC 

272801 TTTTTAAGGT TCTCTGAACT TGATTTGTTT AGCATAGAGC TCTAAAAAAG 

272851 ATAAAGCTAC GGATGGGCAC TCTTCCACAA fGTTTAGAAT TTGTCCTTTG 

272901 CTAAGAACTA GCATGCGGAC TTGTGTATTT GCAGAAGCAT TGTATTCCCT 

272951 GGGCTTATTA TTGAATAAGC TTTCCTCTCC AAAACAATCT AAAGGTTTTA 

273001 AATTTAGAGG AGACTCTAGT TTTTCTTTAG AGATCGTAAT GTATCCTTCT 

273051 ACAATGATAT AAAAGCTGAA TCCAGGTTGT CCTATAGAGA ATACATTGCT 

273101 GCCAGGCTTA AATATTATCG TTTCAGTTTT ATCGGCAATT GTTAAAAGAA 

273151 GGTCCATGTC TAAAGATTGG AATATAATCG TTTTTTTTAG TAGAAAGGCG 

273201 CGATCGATCA AATTCATAAA AAAGTTCCTT ATTCACACCA TAGTTTTAGT 

273251 TTTT